Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization [2201.01928]