“…Finally, networks can be classified according to the input feature used, such as the complex-valued multichannel STFT [27], its phase [24], or the GCC-PHAT between all microphone pairs [13], [25]. If the input feature consists of the output of a classical signal processing method, such as the SRP maps shown in Fig.…”