“…The internal representations of the pre-processed signals have three dimensions: time, audio frequency, and modulation frequency. Recent studies have shown that the relative differences in the internal representation across the auditory channels provide critical information about the intelligibility of speech signals (e.g., Elhilali et al., 2003; Chabot-Leclerc et al., 2014; Carney, 2018; Scheidiger et al., 2018). Thus, the backend of the sCASP model was designed to assume independent processing across modulation channels while analyzing the contributions of all auditory channels simultaneously at each modulation frequency.…”
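
The backend structure described in the excerpt can be illustrated with a minimal sketch. The excerpt does not state which comparison metric is used, so the Pearson correlation between a reference and a test representation is an assumption made here purely for illustration. The function name, argument names, and array layout (time, auditory channel, modulation channel) are likewise hypothetical. What the sketch does reflect is the stated design: each modulation channel is scored independently, and within a modulation channel all auditory channels are pooled with time into one joint comparison.

```python
import numpy as np


def per_modulation_channel_correlation(reference, test):
    """Score each modulation channel independently, pooling all auditory channels.

    Both inputs are hypothetical internal representations with shape
    (time, auditory_channels, modulation_channels). The correlation metric
    is an illustrative assumption, not taken from the excerpt.
    """
    reference = np.asarray(reference, dtype=float)
    test = np.asarray(test, dtype=float)
    if reference.ndim != 3 or reference.shape != test.shape:
        raise ValueError("expected two arrays of identical 3-D shape")

    n_mod = reference.shape[2]
    scores = np.empty(n_mod)
    for m in range(n_mod):
        # One modulation channel at a time (independent processing) ...
        # ... but time and ALL auditory channels flattened together,
        # so cross-channel differences enter a single joint comparison.
        ref_c = reference[:, :, m].ravel()
        tst_c = test[:, :, m].ravel()
        ref_c = ref_c - ref_c.mean()
        tst_c = tst_c - tst_c.mean()
        denom = np.sqrt(np.sum(ref_c ** 2) * np.sum(tst_c ** 2))
        # Guard against constant inputs, which have zero variance.
        scores[m] = np.sum(ref_c * tst_c) / denom if denom > 0 else 0.0
    return scores
```

The function returns one score per modulation channel. How those scores would then be combined into a single intelligibility estimate is not described in the excerpt and is left open here.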