“…This frequency-selective characteristic of AM processing has been modelled using the concept of a modulation filterbank, based on the idea that AM fluctuations are decomposed through an array of relatively broad bandpass modulation filters with a constant quality (Q) factor of approximately 1-2 (e.g., Dau et al ., 1997a, 1999; Ewert and Dau, 2000). Computational modelling studies have successfully applied the modulation filterbank concept to simulate data from various experimental paradigms, including simultaneous and non-simultaneous spectral and temporal signal detection and masking conditions (Dau et al ., 1997a, 1997b, 1999; Verhey et al ., 1999; Ewert and Dau, 2000; Ewert et al ., 2002; Piechowiak et al ., 2007; Jepsen et al ., 2008; Jepsen and Dau, 2011; King et al ., 2019), sound texture perception (McDermott and Simoncelli, 2011; McDermott et al ., 2013; McWalter and Dau, 2015, 2017), auditory stream segregation (Elhilali et al ., 2009; Christiansen et al ., 2014), and speech intelligibility (Jørgensen and Dau, 2011; Jørgensen et al, 2013; Relaño-Iborra et al, 2016, 2019; Zaar and Dau, 2017; Zaar et al, 2017; Steinmetzger et al, 2019; Zaar and Carney, 2022; for a review, see Relaño-Iborra and Dau, 2022). Furthermore, the modulation filterbank is conceptually consistent with the temporal dimension of a ‘two-dimensional’ spectro-temporal modulation filterbank, inspired by neural responses to spectro-temporally varying stimuli in the auditory cortex of ferrets (Kowalski et al ., 1996; Depireux et al ., 2001) and supported by data from perceptual learning and masking conditions (Sabin et al ., 2012; Oetjen and Verhey, 2015, 2017; Conroy et al ., 2022), as well as models of speech intelligibility (Elhilali et al ., 2003; Chi et al ., 2005; Zilany and Bruce, 2007; Chabot-Leclerc et al ., 2014).…”