“…To ensure the acoustic analyses were the most accurate representation of the stimuli heard in the scanner, acoustic features were derived from high-fidelity recordings of each stimulus from the scanner's playback system. From these recordings we extracted acoustic features known to influence the neural processing of sound in humans (Alluri et al., 2012; Giordano et al., 2012; Allen et al., 2017; Lewis et al., 2012; Warren et al., 2005; Lewis et al., 2009; Patterson et al., 2002; Ogg & Slevc, 2019a, 2019b; Menon et al., 2002; Langers et al., 2007; Norman-Haignere et al., 2015; Schönwiesner & Zatorre, 2009). Temporal dynamics of the stimuli (regarding the speed of sound onset) were characterized via the log-attack-time and temporal centroid of the energy envelope (Peeters et al., 2011; Kazazis et al., 2017). Spectral qualities were assessed via spectral centroid (similar to overall brightness), spectral flatness (similar to overall noisiness of the spectrum) and spectral variability (index of spectral change over time) extracted from an ERB (equivalent rectangular-bandwidth) gammatone representation (i.e., a cochleagram) of the sounds (Peeters et al., 2011; Kazazis et al., 2017) in 5-ms windows/increments.…”
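The features named in this passage can be illustrated with a minimal NumPy sketch. It is not the authors' pipeline, and several choices are assumptions made only for illustration: a Hann-windowed FFT magnitude spectrum stands in for the ERB gammatone cochleagram, the attack is measured between fixed 10% and 90% thresholds of the envelope peak rather than with the adaptive procedure of Peeters et al. (2011), and the function names `frame_signal`, `temporal_features`, and `spectral_features` are hypothetical.

```python
import numpy as np


def frame_signal(x, frame_len, hop):
    """Slice a 1-D signal into frames of frame_len samples, hop samples apart."""
    if len(x) < frame_len:
        raise ValueError("signal is shorter than one analysis frame")
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]


def temporal_features(x, sr, win_s=0.005, lo=0.1, hi=0.9):
    """Return (log-attack-time, temporal centroid in seconds).

    Assumption: the energy envelope is the RMS of non-overlapping 5-ms frames,
    and the attack runs from the first frame at or above lo * peak to the
    first frame at or above hi * peak.
    """
    frame_len = int(round(win_s * sr))
    frames = frame_signal(x, frame_len, frame_len)
    env = np.sqrt(np.mean(frames ** 2, axis=1))      # RMS energy envelope
    t = (np.arange(len(env)) + 0.5) * win_s          # frame-center times (s)
    peak = env.max()
    t_start = t[np.argmax(env >= lo * peak)]         # first crossing of lo
    t_end = t[np.argmax(env >= hi * peak)]           # first crossing of hi
    attack = max(t_end - t_start, win_s)             # at least one frame long
    log_attack_time = np.log10(attack)
    temporal_centroid = np.sum(t * env) / np.sum(env)
    return log_attack_time, temporal_centroid


def spectral_features(x, sr, win_s=0.005):
    """Return per-frame (centroid in Hz, flatness, variation).

    Assumption: an FFT magnitude spectrum replaces the ERB gammatone
    representation. The variation array has one fewer entry than the others
    because it compares each frame with the previous one.
    """
    frame_len = int(round(win_s * sr))
    frames = frame_signal(x, frame_len, frame_len) * np.hanning(frame_len)
    mag = np.abs(np.fft.rfft(frames, axis=1)) + 1e-12   # avoid log(0)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sr)
    # Centroid: magnitude-weighted mean frequency (brightness).
    centroid = (mag * freqs).sum(axis=1) / mag.sum(axis=1)
    # Flatness: geometric mean over arithmetic mean (noisiness, in (0, 1]).
    flatness = np.exp(np.mean(np.log(mag), axis=1)) / np.mean(mag, axis=1)
    # Variation: 1 minus the normalized correlation of successive spectra.
    a, b = mag[:-1], mag[1:]
    norm = np.sqrt((a ** 2).sum(axis=1) * (b ** 2).sum(axis=1))
    variation = 1.0 - (a * b).sum(axis=1) / norm
    return centroid, flatness, variation


if __name__ == "__main__":
    sr = 16000
    t = np.arange(int(0.5 * sr)) / sr
    # Synthetic example: 1-kHz tone with a 50-ms linear onset ramp.
    x = np.sin(2 * np.pi * 1000 * t) * np.clip(t / 0.05, 0, 1)
    lat, tc = temporal_features(x, sr)
    cen, flat, var = spectral_features(x, sr)
    print(lat, tc, cen.mean(), flat.mean(), var.mean())
```

With 5-ms frames the FFT frequency resolution is only 200 Hz at a 16-kHz sampling rate, which is one practical reason an auditory filterbank such as the ERB gammatone representation described in the passage may be preferable to this simplified spectrum.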