1Segmenting the continuous speech stream into units for further perceptual and linguistic 2 analyses is fundamental to speech recognition. The speech amplitude envelope (SE) has 3 long been considered a fundamental temporal cue for segmenting speech. Does the 4 temporal fine structure (TFS), a significant part of speech signals often considered to 5 contain primarily spectral information, contribute to speech segmentation? Using 6 magnetoencephalography, we show that the TFS entrains cortical oscillatory responses 7 between 3-6 Hz and demonstrate, using mutual information analysis, that (i) the temporal 8 information in the TFS can be reconstructed from a measure of frame-to-frame spectral 9 change and correlates with the SE and (ii) that spectral resolution is key to the extraction 10 of such temporal information. Furthermore, we show behavioural evidence that, when the 11 SE is temporally distorted, the TFS provides cues for speech segmentation and aids 12 speech recognition significantly. Our findings show that it is insufficient to investigate 13 solely the SE to understand temporal speech segmentation, as the SE and the TFS derived 14 from a band-filtering method convey comparable, if not inseparable, temporal 15 information. We argue for a more synthetic view of speech segmentationthe auditory 16 system groups speech signals coherently in both temporal and spectral domains. 17 18