The amplitude envelope of speech is crucial for accurate comprehension, and several studies have shown 1 that the phase of neural activity in the theta-delta bands (1 -10 Hz) tracks the phase of the speech 2 amplitude envelope during listening, a process referred to as envelope tracking. However, the 3 mechanisms underlying envelope tracking have been heavily debated. Envelope tracking may reflect 4 either continuous entrainment of endogenous low-frequency oscillations to the speech envelope or the 5 combination of a series of evoked responses to acoustic landmarks within the envelope. To distinguish 6 between these two accounts for envelope tracking, and to identify the acoustic features driving it, we 7 recorded magnetoencephalography (MEG) while participants listened to natural and slowed speech. First, 8 we found that acoustic edges in speech amplitude drove evoked responses and induced phase-resetting in 9 the theta-delta band, supporting the evoked response account. In line with this, phase locking in the theta 10 band was transient and independent of the modulation rate of the speech envelope but matched the 11 temporal extent of the evoked response. Further analysis showed that the magnitude of theta phase-12 locking reflected the slope of amplitude increases. However, although amplitude increases were more 13 gradual in slowed speech, the magnitude of phase-locking did not differ between slow and regular speech, 14 reflecting a normalization for speech rate. Taken together, our results are in line with predictions of the 15 evoked response account of speech envelope tracking and support acoustic edge detection as a flexible 16 mechanism for tracking of temporal dynamics in speech. 17 18