This article examines speech errors in Cantonese with the aim of fleshing out a larger speech production architecture for encoding phonological tone. A corpus was created by extracting 2,462 speech errors, including 668 tone errors, from audio recordings of natural conversations. The structure of these errors was then investigated in order to distinguish two contemporary approaches to tone in speech production. In the tonal frames account, tone is encoded like metrical stress, represented in abstract structural frames for a word. Because tone cannot be mis-selected in tonal frames, tone errors are expected to be rare and non-contextual, as observed with stress. An alternative is that tone is actively selected in phonological encoding like phonological segments. This approach predicts that tone errors will be relatively common and exhibit the contextual patterns observed with segments, like perseveration and anticipation. In our corpus, tone errors are the second most common type of error, and the majority of errors exhibit contextual patterns that parallel segmental errors. Building on prior research, a two-stage model of phonological tone encoding is proposed, following the patterns seen in tone errors: Tone is phonologically selected concurrently with segments, but then sequentially assigned after segments to a syllable.