“…More powerful models have included conditional random fields [4], boosted hierarchical prediction [5], and punctuation as neural machine translation (NMT) [6]. However, state of the art results have been achieved using convolutional neural networks (CNNs) [7], long short term memory (LSTM) networks [8,9,10], and transformers [1,11], treating the problem as a classification task. In these latter approaches, the task is to consider which punctuation symbol should follow each token in an utterance, rather than, say, detecting just sentence boundaries in text.…”