“…Model-based approaches typically formulate confidence estimation as a binary classification task [20,21,22,23,25,24,27,28,29], in which correct tokens, words, or utterances should receive confidence scores close to 1 and incorrect ones scores close to 0. For word-level confidence estimation, scores between 0 and 1 are assigned only to words that appear in the hypotheses.…”
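
The binary-classification framing can be illustrated with a minimal sketch. The excerpt does not say how targets are derived, so everything here is an assumption: the function names `word_targets` and `binary_cross_entropy` are hypothetical, and Python's `difflib.SequenceMatcher` stands in for the edit-distance alignment a real system would more likely use. Each hypothesis word gets a target of 1 if it aligns to an identical reference word and 0 otherwise; reference words missing from the hypothesis (deletions) get no target, which mirrors the point that only words appearing in the hypotheses are scored.

```python
import math
from difflib import SequenceMatcher


def word_targets(reference, hypothesis):
    """Return one 0/1 target per hypothesis word (1 = matches the reference)."""
    targets = [0] * len(hypothesis)
    # Assumption: difflib's alignment approximates an edit-distance alignment.
    matcher = SequenceMatcher(a=reference, b=hypothesis, autojunk=False)
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            for j in range(j1, j2):
                targets[j] = 1
        # "replace" and "insert" leave the hypothesis words at 0.
        # "delete" covers reference words absent from the hypothesis,
        # so it produces no target at all.
    return targets


def binary_cross_entropy(scores, targets, eps=1e-7):
    """Mean binary cross-entropy between confidence scores and 0/1 targets."""
    total = 0.0
    for p, y in zip(scores, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to keep log() finite
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(targets)


reference = "the cat sat on the mat".split()
hypothesis = "the cat sat in the hat today".split()
targets = word_targets(reference, hypothesis)
print(targets)  # [1, 1, 1, 0, 1, 0, 0]
```

A confidence model trained under this framing would output one score per hypothesis word and be penalised with a loss such as `binary_cross_entropy(scores, targets)`; a deleted reference word never enters the loss because it has no position in the hypothesis.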