“…It is widely acknowledged that speaker adaptive training (SAT) is effective in improving ASR performance, especially for large vocabulary tasks [18][19][20]. Approaches to SAT are divided into two categories: model-based approaches, e.g., maximum likelihood linear regression (MLLR) [21], and feature-based approaches, e.g., feature-space MLLR (fMLLR) [22], i-vectors [23], speaker codes [24], and other appending features [25]. All of these methods are based on the assumption that speech transcription and/or speaker identity information are available.…”