“…The computer-assisted pronunciation training (CAPT) system, which can conduct assessments and provide detailed feedback on pronunciation proficiency, is thus attracting attention as an ESL learning service and platform [1,2]. There are two technical approaches to the CAPT system: mispronunciation detection and diagnosis (MDD) [3,4,5,6,7,8,9,10,11,12] and automatic pronunciation assessment [6,13,14,15,16,17,10]. MDD is a task of detecting pronunciation errors by calculating multiple measures using estimated and canonical phones from an automatic speech recognizer.…”