“…The idea of training an ASR system based on its own transcription is closely related to the widely applied pseudo-labeling method in the field of ASR, which involves using a model’s own transcription of degraded speech as a supervision signal to adapt the model. 10 , 11 , 12 , 13 Recent advances in DNN-based ASR systems have offered a potential tool to investigate the computational strategy underlying speech recognition behavior, as these systems have reached human-level speech recognition performance in many scenarios. 14 , 15 Therefore, despite dramatic differences in the implementation of ASR systems and the human brain, we utilize the ASR system to probe the computational-level principle behind the rapid human adaptation to acoustically degraded speech.…”