“…Image captioning is a traditional task and has received extensive research interest (You et al, 2016;Aneja et al, 2018;Xu et al, 2021). Radiology report generation can be treated as an extension of image captioning tasks to the medical domain, aiming to describe radiology images in the text (i.e., findings), and has achieved considerable improvements in recent years (Chen et al, 2020;Zhang et al, 2020a;Liu et al, 2019bLiu et al, , 2021bZhou et al, 2021;Boag et al, 2020;Pahwa et al, 2021;Jing et al, 2019;Zhang et al, 2020b;You et al, 2021;Liu et al, 2019a). Liu et al (2021a) employed competence-based curriculum learning to promote report generation, which started from simple reports and then attempted to consume harder reports.…”