“…There are a large body of recent works on learning with noisy labels, which include but do not limit to estimating the noise transition matrix [9,20,53,54], reweighting examples [38,44,45,47], selecting confident examples [4,25,33,56], designing robust loss functions [10,12,49,64], introducing regularization [5,23,61], generating pseudo labels [17,34,46,63,66], and etc. In addition, some advanced start-of-the-art methods combine serveral techniques, e.g., DivideMix [30] and ELR+ [37].…”