“…Four studies, i.e., Sheng et al (2020), , Zhu et al (2020), and Wang et al (2021), used "PofB20" (Jiang et al, 2013) to measure the percentage of defects that a developer can identify by inspecting the top 20% of lines of code. Four studies, i.e., Qiao & Wang (2019), Xu et al (2019), Xu et al (2021b), and Zhao et al (2021b), used Effort-Aware recall (EARecall), which is defined as the percentage of defective commit instances that are reviewed out of all defective commit instances. Three studies, i.e., Xu et al (2019), Xu et al (2021b), and Zhao et al (2021b), used Effort-Aware F-measure (EAF-measure), which is defined as the weighted harmonic mean of EARecall and EAPrecision.…”
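
The effort-aware measures described above can be illustrated with a minimal sketch. The excerpt does not spell out the inspection budget for EARecall, the definition of EAPrecision, or the weighting in EAF-measure, so the following are assumptions made only for illustration: items are ranked by predicted score, inspection stops once 20% of the total lines of code would be exceeded, EAPrecision is the share of inspected items that are defective, and the weighted harmonic mean takes the usual F-beta form. The names `effort_aware_metrics`, `budget`, and `beta` are hypothetical and do not come from the cited studies.

```python
from typing import List, Tuple

# Each item is (predicted_score, lines_of_code, is_defective).
Item = Tuple[float, int, bool]


def effort_aware_metrics(items: List[Item], budget: float = 0.2,
                         beta: float = 1.0) -> Tuple[float, float, float]:
    """Return (EARecall, EAPrecision, EAF-measure) under the stated assumptions."""
    total_loc = sum(loc for _, loc, _ in items)
    limit = budget * total_loc  # assumed effort budget, in lines of code

    # Inspect items from highest to lowest predicted score until the
    # next item would push the inspected LOC past the budget.
    inspected: List[Item] = []
    used = 0
    for item in sorted(items, key=lambda t: t[0], reverse=True):
        if used + item[1] > limit:
            break
        used += item[1]
        inspected.append(item)

    total_defective = sum(1 for _, _, d in items if d)
    found = sum(1 for _, _, d in inspected if d)

    recall = found / total_defective if total_defective else 0.0
    precision = found / len(inspected) if inspected else 0.0

    # Weighted harmonic mean of precision and recall (F-beta form, assumed).
    denom = beta ** 2 * precision + recall
    f_measure = (1 + beta ** 2) * precision * recall / denom if denom else 0.0
    return recall, precision, f_measure


# Toy data, invented for this example only.
commits = [(0.9, 10, True), (0.8, 10, False), (0.7, 30, True), (0.2, 50, True)]
ea_recall, ea_precision, ea_f = effort_aware_metrics(commits)
```

With the toy data, the 20% budget covers the two highest-ranked commits, one of which is defective, so the sketch yields an EARecall of 1/3 and an EAPrecision of 1/2. A PofB20-style figure has the same shape as the recall value here when each item is a ranked unit of code and the budget is 20% of the lines; the exact granularity used by the cited studies is not stated in the excerpt.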