“…Of the remaining 1,810, a further 1,743 were irrelevant based on their title and/or abstract. Full‐text screening of the remaining 67 eligible studies resulted in exclusion of 37 irrelevant studies [1–3, 5, 6, 8–11, 13, 14, 21, 23–25, 28, 29, 32, 33, 37, 38, 42, 44–46, 51, 55–57, 60, 61, 63, 66–70], five studies that did not report intra‐ or inter‐observer agreement [2, 35, 43, 52, 61], four studies that were conference proceedings [40, 49, 53, 59], two studies that assessed agreement for classification systems other than Dejour classification [34, 48], one study that included skeletally immature patients [15], two studies that expressed inter‐observer agreement only as a percentage [36, 64], three studies that were not in English or French [12, 39, 71] and three studies that could not be retrieved [7, 19, 47]. No additional studies were identified from reference lists, which left ten relevant studies for data extraction [4, 6, 22, 30, 31, 39, 41, 54, 58, 65].…”