“…The sample sizes ranged from n = 24 (Ross & Oliver, 2002) to n = 4,069 (Tsiouris et al., 2011). Sixteen studies were determined to be of high quality (Bernstein et al., 2015; Cervantes & Matson, 2015; Clark et al., 2016; Crocker et al., 2014; Davies et al., 2015; Esbensen & Benson, 2006; Hartley & MacLean, 2007; Hemmings et al., 2006; Horovitz et al., 2013; Koritsas & Iacono, 2015; Matson et al., 2009; Matson & Rivet, 2008; Novaco & Taylor, 2004; Sappok et al., 2014; Tenneij et al., 2009; Tsiouris et al., 2011), the remaining 22 studies were rated as low quality (Alexander et al., 2010, 2015; Allen et al., 2012; Bowring et al., 2017; Crocker et al., 2006; Didden et al., 2009; Drieschner et al., 2013; Hurley, 2008; Larson et al., 2011; Lindsay et al., 2013; Lundqvist, 2013; Lunsky et al., 2012; Nøttestad & Linaker, 2002; Owen et al., 2004; Phillips & Rose, 2010; Rojahn et al., 2004, 2010; Ross & Oliver, 2002; Thorson et al., 2008; Totsika et al., 2008; Turygin et al., 2013; Tyrer et al., 2006). A total of 27 different instruments were used to assess aggressive behaviour.…”