“…Recent studies on both entity and event coreference resolution use several metrics to evaluate system performance (Bejan and Harabagiu, 2010;Lee et al, 2012;Durrett et al, 2013;Lassalle and Denis, 2013) since there is no agreement on a single metric. Currently, five metrics are widely used: MUC (Vilain et al, 1995), B-CUBED (Bagga and Baldwin, 1998), two CEAF metrics CEAF-φ 3 and CEAF-φ 4 (Luo, 2005), and BLANC (Recasens and Hovy, 2011).…”