“…

System                                              ≤10      all
w/o gold POS tags
DMV (Klein and Manning, 2004)                       49.6     35.8
E-DMV (Headden III et al., 2009)                    52.1     38.2
UR-A E-DMV (Tu and Honavar, 2012)                   58.9     46.1
CS* (Spitkovsky et al., 2013)                       72.0*    64.4*
Neural E-DMV (Jiang et al., 2016)                   55.3     42.7
CRFAE (Cai et al., 2017)                            37.
…
(Klein and Manning, 2004)                           55.1     39.7
UR-A E-DMV (Tu and Honavar, 2012)                   71.4     57.0
MaxEnc (Le and Zuidema, 2015)                       73.2     65.8
Neural E-DMV (Jiang et al., 2016)                   72.5     57.6
CRFAE (Cai et al., 2017)                            71.7     55.7
L-NDMV (Big training data) (Han et al., 2017)       77.2     63.2

parameters are initialized in the same way as in the POS tagging experiment. We evaluate with directed dependency accuracy (DDA) and report it on sentences of length ≤ 10 and on sentences of all lengths.…”
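To make the evaluation metric concrete, here is a minimal sketch of how directed dependency accuracy could be computed. It is not the authors' evaluation script. It assumes each parse is given as a list of head indices, one per token, with 0 standing for the root; the function name `directed_dependency_accuracy` and the `max_len` parameter are hypothetical. Any preprocessing the paper may apply, such as punctuation removal, is not modeled.

```python
from typing import Optional, Sequence


def directed_dependency_accuracy(
    gold: Sequence[Sequence[int]],
    pred: Sequence[Sequence[int]],
    max_len: Optional[int] = None,
) -> float:
    """Fraction of tokens whose predicted head equals the gold head.

    Assumption: gold[i][j] and pred[i][j] are the head index of token j
    in sentence i (0 = root). If max_len is set, sentences longer than
    max_len tokens are skipped (e.g. max_len=10 for a short-sentence subset).
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must contain the same number of sentences")

    correct = 0
    total = 0
    for gold_heads, pred_heads in zip(gold, pred):
        if len(gold_heads) != len(pred_heads):
            raise ValueError("gold and predicted parses differ in length")
        if max_len is not None and len(gold_heads) > max_len:
            continue  # outside the length-restricted subset
        # A token counts as correct only if its head matches exactly,
        # so the direction of the arc matters.
        correct += sum(1 for g, p in zip(gold_heads, pred_heads) if g == p)
        total += len(gold_heads)

    return correct / total if total else 0.0


# Toy data (invented for illustration, not taken from the paper).
gold = [[2, 0, 2], [0, 1]]
pred = [[2, 0, 1], [0, 1]]
print(directed_dependency_accuracy(gold, pred))             # all lengths
print(directed_dependency_accuracy(gold, pred, max_len=2))  # short sentences only
```

On the toy data, four of the five tokens have the correct head when all sentences are scored. Restricting to sentences of at most two tokens leaves only the second sentence, which is fully correct.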