“…Early work in unsupervised PCFG induction from raw text (Johnson et al, 2007;Liang et al, 2009;Tu, 2012) was not as successful as models of unsupervised constituency parsing (Seginer, 2007;Ponvert et al, 2011). However, recent work from unsupervised parsing (Shen et al, 2019;Drozdov et al, 2019Drozdov et al, , 2020 and grammar induction (Jin et al, 2018a(Jin et al, , 2019Zhu et al, 2020;Jin and Schuler, 2020;Li et al, 2020) shows much improvement over previous results with grammars learned solely from raw text, indicating that statistical regularities relevant to syntactic acquisition can be found in word collocations. For example, propose a word-based neural compound PCFG induction model for accurate grammar induction on English.…”