“…There is much prior work that induces, operates over, or otherwise uses a tree structure in neural net-work models (Socher et al, 2013a;Tai et al, 2015;Le and Zuidema, 2015;Dyer et al, 2016;Bradbury and Socher, 2017;Choi et al, 2017Choi et al, , 2018Drozdov et al, 2019;Ahmed et al, 2019;Wang et al, 2019;Mrini et al, 2021;Hu et al, 2021;Sartran et al, 2022). Such models are especially of interest due to the prevalence of trees in natural language.…”