“…The MWP is the task of translating a short paragraph consisting with multiple short sentences into target mathematical equations. Previous approaches usually solve the MWP by using rulebased methods (Yuhui et al, 2010;Bakman, 2007), statistical machine learning methods (Kushman et al, 2014;Mitra and Baral, 2016;Roy and Roth, 2018;Zou and Lu, 2019), semantic parsing methods (Shi et al, 2015;Roy and Roth, 2015;Huang et al, 2017) and deep learning methods (Ling et al, 2017a;Wang et al, 2018b;Liu et al, 2019b;Wang et al, 2017;Zhang et al, 2020a). Recently, the deep learning based methods have been paid more attention for their significant improvement.…”