“…This paper primarily focuses on tracking entities' states and properties throughout a procedural text. Recent research has addressed this problem by predicting actions and properties on local context (Prolocal) , autoregressive global predictions based on distance vectors (Proglobal) , integrating structural common-sense knowledge built over VerbNet (ProStruct) , building dynamic knowledge graphs over entities (KG-MRC) (Das et al, 2018), explicitly encoding the model to explain dependencies between actions (XPAD) , formulating local predictions and global sequential information flow and sequential constraints (NCET) (Gupta and Durrett, 2019), formulating the task in a QA setting (DynaPro, TSLM) (Amini et al, 2020;Faghihi and Kordjamshidi, 2021), inte-grating common-sense knowledge from Concetp-Net (KOALA) , utilizing large generative language models (LEMON) (Shi et al, 2022), or using both the question answering setting and sequential structural constraints at the same time (CGLI) (Ma et al, 2022). All the models mentioned above investigate different neural architectures to tackle the task, while we are more interested in augmenting them with additional knowledge from semantic parsers.…”