2022
DOI: 10.48550/arxiv.2201.04275
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

PhysNLU: A Language Resource for Evaluating Natural Language Understanding and Explanation Coherence in Physics

Abstract: In order for language models to aid physics research, they must first encode representations of mathematical and natural language discourse which lead to coherent explanations, with correct ordering and relevance of statements. We present a collection of datasets developed to evaluate the performance of language models in this regard, which measure capabilities with respect to sentence ordering, position, section prediction, and discourse coherence. Analysis of the data reveals equations and sub-disciplines wh… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 26 publications
0
1
0
Order By: Relevance
“…Communicating quantitative science occurs through the medium of mathematical text, which contains expressions, formulae, and equations, most of which requiring accompanying description. Formulae and their explanations interweave with non-mathematical language to form cohesive discourse (Meadows et al, 2022). Approaches that consider mathematical text have been proposed to solve a number of related tasks, but are yet to surpass human-level performance.…”
Section: Introductionmentioning
confidence: 99%
“…Communicating quantitative science occurs through the medium of mathematical text, which contains expressions, formulae, and equations, most of which requiring accompanying description. Formulae and their explanations interweave with non-mathematical language to form cohesive discourse (Meadows et al, 2022). Approaches that consider mathematical text have been proposed to solve a number of related tasks, but are yet to surpass human-level performance.…”
Section: Introductionmentioning
confidence: 99%