2021
DOI: 10.1162/tacl_a_00376
|View full text |Cite
|
Sign up to set email alerts
|

An Error Analysis Framework for Shallow Surface Realization

Abstract: The metrics standardly used to evaluate Natural Language Generation (NLG) models, such as BLEU or METEOR, fail to provide information on which linguistic factors impact performance. Focusing on Surface Realization (SR), the task of converting an unordered dependency tree into a well-formed sentence, we propose a framework for error analysis which permits identifying which features of the input affect the models’ results. This framework consists of two main components: (i) correlation analyses between a wide ra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 37 publications
0
1
0
Order By: Relevance
“…3 There has been some effort to automate this process. For example, Shimorina et al (2021) describe an automatic error analysis procedure for shallow surface realisation, and Stevens-Guille et al (2020) automate the detection of repetitions, omissions, and hallucinations. However, for many NLG tasks, this kind of automation is still out of reach, given the wide range of possible correct outputs that are available in language generation tasks.…”
Section: Defining Errorsmentioning
confidence: 99%
“…3 There has been some effort to automate this process. For example, Shimorina et al (2021) describe an automatic error analysis procedure for shallow surface realisation, and Stevens-Guille et al (2020) automate the detection of repetitions, omissions, and hallucinations. However, for many NLG tasks, this kind of automation is still out of reach, given the wide range of possible correct outputs that are available in language generation tasks.…”
Section: Defining Errorsmentioning
confidence: 99%