Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1446
|View full text |Cite
|
Sign up to set email alerts
|

SemBleu: A Robust Metric for AMR Parsing Evaluation

Abstract: Evaluating AMR parsing accuracy involves comparing pairs of AMR graphs. The major evaluation metric, SMATCH , searches for one-to-one mappings between the nodes of two AMRs with a greedy hill-climbing algorithm, which leads to search errors. We propose SEMBLEU, a robust metric that extends BLEU (Papineni et al., 2002) to AMRs. It does not suffer from search errors and considers non-local correspondences in addition to local ones. SEMBLEU is fully content-driven and punishes situations where a system's output d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(20 citation statements)
references
References 17 publications
0
20
0
Order By: Relevance
“…The metric SEMBLEU (Song and Gildea, 2019) is most closely related to ours. It evaluates AMR graphs by calculating precision based on n-gram overlap.…”
Section: Related Workmentioning
confidence: 63%
“…The metric SEMBLEU (Song and Gildea, 2019) is most closely related to ours. It evaluates AMR graphs by calculating precision based on n-gram overlap.…”
Section: Related Workmentioning
confidence: 63%
“…a graph-based encoding of the Discourse Representation Structures of Basile et al (2012). Further, we plan on refining and extending the available training data (in particular for UCCA) and will put greater focus on the systematic exploration of variant evaluation perspectives, for example scoring at the level of larger sub-graphs in the spirit of the 'complete predications' metric of , or 'semantic n-grams' along the lines of the SemBleu proposal by Song and Gildea (2019). Aiming for increased linguistic diversity, it will of course also be tempting to seek to include meaning representations for additional languages.…”
Section: Reflections and Outlookmentioning
confidence: 99%
“…Simplify and match -SEMBLEU The SEMBLEU metric in Song and Gildea (2019) can also be described as a two-step procedure. But unlike SMATCH it operates on a variable-free reduction of an AMR graph G, which we denote by G vf (vf : variable-free, Figure 1, right-hand side).…”
Section: Amr Metrics: Smatch and Sembleumentioning
confidence: 99%
“…Its backbone is an alignment-search be-tween the graphs' variables. Recently, the SEMBLEU metric (Song and Gildea, 2019) has been proposed that operates on the basis of a variable-free AMR (Figure 1, right), 1 converting it to a bag of k-grams. Circumventing a variable alignment search reduces computational cost and ensures full determinacy.…”
Section: Introductionmentioning
confidence: 99%