2023
DOI: 10.48550/arxiv.2301.00876
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding

Abstract: Reading comprehension of legal text can be a particularly challenging task due to the length and complexity of legal clauses and a shortage of expert-annotated datasets. To address this challenge, we introduce the Merger Agreement Understanding Dataset (MAUD), an expert-annotated reading comprehension dataset based on the American Bar Association's 2021 Public Target Deal Points Study, with over 39,000 examples and over 47,000 total annotations. Our fine-tuned Transformer baselines show promising results, with… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 5 publications
0
3
0
Order By: Relevance
“…In recent years, a few legal corpora have been proposed in English, such as LEDGAR [18], CUAD [10], BillSum [19], MAUD [11], and EUR-Lex-Sum [20]. The first, LEDGAR, consists of 100,000 provisions to be classified as provisions types (e.g.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…In recent years, a few legal corpora have been proposed in English, such as LEDGAR [18], CUAD [10], BillSum [19], MAUD [11], and EUR-Lex-Sum [20]. The first, LEDGAR, consists of 100,000 provisions to be classified as provisions types (e.g.…”
Section: Related Workmentioning
confidence: 99%
“…Thus, creating curated large legal annotated corpora has been proven to be costly [10,11]. For example, MAUD, an expert-annotated merger agreement understanding dataset, has been estimated to cost $5 million using the standard hourly fees of specialized lawyers [11].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation