2020
DOI: 10.1007/978-3-030-66527-2_23
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Taxonomy Generation: A Use-Case in the Legal Domain

Abstract: A key challenge in the legal domain is the adaptation and representation of the legal knowledge expressed through texts, in order for legal practitioners and researchers to access this information easier and faster to help with compliance related issues. One way to approach this goal is in the form of a taxonomy of legal concepts. While this task usually requires a manual construction of terms and their relations by domain experts, this paper describes a methodology to automatically generate a taxonomy of lega… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…Typically, the starting point for the taxonomy construction is a text corpus that accurately characterizes a specific domain. Robin et al [ 15 ] describes a methodology to automatically generate a taxonomy of legal concepts, and apply this methodology on a corpus consisting of statutory instruments for the UK, Wales, Scotland, and Northern Ireland laws. Bai et al [ 16 ] first propose a method to construct business taxonomies automatically from corporate reports.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Typically, the starting point for the taxonomy construction is a text corpus that accurately characterizes a specific domain. Robin et al [ 15 ] describes a methodology to automatically generate a taxonomy of legal concepts, and apply this methodology on a corpus consisting of statutory instruments for the UK, Wales, Scotland, and Northern Ireland laws. Bai et al [ 16 ] first propose a method to construct business taxonomies automatically from corporate reports.…”
Section: Related Workmentioning
confidence: 99%
“…Word embedding, NLP and clustering techniques have been applied to the textual descriptions of the surveyed initiatives to categorize them into semantically homogeneous groups. To the best of our knowledge, such techniques have been used in different domains, such as law [ 15 ] and economics [ 16 ], but this is one of the first attempts to clustering initiatives against Covid19 from textual descriptions.…”
Section: Introductionmentioning
confidence: 99%
“…A general problem with domain-specific corpora, however, is that they may not meet requirements for token size. In a study of Robin, O'Neill, and Buitelaar (2017), this lead to the rejection of embeddings trained on domain-specific corpora in favor of embeddings trained on general-purpose corpora, since these performed better in a down-stream task. Furthermore, Roy, Park, and Pan (2017) claim that domain-specific corpora may not suffice to guarantee reliable domain-specific embeddings, since even their items can be sparse and not (co-)occur often enough although they are of relevance to the domain.…”
Section: Related Workmentioning
confidence: 99%