2018
DOI: 10.1111/cobi.13044
|View full text |Cite
|
Sign up to set email alerts
|

Using machine learning to disentangle homonyms in large text corpora

Abstract: Systematic reviews are an increasingly popular decision-making tool that provides an unbiased summary of evidence to support conservation action. These reviews bridge the gap between researchers and managers by presenting a comprehensive overview of all studies relating to a particular topic and identify specifically where and under which conditions an effect is present. However, several technical challenges can severely hinder the feasibility and applicability of systematic reviews, for example, homonyms (ter… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
34
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
2

Relationship

2
7

Authors

Journals

citations
Cited by 38 publications
(34 citation statements)
references
References 50 publications
0
34
0
Order By: Relevance
“…Scientific names represent a reliable proxy and preferable alternative to vernacular names, due to a strong and culturally independent association between their representation in digital corpora (Jarić et al, 2016;Correia et al, 2017Correia et al, , 2018. At the same time, search based on scientific names avoids numerous problems related to vernacular language, such as frequent vernacular synonyms and homonyms (Roll et al, 2018), differing names among languages, as well as lack of vernacular names for some species (Jarić et al, 2016). Accounting for taxonomic synonyms is also critical, as they can strongly affect the accuracy of species data retrieval .…”
Section: Methodsmentioning
confidence: 99%
“…Scientific names represent a reliable proxy and preferable alternative to vernacular names, due to a strong and culturally independent association between their representation in digital corpora (Jarić et al, 2016;Correia et al, 2017Correia et al, , 2018. At the same time, search based on scientific names avoids numerous problems related to vernacular language, such as frequent vernacular synonyms and homonyms (Roll et al, 2018), differing names among languages, as well as lack of vernacular names for some species (Jarić et al, 2016). Accounting for taxonomic synonyms is also critical, as they can strongly affect the accuracy of species data retrieval .…”
Section: Methodsmentioning
confidence: 99%
“…This is an area that we believe will be of increasing interest to users, particularly for updating existing reviews (algorithms can be trained to identify relevant studies based on similarity to previously included studies) [31] and dealing with very large bodies of literature.…”
Section: Future Developments Of Cadimamentioning
confidence: 99%
“…Third, the topic of this systematic map straddles multiple disciplines and sectors (conservation, development, natural resource management), thus substantial semantic diversity is more than likely to exist. Unlike medical fields, these fields lack a standardized ontology [45,46] and are characterized by rapid radiation of terms in use over the past few decades. While our search strategy attempted to capture this diversity through piloting and testing with an interdisciplinary and multisector review team, we recognize that not including specific terms in our search may have resulted in literature areas missed.…”
Section: Discussionmentioning
confidence: 99%