2021
DOI: 10.1111/2041-210x.13778
|View full text |Cite
|
Sign up to set email alerts
|

TaxoNERD: Deep neural models for the recognition of taxonomic entities in the ecological and evolutionary literature

Abstract: Easy access to multi‐taxa information (e.g. distribution, traits, diet) in the scientific literature is essential to understand, map and predict all‐inclusive biodiversity. Tools are needed to automatically extract useful information from the ever‐growing corpus of ecological texts and feed this information to open data repositories. A prerequisite is the ability to recognise mentions of taxa in text, a special case of named entity recognition (NER). In recent years, deep learning‐based NER systems have become… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
37
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 37 publications
(38 citation statements)
references
References 61 publications
0
37
0
1
Order By: Relevance
“…In ecology, for example, RNNs have been used to predict population dynamics (Joseph, 2020) or animal movements (Rew et al, 2019; see Borowiec et al, 2022 for more details on different DL algorithms). DL algorithms have also been used to synthesize taxonomic information from literature (Le Guillarme & Thuiller, 2022) or to predict species interactions in ecological networks (Strydom et al, 2021), or to predict species distributions (Deneu et al, 2021). In the following, we treat DL as a subfield of ML and only mention DL when relevant differences to classical ML algorithms are involved.…”
Section: Important ML and Dl Algorithms In More Detailmentioning
confidence: 99%
“…In ecology, for example, RNNs have been used to predict population dynamics (Joseph, 2020) or animal movements (Rew et al, 2019; see Borowiec et al, 2022 for more details on different DL algorithms). DL algorithms have also been used to synthesize taxonomic information from literature (Le Guillarme & Thuiller, 2022) or to predict species interactions in ecological networks (Strydom et al, 2021), or to predict species distributions (Deneu et al, 2021). In the following, we treat DL as a subfield of ML and only mention DL when relevant differences to classical ML algorithms are involved.…”
Section: Important ML and Dl Algorithms In More Detailmentioning
confidence: 99%
“…The species studied in each paper were identified from abstracts using the Python package TaxoNERD (53). Each biological entity was assigned to a NCBI taxonomy ID and higher-level taxonomic classifications were extracted with ETE Toolkit (54).…”
Section: Study Species Analysesmentioning
confidence: 99%
“…The operation of spaCy can be broken down into a four step process (see figure 2); initially terms are embedded via a bloom filter into a continuous vector space [18]. Next, a convolutional neural network is used to encode the terms into General neural architecture for spaCy NER [20] a sentence matrix [19] therefore taking context into account.…”
Section: Consequencementioning
confidence: 99%