2019
DOI: 10.1186/s13321-018-0327-2
|View full text |Cite
|
Sign up to set email alerts
|

LSTMVoter: chemical named entity recognition using a conglomerate of sequence labeling tools

Abstract: Background Chemical and biomedical named entity recognition (NER) is an essential preprocessing task in natural language processing . The identification and extraction of named entities from scientific articles is also attracting increasing interest in many scientific disciplines. Locating chemical named entities in the literature is an essential step in chemical text mining pipelines for identifying chemical mentions, their properties, and rel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
24
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 36 publications
(25 citation statements)
references
References 26 publications
0
24
0
1
Order By: Relevance
“…There are methods aimed at NER that have been developing during the last years (Kaewphan et al, 2018 ; Korvigo et al, 2018 ; Hemati and Mehler, 2019 ; Hong and Lee, 2020 ; Huang et al, 2020 ; Kilicoglu et al, 2020 ). Most of them are based on algorithms for NER related either to chemicals or biological objects.…”
Section: Introductionmentioning
confidence: 99%
“…There are methods aimed at NER that have been developing during the last years (Kaewphan et al, 2018 ; Korvigo et al, 2018 ; Hemati and Mehler, 2019 ; Hong and Lee, 2020 ; Huang et al, 2020 ; Kilicoglu et al, 2020 ). Most of them are based on algorithms for NER related either to chemicals or biological objects.…”
Section: Introductionmentioning
confidence: 99%
“…Early techniques for chemical text mining, such as dictionary-based methods (Rebholz-Schuhmann et al, 2007 ; Hettne et al, 2009 ; Akhondi et al, 2016 ) and grammar-based methods (Narayanaswamy et al, 2002 ; Liu et al, 2012 ; Akhondi et al, 2015 ), heavily rely on expert knowledge in the chemical domain. Recently, machine learning-based techniques have reported state-of-the-art effectiveness in chemical text mining (Hemati and Mehler, 2019 ; Zhai et al, 2019 ). However, such techniques require a large amount of annotated text data, which still remains limited.…”
Section: Related Workmentioning
confidence: 99%
“…We used the PyMedTermino library (Lamy et al, 2015) for concept indexing. A full-text search with the Levenshtein distance algorithm (Miller et al, 2009) was applied in a first instance for concept indexing and fuzzy search with threshold using FuzzyDict implementation (Hemati and Mehler, 2019) as a second approach for concepts not found by partial matching. The FastText model uses a combination of various subcomponents to produce high-quality embeddings.…”
Section: Medical Word and Concept Embeddingsmentioning
confidence: 99%
“…One of the most effective methods is Conditional Random Fields (CRF) (Lafferty et al, 2001) since CRF is one of the most reliable sequence labeling methods. Recently, deep learning-based methods have also demonstrated state-of-the-art performance for English (Hemati and Mehler, 2019;Pérez-Pérez et al, 2017;Suárez-Paniagua et al, 2019) texts by automatically learning relevant patterns from corpora, which allows language and domain independence. However, until now, to the best of our knowledge, there is only one work that addresses the generation of Spanish biomedical word embeddings (Armengol-Estapé Jordi, 2019;Soares et al, 2019).…”
Section: Introductionmentioning
confidence: 99%