2015
DOI: 10.1109/tpds.2014.2368568
|View full text |Cite
|
Sign up to set email alerts
|

Hadoop Recognition of Biomedical Named Entity Using Conditional Random Fields

Abstract: Processing large volumes of data has presented a challenging issue, particularly in data-redundant systems. As one of the most recognized models, the conditional random fields (CRF) model has been widely applied in biomedical named entity recognition (Bio-NER). Due to the internally sequential feature, performance improvement of the CRF model is nontrivial, which requires new parallelized solutions. By combining and parallelizing the limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) and Viterbi algorith… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
30
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 65 publications
(30 citation statements)
references
References 36 publications
0
30
0
Order By: Relevance
“…For example, majority of the systems submitted to the JNLPBA challenge made use of machine learning algorithms which have been observed to significantly outperform the dictionary based methods. Some of the recent works in BNER includes the unsupervised model as proposed in (Zhang and Elhadad, 2013), and the system based on CRF (Li et al, 2015a). A two-phase approach based on semi-Markov CRF is proposed in (Yang and Zhou, 2014).…”
Section: Related Workmentioning
confidence: 99%
“…For example, majority of the systems submitted to the JNLPBA challenge made use of machine learning algorithms which have been observed to significantly outperform the dictionary based methods. Some of the recent works in BNER includes the unsupervised model as proposed in (Zhang and Elhadad, 2013), and the system based on CRF (Li et al, 2015a). A two-phase approach based on semi-Markov CRF is proposed in (Yang and Zhou, 2014).…”
Section: Related Workmentioning
confidence: 99%
“…To attack the unsymmetrical co-occurrence problem of PMI, EPMI was proposed and defined to extract prototypical words based on extended mutual information (EMI) and PMI 2 [37]. To generate the co-occurrence vector v for the word wi, the co-occurrence relation between the word wi and every word wj from the dataset was determined using EPMI, which is derived from extended mutual information EMI and PMI , as per Equations (4) and (5):…”
Section: Extended Distributed Prototypical Methodsmentioning
confidence: 99%
“…The CRF model is a discriminant probability, undirected graph learning model proposed by Lafferty [8] based on the maximum entropy model [54] and hidden Markov model [55]. CRF was first proposed for sequence data analysis and has been successfully applied in the fields of natural language processing (NLP), bioinformatics, machine vision, and network intelligence [56][57][58][59].…”
Section: Crfmentioning
confidence: 99%