2018
DOI: 10.1093/database/bay091
|View full text |Cite
|
Sign up to set email alerts
|

Assisting document triage for human kinome curation via machine learning

Abstract: In the era of data explosion, the increasing frequency of published articles presents unorthodox challenges to fulfill specific curation requirements for bio-literature databases. Recognizing these demands, we designed a document triage system with automatic methods that can improve efficiency to retrieve the most relevant articles in curation workflows and reduce workloads for biocurators. Since the BioCreative VI (2017), we have implemented texting mining processing in our system in hopes of providing higher… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…tmVar detects mentions of genomic variants in literature and normalizes them into unique dbSNP RSIDs. Although tmVar may not facilitate literature triage directly, it is quite useful for positive or negative sampling in literature triage systems [2,12,19,20,21]. The READBiomed system [19] creates term lists describing interactions, mutations, and expected effects on interactions mutations, and it trains an SVM-based literature classification model using these term lists along with a range of standard bag-of-word features.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…tmVar detects mentions of genomic variants in literature and normalizes them into unique dbSNP RSIDs. Although tmVar may not facilitate literature triage directly, it is quite useful for positive or negative sampling in literature triage systems [2,12,19,20,21]. The READBiomed system [19] creates term lists describing interactions, mutations, and expected effects on interactions mutations, and it trains an SVM-based literature classification model using these term lists along with a range of standard bag-of-word features.…”
Section: Introductionmentioning
confidence: 99%
“…The READBiomed system [19] creates term lists describing interactions, mutations, and expected effects on interactions mutations, and it trains an SVM-based literature classification model using these term lists along with a range of standard bag-of-word features. In [20], literature triage is achieved by different classifiers (Glmnet, SVM and CNN) for the task of Human Kinome Curation. However, the work still requires a time-consuming feature engineering, including the selection of frequency, location, and linguistic features, together with manually generated keyword groups of the genetic disease field.…”
Section: Introductionmentioning
confidence: 99%
“…More recently, deep neural network-based approaches have shown improved results in many natural language processing (NLP) tasks including text classification (11). Neural network-based approaches such as the convolutional neural network (CNN) have been applied to assist document triage of kinome curation, genomic variation and protein–protein interactions (12–14). Lee et al (13) employed the CNN to identify publications that are relevant for variant curation.…”
Section: Introductionmentioning
confidence: 99%