2003
DOI: 10.1186/1471-2105-4-11
|View full text |Cite
|
Sign up to set email alerts
|

PreBIND and Textomy – mining the biomedical literature for protein-protein interactions using a support vector machine

Abstract: Background: The majority of experimentally verified molecular interaction and biological pathway data are present in the unstructured text of biomedical journal articles where they are inaccessible to computational methods. The Biomolecular interaction network database (BIND) seeks to capture these data in a machine-readable format. We hypothesized that the formidable task-size of backfilling the database could be reduced by using Support Vector Machine technology to first locate interaction information in the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
41
0
1

Year Published

2004
2004
2014
2014

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 254 publications
(42 citation statements)
references
References 28 publications
0
41
0
1
Order By: Relevance
“…We anticipate that the two most useful formats will be the Cytoscape SIF (as noted below) and CSV (for Microsoft Excel), from the perspective of the research biologist user of BIND. PreBIND (16) is currently a separate information system with interactions derived from text mining of MEDLINE abstracts (17). When the user cannot find BIND records, we suggest that they next search through the PreBIND database.…”
Section: Database Query and Retrievalmentioning
confidence: 99%
“…We anticipate that the two most useful formats will be the Cytoscape SIF (as noted below) and CSV (for Microsoft Excel), from the perspective of the research biologist user of BIND. PreBIND (16) is currently a separate information system with interactions derived from text mining of MEDLINE abstracts (17). When the user cannot find BIND records, we suggest that they next search through the PreBIND database.…”
Section: Database Query and Retrievalmentioning
confidence: 99%
“…They have high precision but low recall, because recognition patterns are usually too specific. Other machine learning approaches have classified abstracts and sentences for relevant interactions, but have not extracted information (Marcotte et al 2001; Donaldson et al 2003). For a more detailed report of these and related projects, see reviews by Andrade and Bork (2000), de Bruijn and Martin (2002), and Staab (2002).…”
Section: Introductionmentioning
confidence: 99%
“…PreBind (Donaldson et al 2003) is an approach complementary to BIND for finding interaction data in the over 14 million PubMed abstracts. PreBind functions as follows: (1) SVM technology is used to identify articles about biomolecular interactions and confirm sentences that mention specific protein-protein interactions; (2) protein names and their gene symbols are derived from a non-redundant sequence database (RefSeq) and from the S. cerevisiae Genome Database (SGD); (3) this information extraction system is coupled to a human-reviewed data-entry queue for a publicly available BIND.…”
Section: Literature Mining For Protein-protein Interactionsmentioning
confidence: 99%