2010
DOI: 10.1093/bioinformatics/btq520
|View full text |Cite
|
Sign up to set email alerts
|

PubDNA Finder: a web database linking full-text articles to sequences of nucleic acids

Abstract: PubDNA Finder can be freely accessed at http://servet.dia.fi.upm.es:8080/pubdnafinder

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
5
0
4

Year Published

2011
2011
2013
2013

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 1 publication
1
5
0
4
Order By: Relevance
“…Here, we show that DNA sequences in full-text articles provide a rich source of ‘unique identifiers’ that can be automatically extracted and mapped to genomic data in order to link articles to species, genes and genomic regions. We confirm recent findings that a substantial number of OA articles in PMC contain extractable DNA sequences (Garcia-Remesal et al , 2010b), and provide the first quantitative estimate of the proportion of PMC-OA articles with DNA sequences (∼22%), the majority of which we show are short sequences that are not found in GenBank.…”
Section: Discussionsupporting
confidence: 90%
See 1 more Smart Citation
“…Here, we show that DNA sequences in full-text articles provide a rich source of ‘unique identifiers’ that can be automatically extracted and mapped to genomic data in order to link articles to species, genes and genomic regions. We confirm recent findings that a substantial number of OA articles in PMC contain extractable DNA sequences (Garcia-Remesal et al , 2010b), and provide the first quantitative estimate of the proportion of PMC-OA articles with DNA sequences (∼22%), the majority of which we show are short sequences that are not found in GenBank.…”
Section: Discussionsupporting
confidence: 90%
“…The possibility that DNA and protein sequences can be extracted from biomedical text was first demonstrated by Wren et al (2005) and subsequently by several other groups (Aerts et al , 2008; Garcia-Remesal et al , 2010a, b; Shtatland et al , 2007). Aerts et al (2008) extended this technique to show that DNA sequences extracted from biomedical text could be mapped to genome sequences to identify the location, organism and target gene mentioned in an article.…”
Section: Introductionmentioning
confidence: 99%
“…In fact, the latter task is a significant research topic in biomedical informatics research, since many different systems for automatically indexing and searching the biomedical literature have been developed over the last few years. Examples include Pharmspresso [60], an information retrieval and extraction system for pharmacogenomic-related literature that follows a dictionary-based approach to identify instances of genes, drugs, polymorphisms and diseases, or PubDNA Finder [17], an online repository that we developed to link PubMed Central manuscripts to the sequences of nucleic acids appearing in them, following a hybrid approach that combines a rule-based system and lookup lists. We have already begun working in this direction with the development of a prototype of the “nanotoxicity searcher”.…”
Section: Resultsmentioning
confidence: 99%
“…For this purpose, the automatic assignment of categories to the nanomaterials could be carried out with the use of text mining and NLP techniques to extract relevant information from papers describing them. Clustering engines could also help in this categorization by the application of standard mechanisms and algorithms, such as normalization, feature selection, consensus clustering and multidimensional scaling, among others (García-Remesal et al 2010). Nanomaterials would be clustered into sets so as to discover useful patterns (García-Remesal et al 2013).…”
Section: Cyberinfrastructures For Nanomedical Data Management: Curmentioning
confidence: 99%