1995
DOI: 10.1089/cmb.1995.2.557
|View full text |Cite
|
Sign up to set email alerts
|

Challenges in Integrating Biological Data Sources

Abstract: Scientific data of importance to biologists reside in a number of different data sources, such as GenBank, GSDB, SWISS-PROT, EMBL, and OMIM, among many others. Some of these data sources are conventional databases implemented using database management systems (DBMSs) and others are structured files maintained in a number of different formats (e.g., ASN.1 and ACE). In addition, software packages such as sequence analysis packages (e.g., BLAST and FASTA) produce data and can therefore be viewed as data sources. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
79
0
1

Year Published

2005
2005
2012
2012

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 139 publications
(80 citation statements)
references
References 26 publications
0
79
0
1
Order By: Relevance
“…Life Sciences data is characterized by its complexity, its high interrelatedness, its heterogeneity, and by a multitude of naming and identity issues [17], [18], [19]. Graph based models are a natural fit, as they are in many disciplines, to deal with these problems.…”
Section: A the Ondex Data Structure Life Sciences And The Semanticmentioning
confidence: 99%
“…Life Sciences data is characterized by its complexity, its high interrelatedness, its heterogeneity, and by a multitude of naming and identity issues [17], [18], [19]. Graph based models are a natural fit, as they are in many disciplines, to deal with these problems.…”
Section: A the Ondex Data Structure Life Sciences And The Semanticmentioning
confidence: 99%
“…Among these challenges, data complexity is the most challenging problem in our context. Biological data are complex because they are heterogeneous [7], incomplete, uncertain and inconsistent [31]. Despite these characteristics, the expertise of proteomic experiments requires high data quality to make pertinent conclusions.…”
Section: Problems Related To the Use Of Datamentioning
confidence: 99%
“…Data integration in the life science is an ongoing challenge in Bioinformatics; problems arise because standards for data formats, identifiers, common vocabularies and agreed semantics between databases are lacking [5,6]. Data in the life sciences are complex and volatile that, when taken with the issues outlined, makes the necessary integration of life sciences data hard work.…”
Section: Introductionmentioning
confidence: 99%