2006
DOI: 10.1186/1471-2105-7-495
|View full text |Cite
|
Sign up to set email alerts
|

DWARF – a data warehouse system for analyzing protein families

Abstract: Background: The emerging field of integrative bioinformatics provides the tools to organize and systematically analyze vast amounts of highly diverse biological data and thus allows to gain a novel understanding of complex biological systems. The data warehouse DWARF applies integrative bioinformatics approaches to the analysis of large protein families.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
26
0

Year Published

2007
2007
2016
2016

Publication Types

Select...
7
2

Relationship

3
6

Authors

Journals

citations
Cited by 59 publications
(26 citation statements)
references
References 32 publications
0
26
0
Order By: Relevance
“…Sequence data is extracted from the NCBI protein database by BLAST (basic local alignment search tool) searches starting from as et of seed sequences. [40] Model-derived information for sequence data comprises hierarchical sorting of individual se- ChemBioChem 2016ChemBioChem , 17,2093ChemBioChem -2098 www.chembiochem.org quences into proteins and homologous families by the USEARCHc lustering algorithm, [41] sequence alignments generated by Clustal Omega, [42] the generation of family-specifics equencep rofiles by using hidden Markov models, [43] assignment of standard position numbers, [44] links to literature entries, [45] and the annotation of structurally or functionally relevant positions. Additionally,s tructure data can be used for homology modeling.…”
Section: Resultsmentioning
confidence: 99%
“…Sequence data is extracted from the NCBI protein database by BLAST (basic local alignment search tool) searches starting from as et of seed sequences. [40] Model-derived information for sequence data comprises hierarchical sorting of individual se- ChemBioChem 2016ChemBioChem , 17,2093ChemBioChem -2098 www.chembiochem.org quences into proteins and homologous families by the USEARCHc lustering algorithm, [41] sequence alignments generated by Clustal Omega, [42] the generation of family-specifics equencep rofiles by using hidden Markov models, [43] assignment of standard position numbers, [44] links to literature entries, [45] and the annotation of structurally or functionally relevant positions. Additionally,s tructure data can be used for homology modeling.…”
Section: Resultsmentioning
confidence: 99%
“…In comparison to the previous release [1] with 4322 protein and 167 structure entries this update represents a significant increase in data volume. By comparing CALA to representative structures from all superfamilies, a structure from the deacetylase superfamily was found to be most similar to the structure of CALA.…”
Section: Resultsmentioning
confidence: 99%
“…Esterases/lipases belong to the family of ␣/␤ hydrolases which share a highly conserved catalytic triad: a serine, histidine, and an aspartic acid or glutamic acid. A consensus signature sequence containing the active site serine is usually characterized by a conserved pentapeptide GXSXG motif [5,6]. In general, esterases have the ability to hydrolyze ester substrates with short-chain fatty acids (≤C10), whereas lipases hydrolyze long-chain fatty acids (≥C10) [7].…”
Section: Introductionmentioning
confidence: 99%