2009
DOI: 10.1093/nar/gkp254
|View full text |Cite
|
Sign up to set email alerts
|

ANNIE: integrated de novo protein sequence annotation

Abstract: Function prediction of proteins with computational sequence analysis requires the use of dozens of prediction tools with a bewildering range of input and output formats. Each of these tools focuses on a narrow aspect and researchers are having difficulty obtaining an integrated picture. ANNIE is the result of years of close interaction between computational biologists and computer scientists and automates an essential part of this sequence analytic process. It brings together over 20 function prediction algori… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
53
0

Year Published

2010
2010
2016
2016

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 49 publications
(53 citation statements)
references
References 52 publications
0
53
0
Order By: Relevance
“…Sequences with a predicted modification site were further analyzed using SignalP (Bendtsen et al 2004) to ensure that they also contain a signal peptide. To check for possible transmembrane domains in the mature proteins, the resulting set was then analyzed with the ANNIE protein sequence annotation and interpretation environment (Ooi et al 2009 …”
Section: Bioinformaticsmentioning
confidence: 99%
“…Sequences with a predicted modification site were further analyzed using SignalP (Bendtsen et al 2004) to ensure that they also contain a signal peptide. To check for possible transmembrane domains in the mature proteins, the resulting set was then analyzed with the ANNIE protein sequence annotation and interpretation environment (Ooi et al 2009 …”
Section: Bioinformaticsmentioning
confidence: 99%
“…Except for this polymorphism, the C-terminal 1,455 AAs (i.e., the sequence of NFAT5a) are identical among all variants. All isoforms but isoform a have the same sequence architecture [20][21][22] (Fig. 1A) including (1) a N-terminal compositionally biased region rich in serine/ threonine (35% in first 76 AAs of NFAT5c) and proline [12% in first 76 AAs of NFAT5c; a most likely non-globular region of variable length TAD1 (transactivation domain 1)], (2) an AED (auxiliary export domain) and a NLS (nuclear localization signal), 15 (3) a RHD (Rel homology domain, PFAM PF00554 and PF01833) 23 mediating both protein-DNA interactions and NFAT5 dimerization 17,24 and (4) a C-terminal low-complexity region (glutamine 21%, serine/threonine 23%) as a TAD2.…”
Section: Resultsmentioning
confidence: 99%
“…For the HMMER 44 search against known Pfam domains, 21 only entries producing hits with E-value 0.001 were accounted as a true hit. In addition to similarity searches to known domains and databases of nucleotide sequences, the analysis of disorder content predicted by IUPred long 45,46 as implemented in the ANNIE/ ANNOTATOR framework 47 was carried out. Sequences that were predicted to have more than 70% of its content to be disordered were counted as a hit.…”
Section: Assessing Characteristic Features Of Proteome Setsmentioning
confidence: 99%