2008
DOI: 10.1186/1471-2105-9-s1-s18
|View full text |Cite
|
Sign up to set email alerts
|

Identification of human-to-human transmissibility factors in PB2 proteins of influenza A by large-scale mutual information analysis

Abstract: Background: The identification of mutations that confer unique properties to a pathogen, such as host range, is of fundamental importance in the fight against disease. This paper describes a novel method for identifying amino acid sites that distinguish specific sets of protein sequences, by comparative analysis of matched alignments. The use of mutual information to identify distinctive residues responsible for functional variants makes this approach highly suitable for analyzing large sets of sequences. To s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
115
0

Year Published

2011
2011
2022
2022

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 99 publications
(116 citation statements)
references
References 45 publications
1
115
0
Order By: Relevance
“…It selects a set of positions whose combination provides optimal discrimination between two groups (e.g., pH1N1 and swine-H1N1 sequences). For comparison we analyzed the same datasets using a mutual-information-based method (29). Reassuringly, the majority of the highly ranked positions presented here were also obtained in that analysis (SI Text S3 and Tables S5 and S6).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…It selects a set of positions whose combination provides optimal discrimination between two groups (e.g., pH1N1 and swine-H1N1 sequences). For comparison we analyzed the same datasets using a mutual-information-based method (29). Reassuringly, the majority of the highly ranked positions presented here were also obtained in that analysis (SI Text S3 and Tables S5 and S6).…”
Section: Discussionmentioning
confidence: 99%
“…As manual inspection of this volume of data is virtually impossible; it is necessary to use computational methods. Entropy (mutual-information)-based methods that use the amino acid frequency in each position are commonly used to this effect (27)(28)(29). These methods consider each position in the alignment separately, disregarding the possible relations between the sequence positions.…”
Section: Discussionmentioning
confidence: 99%
“…In our earlier report, the consensus approach has been used for epitope prediction of H1N1 HA and NA proteins (28). The current study also employed different prediction tools: NetCTL 1.2, SYFPEITHI, and BI-MAS (26,34,39) for class I binding (CD8+ T cell) epitopes and Immune Epitope Database (IEDB)-based SMM align, ProPred and NetMHC II for class II binding (CD4+ T cell) epitopes (32,33,43).…”
Section: Identification Of T Cell Epitopesmentioning
confidence: 99%
“…MUSCLE and AVANA tools were used to obtain the conserved peptide stretches ( ‡90%) (17,27,28). Conservation refers to identical peptide sequences that are present in at least 90% of total M1 protein sequences, and these conserved peptide sequences were used for further epitope prediction.…”
Section: Sequence Retrieval and Conservation Analysismentioning
confidence: 99%
“…Support vector machines, entropy, and mutual information were utilized to uncover the unique molecular features of these viruses [14][15][16][17][18][19][20][21][22]. Most recently, Random Forests [23] were applied successfully to tackle the same problem, where novel host markers in the proteins and genes of pandemic 2009 H1N1 were identified [24,25].…”
Section: Introductionmentioning
confidence: 99%