Gene set analysis methods are widely used to provide insight into high-throughput gene expression data. There are many gene set analysis methods available. These methods rely on various assumptions and have different requirements, strengths and weaknesses. In this paper, we classify gene set analysis methods based on their components, describe the underlying requirements and assumptions for each class, and provide directions for future research in developing and evaluating gene set analysis methods.
Thirty viral proteomes were examined for amino acid sequence similarity to the human proteome, and, in parallel, a control of 30 sets of human proteins was analyzed for internal human overlapping. We find that all of the analyzed 30 viral proteomes, independently of their structural or pathogenic characteristics, present a high number of pentapeptide overlaps to the human proteome. Among the examined viruses, human T-lymphotropic virus 1, Rubella virus, and hepatitis C virus present the highest number of viral overlaps to the human proteome. The widespread and ample distribution of viral amino acid sequences through the human proteome indicates that viral and human proteins are formed of common peptide backbone units and suggests a fluid compositional chimerism in phylogenetic entities canonically classified distantly as viruses and Homo sapiens. Importantly, the massive viral to human peptide overlapping calls into question the possibility of a direct causal association between virus-host sharing of amino acid sequences and incitement to autoimmune reactions through molecular recognition of common motifs.
Short amino acid motifs, either linear sequences or discontinuous amino acid groupings, can interact with specific protein domains, so exerting a central role in cell adhesion, signal transduction, hormone activity, regulation of transcript expression, enzyme activity, and antigen-antibody interaction. Here, we analyze the literature for such critical short amino acid motifs to determine the minimal peptide length involved in biologically important interactions. We report the pentapeptide unit as a common minimal amino acid sequence critically involved in peptide-protein interaction and immune recognition. The present survey may have implications in defining the dimensional module for peptide-based therapeutical approaches such as the development of novel antibiotics, enzyme inhibitors/activators, mimetic agonists/antagonists of neuropeptides, thrombolitic agents, specific anti-viral agents, etc. In such a therapeutical context, it is of considerable interest that low molecular weight peptides can easily cross biological barriers, are less susceptible to protease attacks, and can be administered at high concentrations. In addition, small peptides are a rational target for strategies aimed at antigen-specific immunotherapeutical intervention. As an example, specific short peptide fragments might be used to elicit antibodies capable of reacting with the full-length proteins containing the peptide fragment's amino acid sequence, so abolishing the risk of cross-reactivity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.