Background: Mass spectrometry (MS) based label-free protein quantitation has mainly focused on analysis of ion peak heights and peptide spectral counts. Most analyses of tandem mass spectrometry (MS/MS) data begin with an enzymatic digestion of a complex protein mixture to generate smaller peptides that can be separated and identified by an MS/MS instrument. Peptide spectral counting techniques attempt to quantify protein abundance by counting the number of detected tryptic peptides and their corresponding MS spectra. However, spectral counting is confounded by the fact that peptide physicochemical properties severely affect MS detection resulting in each peptide having a different detection probability. Lu et al. (2007) described a modified spectral counting technique, Absolute Protein Expression (APEX), which improves on basic spectral counting methods by including a correction factor for each protein (called O i value) that accounts for variable peptide detection by MS techniques. The technique uses machine learning classification to derive peptide detection probabilities that are used to predict the number of tryptic peptides expected to be detected for one molecule of a particular protein (O i ). This predicted spectral count is compared to the protein's observed MS total spectral count during APEX computation of protein abundances.
Background: 'Fold-change' cutoffs have been widely used in microarray assays to identify genes that are differentially expressed between query and reference samples. More accurate measures of differential expression and effective data-normalization strategies are required to identify highconfidence sets of genes with biologically meaningful changes in transcription. Further, the analysis of a large number of expression profiles is facilitated by a common reference sample, the construction of which must be carefully addressed.
Cardiovascular disorders are influenced by genetic and environmental factors. The TIGR rodent expression web-based resource (TREX) contains over 2,200 microarray hybridizations, involving over 800 animals from 18 different rat strains. These strains comprise genetically diverse parental animals and a panel of chromosomal substitution strains derived by introgressing individual chromosomes from normotensive Brown Norway (BN/NHsdMcwi) rats into the background of Dahl salt sensitive (SS/JrHsdMcwi) rats. The profiles document gene-expression changes in both genders, four tissues (heart, lung, liver, kidney) and two environmental conditions (normoxia, hypoxia). This translates into almost 400 high-quality direct comparisons (not including replicates) and over 100,000 pairwise comparisons. As each individual chromosomal substitution strain represents on average less than a 5% change from the parental genome, consomic strains provide a useful mechanism to dissect complex traits and identify causative genes. We performed a variety of data-mining manipulations on the profiles and used complementary physiological data from the PhysGen resource to demonstrate how TREX can be used by the cardiovascular community for hypothesis generation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.