High-throughput technologies were employed over the past decade to study the expression profiles of cells and tissues. There are large collections of accumulated data from public databases and numerous research articles were published on these data. In the current study, we performed meta-analysis on the gene expression data from human liver and kidney tissues produced from five different technologies: EST, SAGE, MPSS, microarray, and RNA-Seq. We found RNA-Seq was the most sensitive in the number of genes it detected while SAGE and MPSS were the least sensitive. For the genes detected by all the platforms, there were generally good correlations to the measured expression levels of corresponding genes. We further compared detected genes to liver/kidney proteomics data from the Human Protein Atlas, and found 960 of the 8764 genes only detected by RNA-Seq were validated by proteomics results. In conclusion, RNA-Seq is a more sensitive and consistent technology compared to the other four high-throughput platforms, though their results are in general agreement. Average coverage was determined to be the preferred measurement to represent gene expression levels by RNA-Seq data and will be used in future works.
high-throughput sequencing, tissue transcriptome, comparative analysis
Citation:Ping J, Wang Y J, Yu Y, et al. A comparative analysis of tissue gene expression data from high-throughput studies.