SummaryThe integration of metabolomics and transcriptomics can provide precise information on gene-to-metabolite networks for identifying the function of unknown genes unless there has been a post-transcriptional modification. Here, we report a comprehensive analysis of the metabolome and transcriptome of Arabidopsis thaliana over-expressing the PAP1 gene encoding an MYB transcription factor, for the identification of novel gene functions involved in flavonoid biosynthesis. For metabolome analysis, we performed flavonoid-targeted analysis by high-performance liquid chromatography-mass spectrometry and non-targeted analysis by Fourier-transform ion-cyclotron mass spectrometry with an ultrahigh-resolution capacity. This combined analysis revealed the specific accumulation of cyanidin and quercetin derivatives, and identified eight novel anthocyanins from an array of putative 1800 metabolites in PAP1 over-expressing plants. The transcriptome analysis of 22 810 genes on a DNA microarray revealed the induction of 38 genes by ectopic PAP1 overexpression. In addition to well-known genes involved in anthocyanin production, several genes with unidentified functions or annotated with putative functions, encoding putative glycosyltransferase, acyltransferase, glutathione S-transferase, sugar transporters and transcription factors, were induced by PAP1. Two putative glycosyltransferase genes (At5g17050 and At4g14090) induced by PAP1 expression were confirmed to encode flavonoid 3-O-glucosyltransferase and anthocyanin 5-O-glucosyltransferase, respectively, from the enzymatic activity of their recombinant proteins in vitro and results of the analysis of anthocyanins in the respective T-DNA-inserted mutants. The functional genomics approach through the integration of metabolomics and transcriptomics presented here provides an innovative means of identifying novel gene functions involved in plant metabolism.
Plant metabolism is a complex set of processes that produce a wide diversity of foods, woods, and medicines. With the genome sequences of Arabidopsis and rice in hands, postgenomics studies integrating all ''omics'' sciences can depict precise pictures of a whole-cellular process. Here, we present, to our knowledge, the first report of investigation for gene-to-metabolite networks regulating sulfur and nitrogen nutrition and secondary metabolism in Arabidopsis, with integration of metabolomics and transcriptomics. Transcriptome and metabolome analyses were carried out, respectively, with DNA macroarray and several chemical analytical methods, including ultra high-resolution Fourier transform-ion cyclotron MS. Mathematical analyses, including principal component analysis and batch-learning self-organizing map analysis of transcriptome and metabolome data suggested the presence of general responses to sulfur and nitrogen deficiencies. In addition, specific responses to either sulfur or nitrogen deficiency were observed in several metabolic pathways: in particular, the genes and metabolites involved in glucosinolate metabolism were shown to be coordinately modulated. Understanding such geneto-metabolite networks in primary and secondary metabolism through integration of transcriptomics and metabolomics can lead to identification of gene function and subsequent improvement of production of useful compounds in plants. P lants produce a huge array of compounds used for foods, medicines, flavors, and industrial materials. These plant metabolites are synthesized and accumulated by the networks of proteins encoded in the genome of each plant. However, even after the completion of the genome sequencing of Arabidopsis (1) and rice (2, 3), function of those genes and networks of gene-to-metabolite are largely unknown. To reveal the function of genes involved in metabolic processes and gene-to-metabolite networks, the metabolomics-based approach is regarded as a direct way (4-7). In particular, integration of comprehensive gene expression profile with targeted metabolite analysis is shown to be an innovative way for identification of gene function for specific product accumulation in plant (8) and microorganisms (9). However, to depict a whole-cellular process of metabolism, integration of comprehensive gene expression analysis (transcriptomics), and nontargeted metabolite profiling (metabolomics) is needed. Bioinformatics designed suitably for data mining helps the integration efficiently.The gene expression profiling can be achieved by DNA array analysis. For metabolomics, a nontargeted, high-throughput analytical system is required. Traditionally, GC-MS has been used to detect Ͼ300 metabolites in plant tissues (5, 6). Fourier transform-ion cyclotron MS (FT-MS) is a system for metabolome analysis in which crude plant extract is introduced by means of direct injection without prior separation of metabolites by chromatography (10). The mass resolution (Ͼ100,000) and accuracy (Ͻ1 ppm) of FT-MS is extremely high; hence, comple...
Since the completion of genome sequences of model organisms, functional identification of unknown genes has become a principal challenge in biology. Postgenomics sciences such as transcriptomics, proteomics, and metabolomics are expected to discover gene functions. This report outlines the elucidation of gene-togene and metabolite-to-gene networks via integration of metabolomics with transcriptomics and presents a strategy for the identification of novel gene functions. Metabolomics and transcriptomics data of Arabidopsis grown under sulfur deficiency were combined and analyzed by batch-learning self-organizing mapping. A group of metabolites/genes regulated by the same mechanism clustered together. The metabolism of glucosinolates was shown to be coordinately regulated. Three uncharacterized putative sulfotransferase genes clustering together with known glucosinolate biosynthesis genes were candidates for involvement in biosynthesis. In vitro enzymatic assays of the recombinant gene products confirmed their functions as desulfoglucosinolate sulfotransferases. Several genes involved in sulfur assimilation clustered with O-acetylserine, which is considered a positive regulator of these genes. The genes involved in anthocyanin biosynthesis clustered with the gene encoding a transcriptional factor that up-regulates specifically anthocyanin biosynthesis genes. These results suggested that regulatory metabolites and transcriptional factor genes can be identified by this approach, based on the assumption that they cluster with the downstream genes they regulate. This strategy is applicable not only to plant but also to other organisms for functional elucidation of unknown genes.In the era of post-genomics, a systematic and comprehensive understanding of the complex events of life is a great concern in biology. The first step in this process is to identify all gene functions and gene-to-gene networks as the components of the system, the whole events of life. The metabolome is the final product of a series of gene actions. Hence, metabolomics has a potential to elucidate gene functions and networks, especially when integrated with transcriptomics. A promising approach is pair-wise transcript-metabolite correlation analysis, which can reveal unexpected correlations and bring to light candidate genes for modifying the metabolite content (1). Gene functions involved in the specific pathway of interest have been identified by the integration of transcript and targeted metabolic profiling in experimental systems where the pathway was activated (2-6). However, up to now, no gene function has been identified by non-targeted analyses of the transcriptome and metabolome. In this report, we analyzed the non-targeted metabolome and transcriptome of a model plant Arabidopsis under sulfur (S) 1 deficiency whose genome sequencing has been completed. Our strategy for integrated analyses using batch-learning-selforganizing mapping (BL-SOM) (7-9) enabled the identification of gene-to-gene and metabolite-to-gene networks and new gene fun...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.