2013
DOI: 10.1093/pcp/pct041

Systematization of the Protein Sequence Diversity in Enzymes Related to Secondary Metabolic Pathways in Plants, in the Context of Big Data Biology Inspired by the KNApSAcK Motorcycle Database

Abstract: Biology is increasingly becoming a data-intensive science with the recent progress of the omics fields, e.g. genomics, transcriptomics, proteomics and metabolomics. The species-metabolite relationship database, KNApSAcK Core, has been widely utilized and cited in metabolomics research, and chronological analysis of that research work has helped to reveal recent trends in metabolomics research. To meet the needs of these trends, the KNApSAcK database has been extended by incorporating a secondary metabolic path…

Cited by 19 publications (11 citation statements)
References 143 publications
“…The combined variance explained by the first two principal components is 30.02 (Figure 6 left) percent, whereas the variance explained by the first two components in binary encoded set is 14.84 percent (Figure 6 right). Triterpenoid synthases can be clearly distinguished from the other categories, which was also consistent with our previous findings [30]. …”
Section: Results
Citation type: supporting
confidence: 93%
“…Species‐metabolite information is accumulated in the KNApSAcK Core DB that has been utilized extensively in omics science 4,7. Metabolic reactions in enzymes are also accumulated in Motorcycle DB 4. Activity‐species and activity‐metabolite relations are accumulated in Biological Activity DB and Metabolite Activity DB, respectively.…”
Section: Results
Citation type: mentioning
confidence: 99%
“…To attain this purpose, we have developed KNApSAcK Family Databases (DBs), which are utilized in a number of researches in metabolomics. A review of the KNApSAcK DB utilization in scientific work is presented by Ikeda et al 4. Data has been collected in the KNApSAcK database in order to facilitate the comprehensive understanding of the medicinal usage of plants based on traditional as well as modern knowledge of healthy cuisine ingredients and metabolomics 4–8…”
Section: Introduction
Citation type: mentioning
confidence: 99%
“…This work further used SOM to examine codon usage heterogeneity in the E. coli O157 genome, which contains “O157-unique segments” (O-islands), and showed that SOM is a powerful tool for characterization of horizontally transferred genes. Another example of the application of BL-SOM is the investigation of the enzyme sequence diversity related to secondary metabolism [64]. Initially, a map was constructed by using a big data matrix that consisted of the frequencies of all possible dipeptides in the protein sequence segments of plants and bacteria.…”
Section: Multivariate Analysis In Systems Biology
Citation type: mentioning
confidence: 99%
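
The last citation statement describes a data matrix built from the frequencies of all possible dipeptides in protein sequence segments. As a rough illustration of that input only, the sketch below builds such a matrix in Python. It is not the cited authors' implementation. It assumes the 20 standard amino acids (giving 400 dipeptides), overlapping dipeptide windows, and normalization to relative frequencies. The names `dipeptide_frequencies` and `frequency_matrix` are hypothetical. Training the BL-SOM on the matrix is not shown.

```python
from itertools import product

# Assumption: only the 20 standard amino acids are counted.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All ordered pairs of amino acids: 20 * 20 = 400 dipeptides.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def dipeptide_frequencies(segment):
    """Return the relative frequency of each dipeptide in one segment.

    Overlapping windows of length 2 are counted. Windows containing a
    non-standard symbol (e.g. 'X') are skipped.
    """
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    seq = segment.upper()
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        # No valid dipeptide found: return an all-zero vector.
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


def frequency_matrix(segments):
    """Stack one 400-dimensional frequency vector per segment (row)."""
    return [dipeptide_frequencies(s) for s in segments]
```

Each row of the resulting matrix is one sequence segment and each column one dipeptide, which is the general shape of input a self-organizing map would be trained on.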