BackgroundThe development of genetically modified crops (GM) includes the discovery of candidate genes through bioinformatics analysis using genomics data, gene expression, and others. Proteins of unknown function (PUFs) are interesting targets for GM crops breeding pipelines for the novelty associated to such targets and also to avoid copyright protections. One method of inferring the putative function of PUFs is by relating them to factors of interest such as abiotic stresses using orthology and co-expression networks, in a guilt-by-association manner.
ResultsIn this regard, we have downloaded, analyzed, and processed genomics data of 53 angiosperms, totaling 1,862,010 genes and 2,332,974 RNA. Diamond and InterproScan were used to discover 72,266 PUFs for all organisms. RNA-seq datasets related to abiotic stresses were downloaded from NCBI/GEO. The RNA-seq data was used as input to the LSTrAP software to construct co-expression networks. LSTrAP also created clusters of transcripts with correlated expression, whose members are more probably related to the molecular mechanisms associated to abiotic stresses in the plants.Orthologous groups were created (OrhtoMCL) using all 2,332,974 proteins in order to associate PUFs to abiotic stress related clusters of co-expression and therefore infer their function in a guilt-byassociation manner.
ConclusionA freely available web resource named "Plant Co-expression Annotation Resource" (https://www.machado.cnptia.embrapa.br/plantannot), Plantannot, was created to provide indexed queries to search for PUF putatively associated to abiotic stresses. The web interface also allows browsing, querying and retrieving of public genomics data from 53 plants. We hope Plantannot to be useful for researchers trying to obtain novel GM crops resistant to climate change hazards.
BACKGROUNDIn the last decades, the ability to genetically engineer plants with success showed the potential to create genetically modified (GM) crops with favourable economic outcomes [1]. As well, in the last decades, the main achievements in this area were genetically improved plants tolerant to herbicide and resistant to insects. Others, like nutritional composition improvements are ongoing [2]. Furthermore, new mechanisms for genome editing are improving the accuracy and speed of genome modifications in plants, such as the CRISPR/CAS system [3,4].Regarding climate change and environmental factors, plants are being genetically modified to become resilient to abiotic stresses, such as drought, high temperature, rising atmospheric CO2, to potentially overcome the yield losses due to these factors [5,6].Intellectual property rights (IPR) are vastly used by biotechnology enterprises for their GM plants, to allow exclusive rights and provide better returns for the high investments in research and development [7]. In this way, over the last years many patents applications for genetically improved crops regarding stress tolerance were filled [8].