2020
DOI: 10.3390/genes11060614
|View full text |Cite
|
Sign up to set email alerts
|

Identification of Regulatory SNPs Associated with Vicine and Convicine Content of Vicia faba Based on Genotyping by Sequencing Data Using Deep Learning

Abstract: Faba bean (Vicia faba) is a grain legume, which is globally grown for both human consumption as well as feed for livestock. Despite its agro-ecological importance the usage of Vicia faba is severely hampered by its anti-nutritive seed-compounds vicine and convicine (V+C). The genes responsible for a low V+C content have not yet been identified. In this study, we aim to computationally identify regulatory SNPs (rSNPs), i.e., SNPs in promoter regions of genes that are deemed to govern the V+C content of Vicia fa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
28
0

Year Published

2020
2020
2025
2025

Publication Types

Select...
8

Relationship

2
6

Authors

Journals

citations
Cited by 18 publications
(44 citation statements)
references
References 87 publications
1
28
0
Order By: Relevance
“…However, the selection of the promoter regions is crucial: (i) to avoid the redundancy between sequences which could lead to the overestimation of some TFBSs [ 68 ] (ii) to address the inaccuracy of transcription start site (TSS) positions resulting from their imprecise prediction. To overcome these issues, we followed a similar strategy to those suggested in previous studies [ 7 , 49 , 68 , 69 , 70 , 71 , 72 , 73 , 74 ] and accordingly extracted two sets of promoter sequences for each tissue ranging from bp to bp relative to the TSS using the reference genome version 4.1 and gene annotation given in [ 63 ]. While the first sequence set refers to the promoter sequences of the DEGs (foreground set), the second set contains the promoter sequences of genes having the same GC-content as the foreground set (background set) [ 75 ].…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…However, the selection of the promoter regions is crucial: (i) to avoid the redundancy between sequences which could lead to the overestimation of some TFBSs [ 68 ] (ii) to address the inaccuracy of transcription start site (TSS) positions resulting from their imprecise prediction. To overcome these issues, we followed a similar strategy to those suggested in previous studies [ 7 , 49 , 68 , 69 , 70 , 71 , 72 , 73 , 74 ] and accordingly extracted two sets of promoter sequences for each tissue ranging from bp to bp relative to the TSS using the reference genome version 4.1 and gene annotation given in [ 63 ]. While the first sequence set refers to the promoter sequences of the DEGs (foreground set), the second set contains the promoter sequences of genes having the same GC-content as the foreground set (background set) [ 75 ].…”
Section: Methodsmentioning
confidence: 99%
“…Following the regulatory SNP (rSNP) detection method of Heinrich et al [ 7 ], we selected the SNPs from the genome data which are located in the promoter regions of B. napus genes and analyzed them to detect their impact on the TFBSs. For this purpose, we first extracted the flanking sequence of ±25 bp for each selected SNP resulting in a 51 bp long sequence with the SNP in the central position.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We offer this data here for the benefit of researches into Vicia faba and its V+C content in particular. We have so far successfully used this data for the prediction of regulatory regions in Vicia faba and the identification of regulatory single nucleotide polymorphisms (SNPs) that are associated with V+C content [8]. For this we built a partial genome for Vicia faba from the GBS reads that spanned ∼1 % of the total genome and performed variant calling with it, which resulted in more than 600,000 high quality SNPs.…”
Section: Discussionmentioning
confidence: 99%
“…In recent years, neural networks have acquired great importance due to their high performance in different fields like medical imaging [29][30][31], agriculture [32][33][34][35][36][37], image quality assessment and others. Moreover, neural networks have exhibited great performance in the identification of 6mA sites [38,39], m6A sites [40,41], 4mC sites [23,24,42], functional piRNAs [43], N4-acetylcytidine sites [44], promoters classification [45] and others.…”
Section: Introductionmentioning
confidence: 99%