2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop 2009
DOI: 10.1109/bibmw.2009.5332130
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of Weight Matrix Models in the splice junction recognition problem

Abstract: The amount of data produced by the several genomic sequencing projects has increased dramatically in recent years. One of the main goals of bioinformatics is to analyze biological data aiming at identifying genes. The splice junction recognition problem is an important part of the gene detection problem. This work evaluates the performance of two classification models, derived from the Weight Matrix Model, when applied to the splice junction recognition problem. Two splice junction data sets were used in this … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2012
2012
2016
2016

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 22 publications
0
5
0
Order By: Relevance
“…MSC = multi-scale component; Pos = position; APR = adjacent position relationship; SVM = support vector machine; Sn = sensitivity; Sp = specificity; Mcc = Matthew's correlation coefficients. Compared to SVM+B and MM1-SVM from Zhang et al (2010) and MDD/WWAM from Tavares et al (2009), our method gave a better performance. For donor sites, our MSC+Pos+APR model gave the best prediction with an Mcc of 0.922 m, which was 0.068 higher than that of SVM+B and 0.082 higher than that of MDD/WWAM.…”
Section: Parameter Optimization Based On Apr Featuresmentioning
confidence: 99%
See 2 more Smart Citations
“…MSC = multi-scale component; Pos = position; APR = adjacent position relationship; SVM = support vector machine; Sn = sensitivity; Sp = specificity; Mcc = Matthew's correlation coefficients. Compared to SVM+B and MM1-SVM from Zhang et al (2010) and MDD/WWAM from Tavares et al (2009), our method gave a better performance. For donor sites, our MSC+Pos+APR model gave the best prediction with an Mcc of 0.922 m, which was 0.068 higher than that of SVM+B and 0.082 higher than that of MDD/WWAM.…”
Section: Parameter Optimization Based On Apr Featuresmentioning
confidence: 99%
“…For donor sites, the optimal model was the one with integrated MSC, Pos and APR, and its Mcc was 0.922, but for acceptor sites, the optimal model was the one with integrated MSC and Pos, which had an Mcc of 0.887. SVM+B denotes the prediction method using SVM with a Bayes kernel; MM1-SVM is a prediction method that used probabilistic parameters and SVM classifier (Zhang et al, 2010), and MDD/WWAM denotes the method using maximum dependence decomposition and windowed weight array model (Tavares et al, 2009). MSC = multi-scale component; Pos = position; APR = adjacent position relationship; SVM = support vector machine; Sn = sensitivity; Sp = specificity; Mcc = Matthew's correlation coefficients.…”
Section: Parameter Optimization Based On Apr Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…Eukaryotic genes are composed of two types of segments: exons and introns; while the exons are regions of the genetic material that are encoded proteins, on the other hand the introns are non-coding regions that are removed from the primary transcript [3]. The intron-exon boundary is referred to as acceptor splice site and the exon-intron boundary as donor splice site [4].…”
Section: Introductionmentioning
confidence: 99%
“…Position weight matrix (PWM) is a common model for splice site prediction [2], [3], [4]. The varieties of PWMs have been used for splice site prediction such as Weight Array Models [5] and Windowed Weight Array Model [6].…”
Section: Introductionmentioning
confidence: 99%