2007
DOI: 10.1007/s10439-007-9317-7

Large-Scale Optimization-Based Classification Models in Medicine and Biology

Abstract: We present novel optimization-based classification models that are general purpose and suitable for developing predictive rules for large heterogeneous biological and medical data sets. Our predictive model simultaneously incorporates (1) the ability to classify any number of distinct groups; (2) the ability to incorporate heterogeneous types of attributes as input; (3) a high-dimensional data transformation that eliminates noise and errors in biological data; (4) the ability to incorporate constraint…

Cited by 60 publications (39 citation statements)
References 67 publications
“…Since 1996, Lee and her medical colleagues have explored and demonstrated the capability of DAMIP in classifying various types of data arising from real biological and medical problems. DAMIP has been able to consistently maximize the correct classification rate (80%-100% correct rates were obtained) while satisfying pre-set limits on inter-group misclassifications (Gallagher et al. 1996, 1997; Feltus et al. 2003, 2006; Lee et al. 2002, 2004; Lee 2007a, 2007b; Lee and Wu 2007). In these real applications, beyond reporting the tenfold cross-validation results, the resulting classification rule was also blind tested against new data of unknown group identity and resulted in remarkable rates of correct prediction.…”
Section: Estimating the Anderson Optimal Rule via a Mixed Integer Program (mentioning)
confidence: 86%
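The citing section refers to estimating Anderson's optimal classification rule via a mixed integer program. As a rough schematic of the constraint structure described in this excerpt (maximize correct classifications subject to pre-set inter-group misclassification limits), one can write a simplified model of the following form; the notation is ours and is not the paper's exact DAMIP formulation, which additionally handles reserved judgment and group-conditional probability terms:

\begin{aligned}
\max_{y,\,z} \quad & \sum_{i=1}^{n} y_i && \text{(number of correctly classified training entities)}\\
\text{s.t.} \quad & y_i + \sum_{k \neq g(i)} z_{ik} \le 1 && \text{for all entities } i \text{ (at most one assigned group)}\\
& \sum_{i:\, g(i)=h} z_{ik} \le \alpha_{hk}\, n_h && \text{for all group pairs } h \neq k \text{ (pre-set misclassification limit)}\\
& y_i,\ z_{ik} \in \{0,1\},
\end{aligned}

where $g(i)$ is the true group of entity $i$, $n_h$ is the size of group $h$, and $\alpha_{hk}$ caps the fraction of group-$h$ entities assigned to group $k$.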
“…DAMIP consistently returned superior classification rates compared to other methods (80-100% correct classification rates) while satisfying pre-set limits on inter-group misclassifications (Gallagher et al. 1996, 1997; Feltus et al. 2003, 2006; Lee et al. 2002, 2004; Lee 2007a, 2007b; Lee and Wu 2007). In these applications, the DAMIP model was solved using a specialized solver, MIPSOL, developed by Lee.…”
Section: Performance on Real-World Data (mentioning)
confidence: 99%
“…This approach enabled us to identify 131 differentially methylated gene promoters. In stage 2, a binary particle swarm optimization (PSO) algorithm combined with DAMIP was used to identify genes that displayed changes in promoter methylation (11,12,24,34). We reported the results with 100% 10-fold cross validation accuracy for both DCM and NF groups.…”
Section: Methods (mentioning)
confidence: 99%
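As a concrete illustration of the two-stage idea in this excerpt, a binary PSO wrapper that selects a small feature (gene) subset which a classifier then evaluates, here is a minimal sketch. It is a stand-in under stated assumptions: subsets are scored with an ordinary logistic-regression cross-validation accuracy rather than the DAMIP mixed-integer model used by the cited authors, and the names subset_score and binary_pso are ours, purely illustrative.

# Sketch of binary PSO for feature (gene) selection, in the spirit of the
# two-stage approach described above. Hypothetical stand-in: subsets are
# scored with logistic-regression cross-validation accuracy instead of DAMIP.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score


def subset_score(X, y, mask):
    """Cross-validated accuracy of a classifier restricted to the selected features."""
    if mask.sum() == 0:
        return 0.0
    clf = LogisticRegression(max_iter=1000)
    return cross_val_score(clf, X[:, mask.astype(bool)], y, cv=5).mean()


def binary_pso(X, y, n_particles=20, n_iters=50, w=0.7, c1=1.5, c2=1.5, seed=0):
    rng = np.random.default_rng(seed)
    n_features = X.shape[1]
    # Random initial feature subsets (binary positions) and real-valued velocities.
    pos = rng.integers(0, 2, size=(n_particles, n_features))
    vel = rng.normal(0.0, 0.1, size=(n_particles, n_features))
    pbest = pos.copy()
    pbest_score = np.array([subset_score(X, y, p) for p in pos])
    gbest = pbest[pbest_score.argmax()].copy()
    gbest_score = pbest_score.max()

    for _ in range(n_iters):
        r1 = rng.random((n_particles, n_features))
        r2 = rng.random((n_particles, n_features))
        # Standard velocity update toward personal and global bests.
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        # Binary position update: sigmoid of velocity gives the bit-on probability.
        prob = 1.0 / (1.0 + np.exp(-vel))
        pos = (rng.random((n_particles, n_features)) < prob).astype(int)

        scores = np.array([subset_score(X, y, p) for p in pos])
        improved = scores > pbest_score
        pbest[improved] = pos[improved]
        pbest_score[improved] = scores[improved]
        if pbest_score.max() > gbest_score:
            gbest_score = pbest_score.max()
            gbest = pbest[pbest_score.argmax()].copy()

    return gbest, gbest_score

Calling binary_pso(X, y) on a data matrix X (samples by gene promoters) with group labels y returns the best binary feature mask found and its cross-validated score.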
“…There has recently been considerable interest in solving instances of (1) of large dimensions (i.e., large values of n), motivated by the emergence of applications in bio-computing, web and data mining (Lee 2007; Nasiri et al. 2009). Many current optimization techniques can efficiently solve instances of (1) of small to moderate dimension, involving up to 50 variables.…”
Section: Introduction (mentioning)
confidence: 99%