Methods of combining multiple classifiers and their applications to handwriting recognition

Xu, Lei; Krzyżak, Adam; Suen, Ching Y.

doi:10.1109/21.155943

Cited by 1,838 publications

(1,028 citation statements)

References 20 publications

Supporting

Mentioning

1,000

Contrasting

Unclassified

Order By: Relevance

“…Let θ j be an l × l matrix. Each element θ j (t′, t) describes the probability that rater j labels a voxel with t ′ when the true label is t. This matrix is similar to the normalized confusion matrix of a Bayesian classifier (Xu et al, 1992), and we will use this terminology for the remainder of the paper. Let θ= [θ 1 , …, θ r ] be the unknown set of all confusion matrices characterizing all r raters.…”

Section: Multi-label Staple Algorithmmentioning

confidence: 99%

On evaluating brain tissue classifiers without a ground truth

et al. 2007

View full text Add to dashboard Cite

In this paper, we present a set of techniques for the evaluation of brain tissue classifiers on a large data set of MR images of the head. Due to the difficulty of establishing a gold standard for this type of data, we focus our attention on methods which do not require a ground truth, but instead rely on a common agreement principle. Three different techniques are presented: the Williams' index, a measure of common agreement; STAPLE, an Expectation Maximization algorithm which simultaneously estimates performance parameters and constructs an estimated reference standard; and Multidimensional Scaling, a visualization technique to explore similarity data. We apply these different evaluation methodologies to a set eleven different segmentation algorithms on forty MR images. We then validate our evaluation pipeline by building a ground truth based on human expert tracings. The evaluations with and without a ground truth are compared. Our findings show that comparing classifiers without a gold standard can provide a lot of interesting information. In particular, outliers can be easily detected, strongly consistent or highly variable techniques can be readily discriminated, and the overall similarity between different techniques can be assessed. On the other hand, we also find that some information present in the expert segmentations is not captured by the automatic classifiers, suggesting that common agreement alone may not be sufficient for a precise performance evaluation of brain tissue classifiers.

show abstract

Section: Multi-label Staple Algorithmmentioning

confidence: 99%

On evaluating brain tissue classifiers without a ground truth

et al. 2007

View full text Add to dashboard Cite

show abstract

“…This approach can be refined assigning different weights to each classifier to optimize the performance of the combined classifier on the training set [86], or, assuming mutual independence between classifiers, a Bayesian decision rule selects the class with the highest posterior probability computed through the estimated class conditional probabilities and the Bayes' formula [130,122]. A Bayesian approach has also been used in Consensus based classification of multisource remote sensing data [10,9,19], outperforming conventional multivariate methods for classification.…”

Section: Non-generative Ensemblesmentioning

confidence: 99%

“…Feature subspace methods performed by partitioning the set of features, where each subset is used by one classifier in the team, are proposed in [130,99,18]. Other methods for combining different feature sets using genetic algorithms are proposed in [81,79].…”

Section: Generative Ensemblesmentioning

confidence: 99%

“…In this last decade one of the main research areas in machine learning has been represented by methods for constructing ensembles of learning machines. Although in the literature [86,129,130,69,61,23,33,12,7,37] a plethora of terms, such as committee, classifier fusion, combination, aggregation and others are used to indicate sets of learning machines that work together to solve a machine learning problem, in this paper we shall use the term ensemble in its widest meaning, in order to include the whole range of combining methods. This variety of terms and specifications reflects the absence of an unified theory on ensemble methods and the youngness of this research area.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Ensembles of Learning Machines

2002

View full text Add to dashboard Cite

Abstract. Ensembles of learning machines constitute one of the main current directions in machine learning research, and have been applied to a wide range of real problems. Despite of the absence of an unified theory on ensembles, there are many theoretical reasons for combining multiple learners, and an empirical evidence of the effectiveness of this approach. In this paper we present a brief overview of ensemble methods, explaining the main reasons why they are able to outperform any single classifier within the ensemble, and proposing a taxonomy based on the main ways base classifiers can be generated or combined together.

show abstract

“…A large number of strategies have been proposed for this purpose, see for example [5][6][7][8][9]. Since the aggregation problem also occurs in all other decomposition methods and in ensemble methods, these research areas as well provide a large number of aggregation strategies (sometimes called classifier combination schemes); see for example [10] and references therein. However, since the semantics of these problems are different, we note that the aggregation strategies from different fields are not always interchangeable.…”

Section: Introductionmentioning

confidence: 99%

Combining predictions in pairwise classification: An optimal adaptive voting strategy and its relation to weighted voting

Hüllermeier

Vanderlooy

2010

Pattern Recognition

113

View full text Add to dashboard Cite

Weighted voting is the commonly used strategy for combining predictions in pairwise classification. Even though it shows good classification performance in practice, it is often criticized for lacking a sound theoretical justification. In this paper, we study the problem of combining predictions within a formal framework of label ranking and, under some model assumptions, derive a generalized voting strategy in which predictions are properly adapted according to the strengths of the corresponding base classifiers. We call this strategy adaptive voting and show that it is optimal in the sense of yielding a MAP prediction of the class label of a test instance. Moreover, we offer a theoretical justification for weighted voting by showing that it yields a good approximation of the optimal adaptive voting prediction. This result is further corroborated by empirical evidence from experiments with real and synthetic data sets showing that, even though adaptive voting is sometimes able to achieve consistent improvements, weighted voting is in general quite competitive, all the more in cases where the aforementioned model assumptions underlying adaptive voting are not met. In this sense, weighted voting appears to be a more robust aggregation strategy.

show abstract

Methods of combining multiple classifiers and their applications to handwriting recognition

Cited by 1,838 publications

References 20 publications

On evaluating brain tissue classifiers without a ground truth

On evaluating brain tissue classifiers without a ground truth

Ensembles of Learning Machines

Combining predictions in pairwise classification: An optimal adaptive voting strategy and its relation to weighted voting

Contact Info

Product

Resources

About