A K Nearest Classifier design

Prudent, Y.; Ennaji, Abdellatif

doi:10.5565/rev/elcvia.96

Cited by 5 publications

(2 citation statements)

References 17 publications

“…Our aim is rather to understand the role of the parameter values on the behavior of the RF. That is why we have decided to arbitrarily choose a commonly used feature extraction technique based on a greyscale multi-resolution pyramid [14]. We have extracted for each image of our set, 84 greyscale mean values based on four resolution levels of the image, as illustrated in figure 1.…”

Section: Experimental Protocol (mentioning)

confidence: 99%

Using Random Forests for Handwritten Digit Recognition

Bernard

Heutte

Adam

2007

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 2

In the Pattern Recognition field, growing interest has been shown in recent years in Multiple Classifier Systems, particularly Bagging, Boosting and Random Subspaces. Those methods aim at inducing an ensemble of classifiers by producing diversity at different levels. Following this principle, Breiman introduced in 2001 another family of methods called Random Forests. Our work aims at studying those methods in a strictly pragmatic approach, in order to provide rules on parameter settings for practitioners. For that purpose we have experimented with the Forest-RI algorithm, considered the Random Forest reference method, on the MNIST handwritten digits database. In this paper, we describe Random Forest principles and review some methods proposed in the literature. We then present our experimental protocol and results. Finally, we draw some conclusions on the global behavior of Random Forests with respect to their parameter tuning.

“…For instance, Bernard et al. [16] test random forest classifier on MNIST dataset. In this work, the grayscale multi-resolution pyramid method [17] is used as a feature extraction technique. Using the verified data for selecting parameters of random forest classifier, they obtain a success accuracy of 93.27%.…”

Section: Handwritten Digit Recognition Methods (mentioning)

confidence: 99%

ARDIS: a Swedish historical handwritten digit dataset

Kusetoğulları

Yavariabdi

Cheddad

et al. 2019

Neural Comput & Applic

This paper introduces a new image-based handwritten historical digit dataset named Arkiv Digital Sweden (ARDIS). The images in the ARDIS dataset are extracted from 15,000 Swedish church records which were written by different priests with various handwriting styles in the nineteenth and twentieth centuries. The constructed dataset consists of three single-digit datasets and one digit string dataset. The digit string dataset includes 10,000 samples in red-green-blue color space, whereas the other datasets contain 7600 single-digit images in different color spaces. An extensive analysis of machine learning methods on several digit datasets is carried out. Additionally, the correlation between ARDIS and the existing digit datasets Modified National Institute of Standards and Technology (MNIST) and US Postal Service (USPS) is investigated. Experimental results show that machine learning algorithms, including deep learning methods, provide low recognition accuracy, as they face difficulties when trained on existing datasets and tested on the ARDIS dataset. Accordingly, a convolutional neural network trained on MNIST and USPS and tested on ARDIS provides the highest accuracies, 58.80% and 35.44%, respectively. Consequently, the results reveal that machine learning methods trained on existing datasets can have difficulty recognizing digits effectively on our dataset, which indicates that the ARDIS dataset has unique characteristics. This dataset is publicly available for the research community to further advance handwritten digit recognition algorithms.

doi:10.1007/s00521-019-04163-3