2018
DOI: 10.1155/2018/6747098
|View full text |Cite
|
Sign up to set email alerts
|

Handwritten Bangla Character Recognition Using the State-of-the-Art Deep Convolutional Neural Networks

Abstract: In spite of advances in object recognition technology, handwritten Bangla character recognition (HBCR) remains largely unsolved due to the presence of many ambiguous handwritten characters and excessively cursive Bangla handwritings. Even many advanced existing methods do not lead to satisfactory performance in practice that related to HBCR. In this paper, a set of the state-of-the-art deep convolutional neural networks (DCNNs) is discussed and their performance on the application of HBCR is systematically eva… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
36
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 85 publications
(36 citation statements)
references
References 65 publications
0
36
0
Order By: Relevance
“…There is a good survey on DL approaches for image processing and computer vision related tasks, including image classification, segmentation, and detection [102]. For examples, single image super-resolution using CNN method [103], image denoising using block-matching CNN [104], photo aesthetic assessment using A-Lamp (Adaptive Layout-Aware Multi-Patch Deep CNN) [105], DCNN for hyperspectral imaging segmentation [106], image registration [107], fast artistic style transfer [108], image background segmentation using DCNN [109], handwritten character recognition [110], optical image classification [111], crop mapping using high-resolution satellite imagery [112], object recognition with cellular simultaneous recurrent networks and CNN [113]. The DL approaches are massively applied to human activity recognition tasks and achieved state-of-the-art performance compared to exiting approaches [114][115][116][117][118][119].…”
Section: Image Processing and Computer Visionmentioning
confidence: 99%
“…There is a good survey on DL approaches for image processing and computer vision related tasks, including image classification, segmentation, and detection [102]. For examples, single image super-resolution using CNN method [103], image denoising using block-matching CNN [104], photo aesthetic assessment using A-Lamp (Adaptive Layout-Aware Multi-Patch Deep CNN) [105], DCNN for hyperspectral imaging segmentation [106], image registration [107], fast artistic style transfer [108], image background segmentation using DCNN [109], handwritten character recognition [110], optical image classification [111], crop mapping using high-resolution satellite imagery [112], object recognition with cellular simultaneous recurrent networks and CNN [113]. The DL approaches are massively applied to human activity recognition tasks and achieved state-of-the-art performance compared to exiting approaches [114][115][116][117][118][119].…”
Section: Image Processing and Computer Visionmentioning
confidence: 99%
“…This model of CNN has been used extensively in some image recognition problems since it was introduced in 2014 by [25]. The authors of [32] have then explored its performance for Bengali handwritten character recognition. In our model, we have incorporated some extra layers, like batch normalization, zero padding, dropout and a favorable selection of the number of convolution layers, max-pooling layer, and the number of filters that boosted the classification performance by a significant margin.…”
Section: Discussionmentioning
confidence: 99%
“…Simonyan and Zisserman [25] in 2014 introduced a model of CNN to investigate the effect of convolutional depth on its accuracy in large-scale image recognition problem and named it VGG according to the name of their research group: Visual Geometry Group. Two special architectures of VGG, VGG-11 [31] and VGG-16 [32] were used for image segmentation and handwritten Bengali characters recognition, respectively. The architecture of VGG-11 consists of 8 convolutional layers and 3 fully connected layers followed by a single Softmax layer, and the architecture of VGG-16 consists of 13 convolutional layers and 3 fully connected layers followed by a single Softmax layer.…”
Section: Vggmentioning
confidence: 99%
“…A deep convolution neural network (DCNN) is a multilayer neural network that is performed as a deep supervised learning method [15]. The DCNN has achieved excellent performances on image recognition tasks for the last few years [7,8,16,17]. It can perform both feature extraction and image classification tasks [15].…”
Section: Dcnn Classifiermentioning
confidence: 99%