2017
DOI: 10.1016/j.patcog.2016.12.026
|View full text |Cite
|
Sign up to set email alerts
|

Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
66
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 159 publications
(66 citation statements)
references
References 60 publications
0
66
0
Order By: Relevance
“…In Table 3, we provide the comparison between ACE loss and previous methods. It is evident that the proposed ACE loss function exhibits higher performance than previous methods, including MDLSTM-based models [34,47], HMM-based model [10], and over-segmentation methods [27,44,45,48] with and without language model (LM). Compared to scene text recognition, handwritten Chinese text recognition problem possesses its unique challenges, such as large character set (7357 classes) and charactertouching problem.…”
Section: Resultsmentioning
confidence: 95%
See 1 more Smart Citation
“…In Table 3, we provide the comparison between ACE loss and previous methods. It is evident that the proposed ACE loss function exhibits higher performance than previous methods, including MDLSTM-based models [34,47], HMM-based model [10], and over-segmentation methods [27,44,45,48] with and without language model (LM). Compared to scene text recognition, handwritten Chinese text recognition problem possesses its unique challenges, such as large character set (7357 classes) and charactertouching problem.…”
Section: Resultsmentioning
confidence: 95%
“…For 1D prediction problems, the topmost feature maps of the network are collapsed across the vertical dimension to generate 1D prediction [5] because characters in the original images are generally distributed sequentially. Typical examples are regular scene text recognition [38,54], online/offline handwritten text recognition [12,34,48], and speech recognition [14,2]. For 2D prediction problems, characters in the input image are dis- tributed in a specific spatial structure.…”
Section: Introductionmentioning
confidence: 99%
“…Although the confusion among the 7360 classes is higher, Table IX shows an overall comparison of our proposed method and other state-of-the-art methods without/with a language model on the ICDAR 2013 competition set. we list the state-of-theart oversegmentation method heterogeneous CNN [7], CNNs-RNNLM [8] and the segmentationfree method SMDLSTM-CTC [15], CNN-ACE [16] in Table IX for comparison. With the same configuration of vocabulary size (4 more garbage classes adopted in our HMM system), the proposed WCNN-PHMM yielded the best performance whether a language model was employed or not.…”
Section: ) Visualization Analysis For Writer Codementioning
confidence: 99%
“…In general, the research efforts for offline HCTR can be divided into two categories: oversegmentationbased approaches and segmentation-free approaches. The former approaches [5], [6], [7], [8] often build several modules by first including character oversegmentation, character classification, and modeling the linguistic and geometric contexts, and then incorporating them to calculate the score for path search. The recent work in [8], with the neural network language model, adopted three different CNN models to replace the conventional character classifier, segmentation and geometric models to achieve the best performance of oversegmentation-based methods on the ICDAR 2013 competition dataset [9].…”
Section: Introductionmentioning
confidence: 99%
“…Unlike the existing methods that usually employ generic (category-level) human detectors, our approach targets on assigning each moving person a specific tracker to reduce ambiguities in complex scenes. Additionally, modern advances in the development of deep feature representation learning [1,2,3] for object appearance have created new opportunities for MPT methods, which par- tially motivate us to learn instance-level object representations by deep neural nets. Therefore, we develop a multibranch neural network (MBN) that dynamically learns instance-level representations of tracked persons at a low cost, which facilitates robustly online data association for multiple target tracking and thus gives birth to our INstance-Aware Representation Learning and Association (INARLA) framework.…”
Section: Introductionmentioning
confidence: 99%