2012 International Conference on Frontiers in Handwriting Recognition 2012
DOI: 10.1109/icfhr.2012.273
|View full text |Cite
|
Sign up to set email alerts
|

Statistical Machine Translation as a Language Model for Handwriting Recognition

Abstract: When performing handwriting recognition on natural language text, the use of a word-level language model (LM) is known to significantly improve recognition accuracy. The most common type of language model, the n-gram model, decomposes sentences into short, overlapping chunks.In this paper, we propose a new type of language model which we use in addition to the standard n-gram LM. Our new model uses the likelihood score from a statistical machine translation system as a reranking feature. In general terms, we a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 11 publications
0
5
0
Order By: Relevance
“…Furthermore, in certain document collections, several languages are used, sometimes even in the same paragraph. Corpora for transcription systems for contemporary texts usually contain millions of words gathered from various sources [21,22], which we can not provide for the bootstrapping of handwriting recognition for the document collections in Monk.…”
Section: Discussionmentioning
confidence: 99%
“…Furthermore, in certain document collections, several languages are used, sometimes even in the same paragraph. Corpora for transcription systems for contemporary texts usually contain millions of words gathered from various sources [21,22], which we can not provide for the bootstrapping of handwriting recognition for the document collections in Monk.…”
Section: Discussionmentioning
confidence: 99%
“…Highly advanced Artificial Intelligence (AI), along with its associated sub-disciplines like machine learn-ing (ML), paved the way for a wide range of real-world applications in sectors such as visual recognition systems, language understanding, automated translation techniques, and robotic innovations [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18]. In recent years, the most prominent sub-field of machine learning (ML), Artificial Neural Networks (ANNs), have been widely and effectively utilized to transform technological advancements and elevate both businesses and daily life towards a new stage of AI sophistication [19][20][21][22][23][24][25].…”
Section: Generalmentioning
confidence: 99%
“…OCR is utilised to recognise languages of different origins not only in Roman script [42] but also in the case of degraded language documents. Another use of OCR to recognise handwriting is presented in [43], where the authors used the n-gram model, which produces groups of words that overlay on each other. The authors extended such model and utilised the likelihood score, from a statistical machine translation system, as main feature, so as to be able to capture the image of the words and to translate each image into other languages.…”
Section: Optical Character Recognitionmentioning
confidence: 99%