Proceedings of the 2003 ACM Symposium on Document Engineering, 2003
DOI: 10.1145/958220.958239

Infty

Abstract: An integrated OCR system for mathematical documents, called INFTY, is presented. INFTY consists of four procedures, i.e., layout analysis, character recognition, structure analysis of mathematical expressions, and manual error correction. In those procedures, several novel techniques are utilized for better recognition performance. Experimental results on about 500 pages of mathematical documents showed high character recognition rates on both mathematical expressions and ordinary texts, and sufficient perform…

Cited by 119 publications (6 citation statements)
References 9 publications
“…In recent years, these systems have evolved rapidly through deep learning. Usually, transformer-based approaches [2][3][4] have proven to outperform traditional statistical models [15,16] and convolutional neural networks [5,6,17,18,19]. These neural networks are able to learn and recognize intricate patterns and features within images automatically, making them particularly well-suited for accurately extracting text with subscripts such as mathematical formulas from scanned documents or images [20].…”
Section: Mathematical Expression Recognition (mentioning)
confidence: 99%
“…However, both MER systems comprise three stages: symbol segmentation, symbol recognition, and 2D structure analysis [4]. Classic approaches, as the Infty system [13], [14] solve these stages separately, whereas end-to-end approaches address them all at once. With recent progress in deep learning, end-to-end approaches with an encoder-decoder structure have become prevalent [15].…”
Section: Related Work (mentioning)
confidence: 99%
“…• The Infty Reader [37] is a mathematical OCR solution intended to remedy the above-mentioned lack of mathematical literature accessible to blind individuals. Although under the right circumstances the tool can produce quite accurate recognition results, the terms produced by it are extremely hard to read because they contain lots of formatting information which will distract the blind reader.…”
Section: Present State (mentioning)
confidence: 99%