2019
DOI: 10.1109/access.2019.2945825
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Mathematical Expressions in Scientific Document Images Using a U-Net Trained on a Diverse Dataset

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 38 publications
(24 citation statements)
references
References 32 publications
0
24
0
Order By: Relevance
“…In recent years, DNNs have proved the outstanding performance in the recognition and detection of mathematical expressions tasks [27]- [29]. The work in [27] takes the advantages of CNNs in the detection of isolated and inline expressions in document images. A CNN architecture based on the U-net [30] is used for detecting mathematical expressions.…”
Section: B Mathematical Expression Detection 1) Mathematical Expressmentioning
confidence: 99%
See 2 more Smart Citations
“…In recent years, DNNs have proved the outstanding performance in the recognition and detection of mathematical expressions tasks [27]- [29]. The work in [27] takes the advantages of CNNs in the detection of isolated and inline expressions in document images. A CNN architecture based on the U-net [30] is used for detecting mathematical expressions.…”
Section: B Mathematical Expression Detection 1) Mathematical Expressmentioning
confidence: 99%
“…The training and testing datasets are described in Table 3. The number of isolated and inline expressions in each page is described in Figure 9 The second one is GTDB public dataset [27]. It has recently been used for performance evaluation of researches [48].…”
Section: A Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…The method depends on the geometrical and statistical characteristics of symbol images. The authors in Ohyama, Suzuki, and Uchida (2019) used U‐Net framework (Ronneberger, Fischer, and Brox (2015)) to detect mathematical equations in scientific document images. A method Tangent‐CFT for embedding mathematical formulas is presented in Mansouri et al (2019).…”
Section: Related Workmentioning
confidence: 99%
“…Aligning words from OCR and full-text documents is challenging for several reasons. The OCR output contains various types of recognition errors, many of which involve special symbols, Greek letters like µ or sub-and superscript characters and numbers, which are particularly frequent in chemical names, formulae, and measurement units, and which are notoriously difficult for OCR (Ohyama et al, 2019). If the printed paper document is based on PDF, it usually has an explicit page layout, which is different from the way the corresponding full-text XML document is displayed in a web browser.…”
Section: Introductionmentioning
confidence: 99%