2017
DOI: 10.1016/j.patcog.2017.06.017
|View full text |Cite
|
Sign up to set email alerts
|

Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
132
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
4

Relationship

4
5

Authors

Journals

citations
Cited by 228 publications
(132 citation statements)
references
References 23 publications
0
132
0
Order By: Relevance
“…Hence the coverage vector contains the information of alignment history. We append the coverage vector in the attention model so that the decoder is capable to know which part of input image has been attended or not [13], [26]. Let n denote the attention dimension and q denote the number of output channels of convolution function Q; then ν att ∈ R n , U s ∈ R n ×n , U a ∈ R n ×C and U f ∈ R n ×q .…”
Section: B Decodermentioning
confidence: 99%
“…Hence the coverage vector contains the information of alignment history. We append the coverage vector in the attention model so that the decoder is capable to know which part of input image has been attended or not [13], [26]. Let n denote the attention dimension and q denote the number of output channels of convolution function Q; then ν att ∈ R n , U s ∈ R n ×n , U a ∈ R n ×C and U f ∈ R n ×q .…”
Section: B Decodermentioning
confidence: 99%
“…Besides, [30] adopted residual connection in encoder and a transition probability matrix in decoder. As for the offline approach, [11] utilized a WAP model, which adopted CNNbased encoder to extract features from static images. [12] proposed a coarse-tofine attention to improve efficiency.…”
Section: Attention Based Encoder-decoder Approaches For Hmermentioning
confidence: 99%
“…For online HMER, [9,10] treat the handwritten mathematical expression (HME) as a point sequence and extract point-level features from input traces. While for offline HMER, [11,12] take the HME as a static image and extract pixel-level features from the input image. Benefiting from rich dynamic (spatial and temporal) information which is extremely helpful for handwritten recognition, online HMER tends to meet fewer difficulties caused by ambiguous handwriting.…”
Section: Introductionmentioning
confidence: 99%
“…As for the unseen 16,079 Chinese characters, we choose 2,000 characters as the validation set and 14,079 characters as the testing set. We also employ an ensemble method during testing procedure [13] because the performances vary severely due to the small training set. We illustrate the performance in Fig.…”
Section: Experiments On Recognition Of Unseen Charactersmentioning
confidence: 99%