2018
DOI: 10.1007/978-3-030-00794-2_33
|View full text |Cite
|
Sign up to set email alerts
|

Recognition of OCR Invoice Metadata Block Types

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 10 publications
0
7
0
Order By: Relevance
“…This work mainly uses words, the smallest unit in document layout, to extract features. [37] OCRMiner system designed to extract the indexing metadata of structured documents obtained from an image scanning process and OCR.…”
Section: Natural Language Processing (Nlp)mentioning
confidence: 99%
“…This work mainly uses words, the smallest unit in document layout, to extract features. [37] OCRMiner system designed to extract the indexing metadata of structured documents obtained from an image scanning process and OCR.…”
Section: Natural Language Processing (Nlp)mentioning
confidence: 99%
“…The next closest research area focuses on information extraction from invoices. The area is connected with optical character recognition (OCR) and copy machine producers (Ha et al, 2018). Besides textual data, other layout and visual features are extracted from a document (Ha, 2017; Ha et al, 2018; Palm et al, 2017.…”
Section: Related Workmentioning
confidence: 99%
“…There are studies aimed at invoice detection among other business documents (Ha, 2017), or the detection of invoice type (handwritten, printed, receipt) (Tarawneh et al, 2019). The identification of the right blocks on an invoice that include the price field by a combination of visual and text features has reached an accu racy of 80% (Ha et al, 2018), and F1 of 84% with the use of recurrent neural networks (Palm et al, 2017).…”
Section: Related Workmentioning
confidence: 99%
“…The EoIs that assigned to the decoders are Ticket Number (TCN), (Starting Station (SS), Train Number (TAN), Destination Station (DS)), Date (DT), (Ticket Rates (TR), Seat Category (SC)), and Name (NM). In passport scenario, we design five decoders to cover seven EoIs, decoding steps of each decoder are 25, 5,15,35,35. The EoIs assigned to the decoders are Passport Number, Name, (Gender, Birth Date), Birth Place, (Issue Place, Expiry Date).…”
Section: B Experiments Settingmentioning
confidence: 99%
“…"China" or "USA" for the entity "Country", "Jack" or "Rose" for the entity "Name". Previous approaches [3]- [5] mainly adopt two steps, in which text information is extracted firstly via OCR (Optical Character Recognition), and then EoIs are extracted by handcrafted rules or layout analysis.…”
Section: Introductionmentioning
confidence: 99%