2023
DOI: 10.21203/rs.3.rs-2888654/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Advancements in data extraction from natural history collections: automatic label extraction from specimen images using OCR and NER

Atsuko Takano,
Theodor C. H. Cole,
Hajime Konagai

Abstract: Digital extraction of label data from natural history specimens along with more efficient procedures of data entry will become essential for documentation and global information availability in the near future. Herbarium collections have made great advances in this direction lately. In this study, using optical character recognition (OCR) and named entity recognition (NER) techniques, we have been able to almost automatically extract label data from herbarium specimen images. This system can be developed and r… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 14 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?