2020
DOI: 10.5808/gi.2020.18.3.e33
|View full text |Cite
|
Sign up to set email alerts
|

Organizing an in-class hackathon to correct PDF-to-text conversion errors of Genomics & Informatics 1.0

Abstract: This paper describes a community effort to improve earlier versions of the full-text corpus of Genomics & Informatics by semi-automatically detecting and correcting PDF-to-text conversion errors and optical character recognition errors during the first hackathon of Genomics & Informatics Annotation Hackathon (GIAH) event. Extracting text from multi-column biomedical documents such as Genomics & Informatics is known to be notoriously difficult. The hackathon was piloted as part of a coding competition of the EL… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 8 publications
(8 reference statements)
0
0
0
Order By: Relevance