2024
DOI: 10.20944/preprints202402.0858.v6
Preprint

Detecting CSV File Dialects by Table Uniformity Measurement and Data Type Inference

Wilfredo García

Abstract: The CSV format was devised for human-readable simplicity, and no standard strictly defines it, so many variants of the dialects in which these files are written have proliferated. As a result, exchanging information between data management systems, or between countries and regions, requires human intervention during the data mining and cleansing process. This has led to the development of various computational tools that aim to…
