Proceedings of the 20th ACM International Conference on Information and Knowledge Management 2011
DOI: 10.1145/2063576.2063829

Spreadsheet-based complex data transformation

Abstract: Spreadsheets are used by millions of users as a routine all-purpose data management tool. It is now increasingly necessary for external applications and services to consume spreadsheet data. In this paper, we investigate the problem of transforming spreadsheet data to structured formats required by these applications and services. Unlike prior methods, we propose a novel approach in which transformation logic is embedded into a familiar and expressive spreadsheet-like formula mapping language. Popular transform…

Cited by 26 publications (17 citation statements)
References 11 publications
“…Recent studies [1,2,8,11] attempted to transform spreadsheet data into the relational model, making further integration among spreadsheets possible. Some extraction systems require explicit sheet-specific user-provided rules [2,11], which might yield good results for a single spreadsheet. But they are not feasible for our setting: the corpus is large and users are not aware of the target spreadsheets to be processed ahead of time.…”
Section: Introduction (mentioning)
confidence: 99%
“…The DIOM framework differs significantly from other systems [1,6] by exploiting both explicit and implicit categories of knowledge to ensure the correct extraction and integration of the semantics contained in spreadsheets.…”
Section: Contributions (mentioning)
confidence: 99%
“…Second, the rule-based approach requires the user to explicitly specify the transformation in the form of conversion rules [1]. The approach is flexible in that the rules could be applied to a variety of spreadsheets.…”
Section: Contributions (mentioning)
confidence: 99%