2018 IEEE 34th International Conference on Data Engineering (ICDE) 2018
DOI: 10.1109/icde.2018.00020
|View full text |Cite
|
Sign up to set email alerts
|

Towards a Holistic Integration of Spreadsheets with Databases: A Scalable Storage Engine for Presentational Data Management

Abstract: Spreadsheet software is the tool of choice for interactive ad-hoc data management, with adoption by billions of users. However, spreadsheets are not scalable, unlike database systems. On the other hand, database systems, while highly scalable, do not support interactivity as a first-class primitive. We are developing DATASPREAD, to holistically integrate spreadsheets as a frontend interface with databases as a back-end datastore, providing scalability to spreadsheets, and interactivity to databases, an integra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
13
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 13 publications
(14 citation statements)
references
References 27 publications
1
13
0
Order By: Relevance
“…Second, we can apply greedy decomposition instead of dynamic programming. Both these tricks give us a solution that is close to the overall optimal decomposition [13]. Our results show that the recursive-decomposition-based hybrid representation gives us a 2-3× reduction in storage or time to perform formula computation relative to a single table or triples-only representation [13].…”
Section: Spreadsheet Organizationmentioning
confidence: 78%
See 2 more Smart Citations
“…Second, we can apply greedy decomposition instead of dynamic programming. Both these tricks give us a solution that is close to the overall optimal decomposition [13]. Our results show that the recursive-decomposition-based hybrid representation gives us a 2-3× reduction in storage or time to perform formula computation relative to a single table or triples-only representation [13].…”
Section: Spreadsheet Organizationmentioning
confidence: 78%
“…Figure 7 displays one such decomposition, where the spreadsheet is decomposed into red and blue areas; each red area is stored as a separate table, while the blue areas are stored together in a single triples table. Unfortunately, we determined that identifying this optimal decomposition is NP-HARD via a reduction from the minimum edge-length partitioning of rectilinear polygons [13]. So, instead, we opt for a restricted space of decompositions called recursive decompositions that are formed by recursively decomposing areas of the spreadsheet (an analogy is to use scissors to cut a sheet of paper with vertical or horizontal cuts that cut the remaining region entirely).…”
Section: Spreadsheet Organizationmentioning
confidence: 99%
See 1 more Smart Citation
“…Furthermore, not only are scalability issues very real problems plaguing those who want to analyze large amounts of data, but these issues touched on all four areas of actions users take to process data. This motivates the need for developing a spreadsheet application which continues to perform as expected on very large volumes of data [12].…”
Section: Discussionmentioning
confidence: 99%
“…The last class of problems are what we call scalability problems, i.e., situations where users are unable to complete standard operations in a reasonable amount of time when the scale or complexity of the dataset increases [7]. With the increasing availability of large volumes of data in most domains, we suspect that scalability problems are only going to become more common-as an example, our biologist collaborators routinely use spreadsheets, and can now generate large genomic (VCF) files, but are unable to open these files even simply to verify correctness in spreadsheet software [12].…”
Section: Introductionmentioning
confidence: 99%