Proceedings of the 2017 ACM on Conference on Information and Knowledge Management 2017
DOI: 10.1145/3132847.3132882
|View full text |Cite
|
Sign up to set email alerts
|

Spreadsheet Property Detection With Rule-assisted Active Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
24
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 25 publications
(24 citation statements)
references
References 23 publications
0
24
0
Order By: Relevance
“…There is a considerable number of works tackling layout inference and information extraction in spreadsheets. Recent publications propose approaches involving to some extent machine learning techniques, such as [2], [3], [4], [5], and [6]. Also, we find rule-based approaches, like [7].…”
Section: Related Workmentioning
confidence: 80%
See 1 more Smart Citation
“…There is a considerable number of works tackling layout inference and information extraction in spreadsheets. Recent publications propose approaches involving to some extent machine learning techniques, such as [2], [3], [4], [5], and [6]. Also, we find rule-based approaches, like [7].…”
Section: Related Workmentioning
confidence: 80%
“…While there is some support to perform spreadsheet data extraction, like [1] and [2], it can not be considered a general purpose solution for arbitrary inputs. Previous work often assumes just one table per sheet.…”
Section: Introductionmentioning
confidence: 99%
“…VizNet currently centralizes four corpora of data from the web, open data portals, and online visualization galleries. We plan to expand the VizNet corpus with the 410,554 Microsoft Excel workbook files (1,181,530 sheets) [8] extracted from the ClueWeb09 web crawl 1 . Furthermore, Morton et.…”
Section: Discussionmentioning
confidence: 99%
“…Recent attempts, such as Ideas in Excel and Explore in Google Sheets, aim at providing insights and recommendations to users (e.g., summary statistics and charts), based on background analysis of tabular data in the sheet. Other works [1,3,5,17], including ours [10][11][12][13][14][15], focus on integrating and extracting data from spreadsheets. One of the main concerns comes with data and knowledge being scattered in multiple spreadsheet files.…”
Section: Introductionmentioning
confidence: 99%
“…https://ironpython.net/ 4 http://officeopenxml.com/anatomyofOOXML-xlsx.php5 https://openpyxl.readthedocs.io/en/stable/…”
mentioning
confidence: 99%