2015 IEEE/ACM 37th IEEE International Conference on Software Engineering 2015
DOI: 10.1109/icse.2015.129
|View full text |Cite
|
Sign up to set email alerts
|

Enron's Spreadsheets and Related Emails: A Dataset and Analysis

Abstract: Abstract-Spreadsheets are used extensively in business processes around the world and as such, a topic of research interest. Over the past few years, many spreadsheet studies have been performed on the EUSES spreadsheet corpus. While this corpus has served the spreadsheet community well, the spreadsheets it contains are mainly gathered with search engines and as such do not represent spreadsheets used in companies. This paper presents a new dataset, extracted for the Enron Email Archive, containing over 15,000… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
55
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
5
3
1
1

Relationship

2
8

Authors

Journals

citations
Cited by 63 publications
(55 citation statements)
references
References 21 publications
0
55
0
Order By: Relevance
“…It has 4, 498 unique spreadsheets, which are gathered through Google searches using keywords such as nancial and inventory. The ENRON corpus [9] contains over 15, 000 spreadsheets, extracted from the Enron email archive. This corpus is of a particular interest, since it provides access to real-world business spreadsheets used in industry.…”
Section: Dataset Of Annotated Tablesmentioning
confidence: 99%
“…It has 4, 498 unique spreadsheets, which are gathered through Google searches using keywords such as nancial and inventory. The ENRON corpus [9] contains over 15, 000 spreadsheets, extracted from the Enron email archive. This corpus is of a particular interest, since it provides access to real-world business spreadsheets used in industry.…”
Section: Dataset Of Annotated Tablesmentioning
confidence: 99%
“…However it is a large set, the spreadsheets have been collected from practice, and it has been used in several works of spreadsheet research [16]. In his work Jansen [17] shows how the EUSES corpus is also similar to the more recent ENRON corpus [18], which is a collection of spreadsheets obtained from the e-mail archives of Enron Corporation, disclosed during the trials related to its bankruptcy.…”
Section: A Covering Other Approaches Of Metadata Extractionmentioning
confidence: 99%
“…In order to better understand the use of lookup functions, we analyze their use in the Enron corpus, a recently released set of more than 16.000 spreadsheets from the bankrupt company Enron [2]. We are especially interested in learning more about the two different ways in which lookup functions can be applied: for exact matching, where only exactly corresponding results can be returned-often used to combine two worksheets-and the approximate match, where approximate results may be returned, used mainly for simple classification.…”
Section: Introductionmentioning
confidence: 99%