Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data 2012
DOI: 10.1145/2213836.2213848
|View full text |Cite
|
Sign up to set email alerts
|

InfoGather

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 180 publications
(20 citation statements)
references
References 15 publications
0
20
0
Order By: Relevance
“…Thus, the input to the process is a table and the corresponding output is an enriched table. [145] identified three core tasks in the augmentation of tables.…”
Section: Tabular Searchmentioning
confidence: 99%
See 1 more Smart Citation
“…Thus, the input to the process is a table and the corresponding output is an enriched table. [145] identified three core tasks in the augmentation of tables.…”
Section: Tabular Searchmentioning
confidence: 99%
“…The values of the most similar table are then used to populate the input table's additional column. The Infogather system [145] uses a similar approach but instead of just calculating the direct similarity between the input table and potential augmenting tables it also takes into the account the neighborhood around the potential augmenting tables. These indirect tables provide ancillary information that can be better suited for augmentation than the tables with the highest similarity to the input tables.…”
Section: Tabular Searchmentioning
confidence: 99%
“…Before feeding the record sets returned by data extraction into a particular application, it is commonly necessary to perform some of the following integration tasks: semantisation [25,45,54,55,60,63,71], which either maps the descriptors onto the terminology box of a particular ontology or the tuples onto its assertion box [19]; union [23], which merges record sets that provide similar data; finding primary keys [62], which determines which components of the tuples identify them as univocally as possible; record linkage [8,11,12], which finds different records that refer to the same actual entities; augmentation [6,52,67], which joins record sets on the same topic to complete the information that they provide individually; and cleaning [10,31,61], which fixes data. Note that the integration tasks are orthogonal to data extraction because they are independent from the source of the record sets, which is the reason why they fall out of the scope of this article.…”
Section: Data-extraction Vocabularymentioning
confidence: 99%
“…In this context, data extraction consists in transforming tables into structured formats that focus on their data and abstract away from how they are displayed. Data extraction has many applications to text mining [24,64,65], data (meta-)search [3,9,18,26,44,51,[63][64][65], query expansion [16], document summarisation [40,64], question answering [1,20,44,46,65], knowledge discovery [9,22,26,32,44,46], knowledge base construction [17,72], knowledge augmentation [1,9,18,20,56,56,57,67], synonym finding [1,3,39], improving accessibility [43,47,49,64,65], textual advertising [15], data compression [2,49], or creating linked data…”
Section: Introductionmentioning
confidence: 99%
“…Table extension and augmentation aims at gathering relational tables that contain the same entities but cover complementary attributes of the entities, and integrate these tables by joining them on the same entities. For example, Yakout et al [38] propose InfoGather for populating a table of entities with their attributes by harvesting related tables on the Web. The users need to either provide the desired attribute names of the entities, or example values of their attributes.…”
Section: General Nlp and Iementioning
confidence: 99%