Proceedings of the Web Conference 2020 2020
DOI: 10.1145/3366423.3380205
|View full text |Cite
|
Sign up to set email alerts
|

Novel Entity Discovery from Web Tables

Abstract: When working with any sort of knowledge base (KB) one has to make sure it is as complete and also as up-to-date as possible. Both tasks are non-trivial as they require recall-oriented efforts to determine which entities and relationships are missing from the KB. As such they require a significant amount of labor. Tables on the Web on the other hand are abundant and have the distinct potential to assist with these tasks. In particular, we can leverage the content in such tables to discover new entities, propert… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
40
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 37 publications
(41 citation statements)
references
References 29 publications
1
40
0
Order By: Relevance
“…The ODE layer in the generator transforms ⊕ , the concatenation of a noisy vector and a condition vector , into another latent vector ′ that will be fed into the generator (See Section 3.3). reasons, many web-oriented researchers focus on various tasks on tabular data [10,12,27,30,32,45,56,59,62,63]. In this work, generating realistic synthetic tabular data is of our utmost interest.…”
Section: ) (C)mentioning
confidence: 99%
“…The ODE layer in the generator transforms ⊕ , the concatenation of a noisy vector and a condition vector , into another latent vector ′ that will be fed into the generator (See Section 3.3). reasons, many web-oriented researchers focus on various tasks on tabular data [10,12,27,30,32,45,56,59,62,63]. In this work, generating realistic synthetic tabular data is of our utmost interest.…”
Section: ) (C)mentioning
confidence: 99%
“…We follow FAIR (Findable, Accessible, Interoperable and Reusable) guidelines to release our contributions. 14 We release our dataset in Zenodo (DOI: https://doi.org/10.5281/zenodo. 3840646), in such a way that researchers in the community can benefit from this.…”
Section: Availability and Long-term Planmentioning
confidence: 99%
“…A recent study reported that many of the approaches tested on such datasets are focused on "obviously linkable" cells [14], showing that a tool like T2K [10] manages to match only 2.85% of a large corpus of Web tables to DBpedia. However, the performance of T2K evaluated on T2D relatively to the CEA task is very high (F1: 0.82, Precision: 0.90, Recall: 0.76).…”
Section: Introductionmentioning
confidence: 99%
“…Additionally, knowledge graphs are typically incomplete -entities and types mentioned in tables may not always exist in the knowledge graph. Therefore, it becomes necessary to expand the knowledge graph with new entities (Zhang et al, 2020) and types for annotating tables.…”
Section: Introductionmentioning
confidence: 99%
“…More recent approaches using embeddings (Gentile et al, 2017;Zhang and Balog, 2018;Zhang et al, 2019;Chen et al, 2019;Yin et al, 2020) only partly capture the syntactic structure of tables, and also ignore the structure of the knowledge graph. The problem of incompleteness of the knowledge representation (Zhang et al, 2020) is mostly not addressed.…”
Section: Introductionmentioning
confidence: 99%