2018
DOI: 10.1007/978-3-030-03667-6_11
|View full text |Cite
|
Sign up to set email alerts
|

Making Sense of Numerical Data - Semantic Labelling of Web Tables

Abstract: With the increasing amount of structured data on the web the need to understand and support search over this emerging data space is growing. Adding semantics to structured data can help address existing challenges in data discovery, as it facilitates understanding the values in their context. While there are approaches on how to lift structured data to semantic web formats to enrich it and facilitate discovery, most work to date focuses on textual fields rather than numerical data. In this paper, we propose a … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2
2
1

Relationship

2
3

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 19 publications
0
8
0
Order By: Relevance
“…For instance, [102] deal with the problem that the underlying dataset cannot be exposed, but good summaries may help the user undertake the task of data access. Meanwhile, [93] use annotations to help support searching over data types and entities within a dataset, while [73] provide better labeling for numerical data in tables.…”
Section: Data Handlingmentioning
confidence: 99%
See 1 more Smart Citation
“…For instance, [102] deal with the problem that the underlying dataset cannot be exposed, but good summaries may help the user undertake the task of data access. Meanwhile, [93] use annotations to help support searching over data types and entities within a dataset, while [73] provide better labeling for numerical data in tables.…”
Section: Data Handlingmentioning
confidence: 99%
“…Then, the Linked Data cycle can then be applied to these descriptions, ultimately enabling the querying of datasets. The main challenge is the generation and maintenance of these descriptions, with some works tackling the problem of extracting specific properties from specific formats, like [104] for extracting spatio-temporal properties, and [74] for identifying the numerical properties in CSV tables.…”
Section: Entity-centric Search Building Blocksmentioning
confidence: 99%
“…Another option is to use a profiling tool such as Loupe [22]. For table generation with numeric columns, refer to [18].…”
Section: Profilingmentioning
confidence: 99%
“…Vocabulary re-use is limited Our results suggest that publishers are having issues finding, using and/or aligning to vocabularies: from the different ways of assigning values to dct:format, through the default parameters of Socrata's csvtordf conversion, to the low usage of the vocabularies defined by the EC. Follow-up actions Considering that lifting from tabular formats appears to be the best way to move forward, EDP will study the feasibility of applying recent research methods in this area [8,1] to find alignments to tabular headers. This could be done by intermediaries such as the EDP, by portals or the publishers themselves.…”
Section: Lessons Learned and Implicationsmentioning
confidence: 99%
“…Linked Data and the 5-star scheme are at the core of the open data strategy of the European Commission (EC), described in the Public Sector Information (PSI) Directive. This includes investment in the development and promotion of: metadata specifications such as DCAT-AP 5 to describe datasets; catalogs of resources with persistent URIs; 6 , a data portal to host EC data; 7 as well as the European Data Portal (EDP), 8 which provides a single point of access, search and exploration of open government data by various European public institutions. In November 2015, the EDP has started to harvest metadata from all national portals of the 28 EU countries and associated countries, the EC data portal, and a set of other sources such as geospatial portals.…”
Section: Introductionmentioning
confidence: 99%