Proceedings of the International Workshop on Semantic Big Data 2019
DOI: 10.1145/3323878.3325802
|View full text |Cite
|
Sign up to set email alerts
|

Parallel RDF generation from heterogeneous big data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

3
5

Authors

Journals

citations
Cited by 18 publications
(24 citation statements)
references
References 6 publications
0
24
0
Order By: Relevance
“…This is achieved by aligning the mapping rules with access-specific definitions for e.g., databases or files, to integrate data from heterogeneous formats, e.g., CSV, XML, JSON. However, we observe that: (i) data velocity is not well supported in mapping languages and corresponding processors, compared to data variety and volume [8,18,20]; and, (ii) the characteristics and destination of the generated knowledge graph remain unexplored.…”
Section: Introductionmentioning
confidence: 99%
“…This is achieved by aligning the mapping rules with access-specific definitions for e.g., databases or files, to integrate data from heterogeneous formats, e.g., CSV, XML, JSON. However, we observe that: (i) data velocity is not well supported in mapping languages and corresponding processors, compared to data variety and volume [8,18,20]; and, (ii) the characteristics and destination of the generated knowledge graph remain unexplored.…”
Section: Introductionmentioning
confidence: 99%
“…Tools and mappings for different formats have been developed (e.g. [25,26,13]) and the mapping languages have been applied to convert data sources in different domains (e.g. [8,18]).…”
Section: Related Workmentioning
confidence: 99%
“…This method helps in correcting the tendency of decision trees to overfit their training set. We use the scikit-learn implementation 13 of the random forest classifier.…”
Section: Datatype Property Classification Modelmentioning
confidence: 99%
“…We selected our in-house RMLStreamer [45] for this component, which is a streaming implementation of the RMLMapper [46], a tool that executes RML (RDF Mapping Language [47]) mappings. Using RML, we can define a mapping from various common input formats, including JSON, XML, or CSV, to a semantic format.…”
Section: Semantic Conversion: Rmlmentioning
confidence: 99%