2018
DOI: 10.1186/s40537-018-0123-x

SemLinker: automating big data integration for casual users

Cited by 8 publications (7 citation statements)
References 34 publications
“…During our analysis, we mapped who the authors of the papers references when using a definition for data lakes. We found that James Dixon was the first one to use the term lake in big data context, in a post in its blog in 2010 [20], and he is referenced by ten papers [4], [6], [17], [32], [38], [44], [62], [63], [67], [91]. The first author to reference Dixon's Concept in academic context was O'Leary [63], in a paper published in 2014.…”
Section: Results; citation type: mentioning
confidence: 99%
“…Initial Accepted Scopus 108 53 papers: [1]- [3], [5], [9], [10], [13]- [19], [23]- [29], [31]- [33], [37], [40], [45], [49], [50], [57], [60]- [66], [68], [70], [71], [73], [76]- [78], [81]- [84], [88], [90], [91], [93]- [95] Springer 222 20 papers: [4], [6], [12], [21], [30], [36], [38], [39], [41]- [43], [47], [51], [53], [69], [74], [79], [85], [86], [92] Google Scholar 197 6 papers:...…”
Section: Source; citation type: mentioning
confidence: 99%
“…Although Data Civilizer has a similar scope and objectives to VADA, typically users have a greater involvement with the individual data preparation steps, for example though mapping [39] or workflow [40] construction, so the emphasis is more on supporting developers in creating ETL flows than on the more fully automated approach being explored here. Building instead on semantic web technologies, SemLinker [41] extracts a graph of source data features, which are then aligned with a global ontology. Here the emphasis is on providing a consistent route into the data sets in a personal data lake, using plugins where necessary to provide more specialised processing for particular domains or data types.…”
Section: Discussion; citation type: mentioning
confidence: 99%
“…In this case, the accuracy of the algorithm is of concern [22,23]. [25,26]). In general, researchers are aware of the difficulty of detecting duplicates within incomplete data sets [12,18].…”
Section: Related Work; citation type: mentioning
confidence: 99%