2022 IEEE 18th International Conference on E-Science (E-Science) 2022
DOI: 10.1109/escience55777.2022.00034
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Versioning of Time Series Datasets: a FAIR Algorithmic Approach

Abstract: MotivationMethodology Datasets Results Conclusions and Future workHow is data provenance defined?

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 17 publications
0
3
0
Order By: Relevance
“…We empirically evaluated our approach using time series data in three common scenarios of data versioning, namely i) data (cells) with missing values, ii) data with the row-wise transformation of values (values expressed as percentages) and data with column-wise transformation (values expressed on a logarithmic scale), and iii) sample size reduction by sub-setting rows. Initial results presented in [19] show that our proposed approach successfully detects different versions of a dataset, for up to 60% of cell changes, the deletion of up to 60% of rows, and column-wise transformation. However, there are minimal similarities detected for row-wise transformations.…”
Section: A Dimensionality Reductionmentioning
confidence: 98%
See 2 more Smart Citations
“…We empirically evaluated our approach using time series data in three common scenarios of data versioning, namely i) data (cells) with missing values, ii) data with the row-wise transformation of values (values expressed as percentages) and data with column-wise transformation (values expressed on a logarithmic scale), and iii) sample size reduction by sub-setting rows. Initial results presented in [19] show that our proposed approach successfully detects different versions of a dataset, for up to 60% of cell changes, the deletion of up to 60% of rows, and column-wise transformation. However, there are minimal similarities detected for row-wise transformations.…”
Section: A Dimensionality Reductionmentioning
confidence: 98%
“…Data provenance and lineage play a key role in fostering reusability. In this context, we propose a novel approach for automatic data versioning with the goal to automatically generate FAIR-compliant provenance metadata [19]. We systematically detect and measure changes in datasets by using dimensionality reduction techniques.…”
Section: A Dimensionality Reductionmentioning
confidence: 99%
See 1 more Smart Citation