Semantic typing of linked geoprocessing workflows

Scheider, Simon; Ballatore, Andrea

doi:10.1080/17538947.2017.1305457

Cited by 25 publications

(32 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Fostering reproducibility across contexts would require adapting both data inputs and tools. A prerequisite for doing so is that nodes and links in a workflow are semantically typed, so that tools and data of equivalent, similar or even different type can be substituted [40]. In analogous fashion, ontology design patterns were proposed as a cure to limited re-usability of modeling solutions, as they can be adapted to particular applications by filling in open slots [90].…”

Section: A Research Road Mapmentioning

confidence: 99%

“…This requires abstracting from particular tools and data sets and describing data analysis on a conceptual level [40], which would allow partial adaption of workflows across computing environments. It would also allow analysts to focus on the questions they want to answer and the methods to answer them, instead of the software and data formats needed for computation [41,42].…”

Section: The Curse Of Modularity: Arguments From Reusability Of Analysismentioning

confidence: 99%

See 1 more Smart Citation

Why good data analysts need to be critical synthesists. Determining the role of semantics in data analysis

Scheider¹,

Ostermann²,

Adams

2017

Future Generation Computer Systems

Self Cite

View full text Add to dashboard Cite

In this article, we critically examine the role of semantic technology in data driven analysis. We explain why learning from data is more than just analyzing data, including also a number of essential synthetic parts that suggest a revision of George Box's model of data analysis in statistics. We review arguments from statistical learning under uncertainty, workflow reproducibility, as well as from philosophy of science, and propose an alternative, synthetic learning model that takes into account semantic conflicts, observation, biased model and data selection, as well as interpretation into background knowledge. The model highlights and clarifies the different roles that semantic technology may have in fostering reproduction and reuse of data analysis across communities of practice under the conditions of informational uncertainty. We also investigate the role of semantic technology in current analysis and workflow tools, compare it with the requirements of our model, and conclude with a roadmap of 8 challenging research problems which currently seem largely unaddressed.

show abstract

Section: A Research Road Mapmentioning

confidence: 99%

Section: The Curse Of Modularity: Arguments From Reusability Of Analysismentioning

confidence: 99%

Why good data analysts need to be critical synthesists. Determining the role of semantics in data analysis

Scheider¹,

Ostermann²,

Adams

2017

Future Generation Computer Systems

Self Cite

View full text Add to dashboard Cite

show abstract

“…The task of assembling geoprocessing workflows is central to any GIS. Sharing and integrating models over the web can help organizations to save labor and computational resources by reusing methods and data (Scheider & Ballatore, 2018), thus promoting modeling research (Nativi et al, 2013).…”

Section: Introductionmentioning

confidence: 99%

A provenance metadata model integrating ISO geospatial lineage and the OGC WPS: Conceptual model and implementation

Closa

Masó

Zabala

et al. 2019

Transactions in GIS

View full text Add to dashboard Cite

Nowadays, there are still some gaps in the description of provenance metadata. These gaps prevent the capture of comprehensive provenance, useful for reuse and reproducibility. In addition, the lack of automated tools for capturing provenance hinders the broad generation and compilation of provenance information. This work presents a provenance engine (PE) that captures and represents provenance information using a combination of the Web Processing Service (WPS) standard and the ISO 19115 geospatial lineage model. The PE, developed within the MiraMon GIS & RS software, automatically records detailed information about sources and processes. The PE also includes a metadata editor that shows a graphical representation of the provenance and allows users to complement provenance information by adding missing processes or deleting redundant process steps or sources, thus building a consistent geospatial workflow. One use case is presented to demonstrate the usefulness and effectiveness of the PE: the generation of a radiometric pseudo‐invariant areas bench for the Iberian Peninsula. This remote‐sensing use case shows how provenance can be automatically captured, also in a non‐sequential complex flow, and its essential role in the automation and replication tasks in work with very large amounts of geospatial data.

show abstract

“…One of the application examples is the extraction of potential glacier loss and glacier gain from land cover data. Scheider and Ballatore (2018) contribute a method and a tool to semantically annotate geoprocessing workflows for successful search, interpretation and reuse of workflows. In their paper 'Semantic typing of linked geoprocessing workflows', they propose vocabularies for basic geodata data types, geoprocessing operations and relations which are used to describe the geoprocessing provenance of a data set.…”

Section: Innovation In Geoprocessing For a Digital Earthmentioning

confidence: 99%

“…Several papers show the progress in a better formalisation, management and usage of geoprocessing semantics, including semantics of the processing input, output, operations, workflows and provenance (Scheider and Ballatore 2018;Stasch et al 2018;Sudmanns et al 2018;Wiemann, Karrasch, and Bernard 2018). Especially, these papers demonstrate that and how semantics of data and operations can be exploited to support users to more efficiently generate information and to increase reusability of existing workflows.…”

mentioning

confidence: 99%