2020
DOI: 10.3390/ijgi9020119
|View full text |Cite
|
Sign up to set email alerts
|

Geoweaver: Advanced Cyberinfrastructure for Managing Hybrid Geoscientific AI Workflows

Abstract: AI (artificial intelligence)-based analysis of geospatial data has gained a lot of attention. Geospatial datasets are multi-dimensional; have spatiotemporal context; exist in disparate formats; and require sophisticated AI workflows that include not only the AI algorithm training and testing, but also data preprocessing and result post-processing. This complexity poses a huge challenge when it comes to full-stack AI workflow management, as researchers often use an assortment of time-intensive manual operations… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
4

Relationship

3
6

Authors

Journals

citations
Cited by 28 publications
(10 citation statements)
references
References 88 publications
(94 reference statements)
0
10
0
Order By: Relevance
“…An advanced workflow management system is used to automate the tedious large dataset processing, to enhance time efficiency and reduce scientific reproducibility issues [13], [16], [24]. We implemented the DNN crop mapping workflow in Geoweaver-an open-source web-based workflow system [25]- [27], to help us integrate distributed resources, automate the pre/postprocessing workflows and track the provenance of the final crop maps [25]. Training DNN models is a back-and-forth process and requires a lot of skills and expertise.…”
Section: Introductionmentioning
confidence: 99%
“…An advanced workflow management system is used to automate the tedious large dataset processing, to enhance time efficiency and reduce scientific reproducibility issues [13], [16], [24]. We implemented the DNN crop mapping workflow in Geoweaver-an open-source web-based workflow system [25]- [27], to help us integrate distributed resources, automate the pre/postprocessing workflows and track the provenance of the final crop maps [25]. Training DNN models is a back-and-forth process and requires a lot of skills and expertise.…”
Section: Introductionmentioning
confidence: 99%
“…First, GIScience should encourage more practical demonstrations of multiuser provenance interchange that accelerate R&R (N€ ust and Pebesma 2020; Wilson et al 2020), regardless of whether the software environments are open source or commercial, workstation or cloud based, or code versus graphic block programming (Tullis et al 2015;Tullis et al 2019). Much work remains in how to compress, store, and otherwise manage provenance records throughout their life cycle (Sun et al 2020), issues that are continually emerging in discussions on GIScience workflows (OGC 2020). Even though the W3C created the powerful PROV data model (PROV-DM; Moreau and Missier 2013) almost a decade ago, GIScience applications of PROV have been very rare (Tullis et al 2019).…”
Section: Current Practice and Recommendations For Innovation Provenancementioning
confidence: 99%
“…Even though the W3C created the powerful PROV data model (PROV-DM; Moreau and Missier 2013) almost a decade ago, GIScience applications of PROV have been very rare (Tullis et al 2019). Successful, practical demonstrations of artificial intelligence and hybrid GIScience workflow management (e.g., Sun et al 2020) challenge software engineering teams to accelerate support for provenance interchange and workflow standards and specifications from authoritative sources such as W3C and OGC.…”
Section: Current Practice and Recommendations For Innovation Provenancementioning
confidence: 99%
“…A large portion of the costs in reusing existing services is charged by this step. The common processes to unify the datasets include reprojection, resampling, regridding, reformatting, mosaic, merging, getting location reports, etc [48,49]. Most web services do all the preprocessing work.…”
Section: Data Extraction and Fusion Submodulementioning
confidence: 99%