<p>We face unprecedented environmental challenges as a species, that threaten our existing way of life.&#160; We are still learning to understand our planet, although we have a good idea how it works.&#160; The speed of research needs to accelerate to provide information to decision makers, to better respond to our societal challenges.&#160; To do this we need to move towards leveraging large datasets to speed up research, as proposed by Jim Grey in &#8216;The Fourth Paradigm&#8217;. In the world of research infrastructures we need to provide a means for scientists to access vast amounts of research data from multiple data sources in an easy and efficient way.&#160; EOSC is addressing this but we are only scratching the surface when it comes to unleashing the full potential of the scientific community.&#160; Datacubes have recently emerged as a technology in the Environmental and Earth system domain to store imagery data in a way that makes it easier and quicker for scientists to perform their research.&#160; But with the scales of data volumes that are being considered, there are many challenges to curating, hosting, and funding this information in a centralised centre.&#160; Our proposal seeks to leverage the existing National Research and Education (NRENs) infrastructures to store national repositories of regional Environmental and Earth system domain data, for this to be shared with scientists in an open, federated but secure way, conforming to FAIR principles.&#160; This would provide levels of redundancy, data sovereignty and scalability for hosting global environmental datasets in an exascale world.</p>
<div class="page" title="Page 1"> <div class="layoutArea"> <div class="column"> <p>VESPA (Virtual European Solar and Planetary Access, Erard et al. EPSC2020-190, 2020) is a network of interoperable data services covering all fields of Solar System Sciences. It is a mature project, developed within EUROPLANET-FP7 and EUROPLANET-2020-RI. The latter ended in Aug. 2019. It is further supported under the EUROPLANET-2024-RI project (started in Feb. 2020).</p> <p>The VESPA data providers are using a standard API (based on the Table Access Protocol of IVOA (International Virtual Observatory Alliance) and EPNcore, a common dictionary of metadata developed by the VESPA team). The VESPA services consist in searchable metadata tables, with links (URLs) to science data products (files, web-services...). The VESPA metadata includes relevant keywords for scientific data discovery, such as data coverage (temporal, spectral, spatial...), data content (physical parameters, processing level...), data origin (observatory, instrument, publisher...) or data access (format, URL, size...). VESPA hence provides a unified data discovery service for Solar System Sciences.</p> <p>The architecture of the VESPA network is distributed (the metadata tables are hosted and maintained by the VESPA providers), but it is not redundant. The hosting and maintenance of VESPA provider's servers has proved to be a single point failure for small teams with little IT support. The VESPA-Cloud project with EOSC-Hub will greatly facilitate the sustainability of data sharing from small teams as well as teams, whose institutions have restrictive firewall policies (like labs hosted by space agencies, e.g., DLR in Germany). Most of the VESPA data provider are using the same server software, namely DaCHS (Data Centre Helper Suite), developed by the Heidelberg team included in the project.</p> <p>VESPA-Cloud provides a cloud-hosted facility to host VESPA compliant metadata tables in a controlled and maintained software environment. The VESPA providers will focus on the&#160;science application configuration, whereas the VESPA core team will support them with the maintenance of the deployed instances. The development of the VESPA provider&#8217;s data service will be done using a git versioning system (github or institute gitlab).<br />An instance of the VESPA query interface portal will also be implemented on EOSC-hub provided virtual machine.</p> <p>The community AAI (Authorization and Authentication Infrastructure) is provided by G&#201;ANT, through its eduTEAMS service. In the context of EOSC-hub, the EGI Federation is providing virtual machine services from IN2P3 and CESNET while data storage and registry services will be provided by EUDAT. &#160; &#160;</p> <p>In the course of the VESPA-Cloud project, we will implement in the DaCHS framework cloud-storage API connectors (such as Amazon S3, iRODS, etc.) to read data in the cloud and ingesting metadata. Since DacHS is used worldwide by many datacenters to share astronomical and solar system data collections, many teams will benefit from this development.&#160;</p> <p><em>The Europlanet-2024 Research Infrastructure project&#160;has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 871149.&#160;</em><em>This work used the EGI Infrastructure with the dedicated support of IN2P3-IRES and CESNET-MCC. The eduTEAMS Service is made possible via funding from the European Union's Horizon 2020 research and innovation programme under Grant Agreement No. 856726 (GN4-3).</em></p> </div> </div> </div>
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.