Data generated by genome-wide association studies (GWAS) are growing fast with the linkage of biobank samples to health records, and expanding capture of high-dimensional molecular phenotypes. However the utility of these efforts can only be fully realised if their complete results are collected from their heterogeneous sources and formats, harmonised and made programmatically accessible. Here we present the OpenGWAS database, an open source, open access, scalable and high-performance cloud-based data infrastructure that imports and publishes complete GWAS summary datasets and metadata for the scientific community. Our import pipeline harmonises these datasets against dbSNP and the human genome reference sequence, generates summary reports and standardises the format of results and metadata. Users can access the data via a website, an application programming interface, R and Python packages, and also as downloadable files that can be rapidly queried in high performance computing environments. OpenGWAS currently contains 126 billion genetic associations from 14,582 complete GWAS datasets representing a range of different human phenotypes and disease outcomes across different populations. We developed R and Python packages to serve as conduits between these GWAS data sources and a range of available analytical tools, enabling Mendelian randomization, genetic colocalisation analysis, fine mapping, genetic correlation and locus visualisation. OpenGWAS is freely accessible at https://gwas.mrcieu.ac.uk, and has been designed to facilitate integration with third party analytical tools.
<p>We can only fully understand the past, present and future climate changes by bringing together data and process understanding from a broad range of environmental sciences. In theory, climate modelling provides a wealth of data of great interest to a wide variety of disciplines (e.g., chemistry, geology, hydrology), but in practice, the large volume and complexity of these datasets often prevent direct access and therefore limit their benefits for large parts of our community.</p><p>We present the new online platform &#8220;climatearchive.org&#8221; to break down these barriers and provide intuitive and informative access to paleoclimate model data to our community. The current release enables interactive access to a recently published compilation of 109 HadCM3BL climate model simulations. Key climate variables (temperature, precipitation, vegetation and circulation) are displayed on a virtual globe in an intuitive three-dimensional environment and on a continuous time axis throughout the Phanerozoic. The software runs in any web browser &#8212; including smartphones &#8212; and promotes data exploration, appeals to students and generates public interest.</p><p>We also show current work on the next phase of the platform, which aims to develop new tools for integration into a more quantitative research workflow. These include easy online generation and download of maps and time series plots of the underlying monthly model data. The data can also be exported as global fields or CSV files for any user-selected location for further offline analysis, such as use in spreadsheets. Finally, we will discuss and outline future integration of new sources of model and geochemical proxy data to simplify and advance interdisciplinary paleoclimate research.</p>
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.