WikiPathways (https://www.wikipathways.org) is a biological pathway database known for its collaborative nature and open science approaches. With the core idea of the scientific community developing and curating biological knowledge in pathway models, WikiPathways lowers all barriers for accessing and using its content. Increasingly more content creators, initiatives, projects and tools have started using WikiPathways. Central in this growth and increased use of WikiPathways are the various communities that focus on particular subsets of molecular pathways such as for rare diseases and lipid metabolism. Knowledge from published pathway figures helps prioritize pathway development, using optical character and named entity recognition. We show the growth of WikiPathways over the last three years, highlight the new communities and collaborations of pathway authors and curators, and describe various technologies to connect to external resources and initiatives. The road toward a sustainable, community-driven pathway database goes through integration with other resources such as Wikidata and allowing more use, curation and redistribution of WikiPathways content.
Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets.
BackgroundWhereas several engineered nanomaterials are able to incite toxicological effects, the underlying molecular processes are understudied. And the varied physicochemical properties complicate toxicological predictions. Gene expression data allow us to study the cell-specific responses of individual genes, whereas their role in biological processes is harder to interpret. An overrepresentation analysis allows us to identify enriched biological processes and link the experimental data to these, but still prompt broad results which complicates the analysis of detailed toxicological processes. We demonstrated a targeted filtering approach to compare the cell-specific effects of two concentrations of the widely used nanomaterial titanium dioxide (TiO2) -nanobelts.MethodsWe compared public gene expression data generated by Tilton et al. from colon endothelium cells (Caco2), lung endothelium cells (SAE), and monocytic like cells (THP1) after 24-hour exposure to low (10 μg/ml) and high (100 μg/ml) concentrations of TiO2 -nanobelts. We used pathway enrichment analysis of the WikiPathways collection to identify cell and concentration-specific affected pathways. Gene sets from selected Gene Ontology terms (apoptosis, inflammation, DNA damage response and oxidative stress) highlighted pathways with a clear toxicity focus. Finally, pathway-gene networks were created to show the genetic overlap between the altered toxicity-related pathways.ResultsAll cell lines showed more differentially expressed genes after exposure to higher concentration, but our analysis found clear differences in affected molecular processes between the cell lines. Approximately half of the affected pathways are categorized with one of the selected toxicity-related processes. Caco2 cells show resilience to low and high concentrations. SAE cells display some cytotoxic response to the high concentration, while THP1 cells are already strongly affected at a low concentration. The networks show for up- and downregulation for the THP1 cells the most pathways. Additionally, the networks show gene overlap between almost all pathways for all conditions.ConclusionsThe approach allowed us to focus the analysis on affected cytotoxic processes and highlight cell-specific effects. The results showed that Caco2 cells are more resilient to TiO2 -nanobelts exposure compared to SAE cells, while THP1 cells were affected the most. The automated workflow can be easily adapted using other Gene Ontology terms focusing on other biological processes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.