2019
DOI: 10.1093/gigascience/giz054
|View full text |Cite
|
Sign up to set email alerts
|

Software engineering for scientific big data analysis

Abstract: The increasing complexity of data and analysis methods has created an environment where scientists, who may not have formal training, are finding themselves playing the impromptu role of software engineer. While several resources are available for introducing scientists to the basics of programming, researchers have been left with little guidance on approaches needed to advance to the next level for the development of robust, large-scale data analysis tools that are amenable to integration into workflow manage… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
25
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 28 publications
(25 citation statements)
references
References 24 publications
0
25
0
Order By: Relevance
“…Ultimately, we wrapped each of these components in a command line interface (CLI) such that the results presented in each section of this work can be generated with a corresponding command following the guidelines described by Grüning et al (2019). The scripts for generating the figures in this manuscript are not included in the main pathway_forte, but rather in their own repository within Jupyter notebooks at .…”
Section: Methodsmentioning
confidence: 99%
“…Ultimately, we wrapped each of these components in a command line interface (CLI) such that the results presented in each section of this work can be generated with a corresponding command following the guidelines described by Grüning et al (2019). The scripts for generating the figures in this manuscript are not included in the main pathway_forte, but rather in their own repository within Jupyter notebooks at .…”
Section: Methodsmentioning
confidence: 99%
“…palletsprojects.com). Drug2ways allows users to use the algorithm on a variety of standard network formats (e.g., GraphML, Node-Link, and EdgeList) and is powered by a CLI, following the standard proposed by [38]. The CLI offers all the case scenarios for proposing drug candidates that are presented in the results section.…”
Section: Software and Implementationmentioning
confidence: 99%
“…The package leverages state-of-the-art Python packages such as NetworkX for network analysis (Hagberg et al ., 2008), MPI for parallelization ( https://mpi4py.readthedocs.io/ ), and click for exposing the command line interface (CLI) ( https://click.palletsprojects.com ). Drug2ways allows users to use the algorithm on a variety of standard network formats (e.g., GraphML, Node-Link, and EdgeList) and is powered by a CLI, following the standard proposed by Grüning et al . (2019).…”
Section: Software and Implementationmentioning
confidence: 99%