2021
DOI: 10.1093/gigascience/giab058
|View full text |Cite
|
Sign up to set email alerts
|

Scalable analysis of multi-modal biomedical data

Abstract: Background Targeted diagnosis and treatment options are dependent on insights drawn from multi-modal analysis of large-scale biomedical datasets. Advances in genomics sequencing, image processing, and medical data management have supported data collection and management within medical institutions. These efforts have produced large-scale datasets and have enabled integrative analyses that provide a more thorough look of the impact of a disease on the underlying system. The integration of larg… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 45 publications
0
2
0
Order By: Relevance
“…We designed the relational model to represent alignments and pileup function results as proposed in Sun et al ., 2018 and Smith et al ., 2021. Our package provides both SQL (Structured Query Language) and Dataframe programming interfaces for the Scala and Python (https://github.com/biodatageeks/pysequila) languages.…”
Section: Methodsmentioning
confidence: 96%
See 1 more Smart Citation
“…We designed the relational model to represent alignments and pileup function results as proposed in Sun et al ., 2018 and Smith et al ., 2021. Our package provides both SQL (Structured Query Language) and Dataframe programming interfaces for the Scala and Python (https://github.com/biodatageeks/pysequila) languages.…”
Section: Methodsmentioning
confidence: 96%
“…We used its three extension points: (i) SQL Analyzer -to register new table-valued functions, (ii) Planner -to add our optimized execution strategies for pileup calculations, and (iii) Logical Optimizer -to detect CreateDataSourceTableAsSelectCommand and InsertIntoHadoopFsRelationCommand actions and apply optimizations for direct vectorized writes into the Optimized Row Columnar (ORC) files (Figure 2). We designed the relational model to represent alignments and pileup function results as proposed in Sun et al, 2018 andSmith et al, 2021. Our package provides both SQL (Structured Query Language) and Dataframe programming interfaces for the Scala and Python (https://github. com/biodatageeks/pysequila) languages.…”
Section: Technical Designmentioning
confidence: 99%