2024
DOI: 10.1101/2024.04.23.590800
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Genotype Representation Graphs: Enabling Efficient Analysis of Biobank-Scale Data

Drew DeHaas,
Ziqing Pan,
Xinzhu Wei

Abstract: Computational analysis of a large number of genomes requires a data structure that can represent the dataset compactly while also enabling efficient operations on variants and samples. Current practice is to store large-scale genetic polymorphism data using tabular data structures and file formats, where rows and columns represent samples and genetic variants. However, encoding genetic data in such formats has become unsustainable. For example, the UK Biobank polymorphism data of 200,000 phased whole genomes h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 73 publications
0
0
0
Order By: Relevance