2024
DOI: 10.1038/s41597-024-03299-9
|View full text |Cite
|
Sign up to set email alerts
|

Cryo2StructData: A Large Labeled Cryo-EM Density Map Dataset for AI-based Modeling of Protein Structures

Nabin Giri,
Liguo Wang,
Jianlin Cheng

Abstract: The advent of single-particle cryo-electron microscopy (cryo-EM) has brought forth a new era of structural biology, enabling the routine determination of large biological molecules and their complexes at atomic resolution. The high-resolution structures of biological macromolecules and their complexes significantly expedite biomedical research and drug discovery. However, automatically and accurately building atomic models from high-resolution cryo-EM density maps is still time-consuming and challenging when t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 42 publications
0
2
0
Order By: Relevance
“…The models were trained as a sequence-to-sequence predictor, utilizing a Transformer-Encoder 17 to capture long-range voxel-voxel dependencies and a skip-connected decoder to combine the extracted features at different encoder layers to classify each voxel. The models were trained using the large Cryo2StructData dataset 21 . Cryo2StructData is a comprehensive labeled dataset of cryo-EM density maps curated specifically for deep learning-based atomic structure modeling in cryo-EM density maps.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The models were trained as a sequence-to-sequence predictor, utilizing a Transformer-Encoder 17 to capture long-range voxel-voxel dependencies and a skip-connected decoder to combine the extracted features at different encoder layers to classify each voxel. The models were trained using the large Cryo2StructData dataset 21 . Cryo2StructData is a comprehensive labeled dataset of cryo-EM density maps curated specifically for deep learning-based atomic structure modeling in cryo-EM density maps.…”
Section: Resultsmentioning
confidence: 99%
“…The dataset used to train and validate Cryo2Struct (Cryo2StructData) is available on the Harvard Dataverse 24 , and the description of the data preparation and labeling process can be found in 21 . The detailed information about the test datasets including the EMD IDs of the density maps and the evaluation scores are provided in two Excel files (Standard_test_data.xlsx for the standard test dataset and Cryo2Struct_test_data.xlsx for the new test dataset) available at 10.7910/DVN/GQCTTD, and the true structures and the structural models built by Cryo2Struct and Phenix for the test density maps are also available at the same website.…”
mentioning
confidence: 99%