2023
DOI: 10.1101/2023.02.21.529443
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

CryoPPP: A Large Expert-Labelled Cryo-EM Image Dataset for Machine Learning Protein Particle Picking

Abstract: Cryo-electron microscopy (cryo-EM) is currently the most powerful technique for determining the structures of large protein complexes and assemblies. Picking single-protein particles from cryo-EM micrographs (images) is a key step in reconstructing protein structures. However, the widely used template-based particle picking process is labor-intensive and time-consuming. Though the emerging machine learning-based particle picking can potentially automate the process, its development is severely hindered by lack… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
13
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

4
2

Authors

Journals

citations
Cited by 7 publications
(13 citation statements)
references
References 59 publications
0
13
0
Order By: Relevance
“…After CryoSegNet was trained and validated on the training/validation, we blindly benchmarked it on a test dataset consisting of thousands of labeled cryo-EM micrographs of 7 different protein types from the CryoPPP 4 dataset. The particles picked by CryoSegNet were compared with the ground truth coordinates of the expert-labeled particles.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…After CryoSegNet was trained and validated on the training/validation, we blindly benchmarked it on a test dataset consisting of thousands of labeled cryo-EM micrographs of 7 different protein types from the CryoPPP 4 dataset. The particles picked by CryoSegNet were compared with the ground truth coordinates of the expert-labeled particles.…”
Section: Resultsmentioning
confidence: 99%
“…Thus, it is imperative to determine the protein structure for understanding protein function and interaction, studying their roles in the diseases, and accelerating the design of drugs. X-ray crystallography, nuclear magnetic resonance (NMR), and cryo-EM 4,5 are three main experimental techniques to determine protein structures. Among them, cryo-EM is the cutting-edge technique for solving the structure of large protein complexes.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Predicting rare GO terms is analogous to the few-shot learning problems [6] in various domains like computer vision[7, 8, 9], and natural language processing(NLP). For example, in the classification task of named entity typing[10, 11] in NLP, assigning rare entity types to entity names pose a similar challenge, due to the increasing size and granularity of entity types.…”
Section: Introductionmentioning
confidence: 99%