2022
DOI: 10.1093/gigascience/giac037
|View full text |Cite
|
Sign up to set email alerts
|

NuCLS: A scalable crowdsourcing approach and dataset for nucleus classification and segmentation in breast cancer

Abstract: Background Deep learning enables accurate high-resolution mapping of cells and tissue structures that can serve as the foundation of interpretable machine-learning models for computational pathology. However, generating adequate labels for these structures is a critical barrier, given the time and effort required from pathologists. Results This article describes a novel collaborative framework for engaging crowds of medical s… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
43
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 60 publications
(44 citation statements)
references
References 57 publications
(69 reference statements)
1
43
0
Order By: Relevance
“…b, Annotated areas for each tissue and c, the number of annotated cells for each cell type in SegPath. Those in publicly available datasets (BCSS 13 , GlaS 15 , CoNIC 5 , MoNuSAC2020 14 , and NuCLS 7 ) are also shown. Organs in brackets represent the target organs of the dataset.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…b, Annotated areas for each tissue and c, the number of annotated cells for each cell type in SegPath. Those in publicly available datasets (BCSS 13 , GlaS 15 , CoNIC 5 , MoNuSAC2020 14 , and NuCLS 7 ) are also shown. Organs in brackets represent the target organs of the dataset.…”
Section: Resultsmentioning
confidence: 99%
“…A mixture of anti-CD3 and anti-CD20 antibodies was used for lymphocytes. Because our workflow can generate segmentation masks in a high-throughput manner without the need for manual annotation, the size of the dataset is over one order of magnitude larger than the currently available segmentation mask datasets for tumour tissues 5,7,[13][14][15] (Fig. 1b, c).…”
Section: Dataset Generation Workflowmentioning
confidence: 99%
See 1 more Smart Citation
“…The publicly available NuCLS dataset 28 consists of many image patches extracted from breast cancer images from The Cancer Genome Atlas (TCGA) dataset 25 . Image patches were split into single-rater and multi-rater datasets.…”
Section: Breast Cancer Datasetmentioning
confidence: 99%
“…The NuCLS dataset 14 proposes a "crowdsourced" dataset where the annotations are made by non-pathologists from algorithmic suggestions, and with corrections by junior and senior pathologists. It also provides a "multi-rater" dataset, where detailed individual annotations from experts and non-experts are provided on selected FOVs.…”
Section: Nucls Dataset and Experimentsmentioning
confidence: 99%