2018
DOI: 10.1101/414904
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Bayesian Trees for Automated Cytometry Data Analysis

Abstract: Cytometry is an important single cell analysis technology in furthering our understanding of cellular biological processes and in supporting clinical diagnoses across a variety hematological and immunological conditions. Current data analysis workflows for cytometry data rely on a manual process called gating to classify cells into canonical types. This dependence on human annotation significantly limits the rate, reproducibility, and scope of cytometry's use in both biological research and clinical practice. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 15 publications
0
7
0
Order By: Relevance
“…According to (35) and (36), the k-means algorithm and GMM-based model (SWIFT) are often not the optimal choice for clustering CyTOF data; 2. There are already plenty of well-established cell identification algorithms and automatic gating techniques such as FlowSOM (10), Bayesian Tree (12), ACDC (37), SCINA (38), etc, all of which have similar goals but slightly different design considerations. For example, using FlowSOM requires minimal prior knowledge on the cell population other than some crude idea on the number of cell types.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…According to (35) and (36), the k-means algorithm and GMM-based model (SWIFT) are often not the optimal choice for clustering CyTOF data; 2. There are already plenty of well-established cell identification algorithms and automatic gating techniques such as FlowSOM (10), Bayesian Tree (12), ACDC (37), SCINA (38), etc, all of which have similar goals but slightly different design considerations. For example, using FlowSOM requires minimal prior knowledge on the cell population other than some crude idea on the number of cell types.…”
Section: Resultsmentioning
confidence: 99%
“…As a result, it requires not only the expression matrix of one particular CyTOF dataset to mimic but also the cell type label associated with each cell as inputs. Although Cytomulate can potentially perform cell type identification or clustering using common algorithms of the likes of K-means, we leave the choice of such algorithm for cell typing to our users as a design choice for the following two reasons: (i) According to (47) and (48), the k-means algorithm and GMM-based model (SWIFT) are often not the optimal choice for clustering CyTOF data; (ii) There are already plenty of well-established cell identification algorithms and automatic gating techniques such as FlowSOM (11), Bayesian Tree (13), ACDC (49), SCINA (50), etc., all of which have similar goals but slightly different design considerations. For example, using FlowSOM requires minimal prior knowledge on the cell population other than some crude idea on the number of cell types.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, it may be possible to reduce this complexity in practice, for example by leveraging prior knowledge available about markers and cell‐types to provide hints to the search algorithm, or by using an initial manual gating as a starting point and then having the algorithm make local changes to the structure to try to improve it in terms of its predictive accuracy. This initialization could also be done in a semisupervised manner based on given phenotype of cell populations of interest, without requiring prior knowledge for the locations of the gates (e.g., using a Bayesian tree method (Ji et al )).…”
Section: Discussionmentioning
confidence: 99%
“…Some of these tools use computer learning algorithms such as linear discriminant analysis (LDA) (Abdelaal et al, 2018) or neural networks to apply the patterns extracted from the training sets to annotate cells from a new dataset. 2The semi-supervised clustering algorithms incorporate user provided marker matrix of known marker associations with particular cell types (Lee et al, 2017;Ji et al, 2018) to guide cellular clustering and identification. These marker matrices are composed of marker expression patterns in various cell types that serve as a cluster dictionaries indicating whether the markers are negative, positive or ignorable for each cell types.…”
Section: Supervised or Semi-supervised Clustering Toolsmentioning
confidence: 99%