Single-cell RNA-sequencing (scRNA-seq) is a high-resolution transcriptomic approach used to discover gene expression patterns among cell types to study precise biological functions. Unsupervised machine learning (clustering) is of central importance for the analysis of scRNA- seq data. It can identify putative cell types, uncover regulatory relationships, and track cell lineages and trajectories. A key issue in clustering scRNA-seq data is determining which clus- tering method is appropriate to use, since varied methods can yield diverse results. Current approaches usually focus on a one method and manually select a seemingly meaningful result. From a biological relevance perspective, it is vital to distinguish between normal and pathogenic cell types using marker genes. We present a learning framework for comparing outcomes of multiple scRNA-seq clustering methods to determine the most optimal results. We address the challenges of model selection and validation metrics in the context of traumatic brain injury (TBI) applications. We compare clustering performance of five clustering algorithms and two dimensionality reduction techniques implemented in both Seurat and Scanpy packages.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.