The dense network of interconnected cellular signalling responses that are quantifiable in peripheral immune cells provides a wealth of actionable immunological insights. Although high-throughput single-cell profiling techniques, including polychromatic flow and mass cytometry, have matured to a point that enables detailed immune profiling of patients in numerous clinical settings, the limited cohort size and high dimensionality of data increase the possibility of false-positive discoveries and model overfitting. We introduce a generalizable machine learning platform, the immunological Elastic-Net (iEN), which incorporates immunological knowledge directly into the predictive models. Importantly, the algorithm maintains the exploratory nature of the high-dimensional dataset, allowing for the inclusion of immune features with strong predictive capabilities even if not consistent with prior knowledge. In three independent studies our method demonstrates improved predictions for clinically relevant outcomes from mass cytometry data generated from whole blood, as well as a large simulated dataset. The iEN is available under an open-source licence.
This diagnostic/prognostic study describes the use of cell-free transcriptomics, urine metabolomics, and plasma proteomics for identifying the biological measurements associated with preterm birth.
High-throughput single-cell analysis technologies produce an abundance of data that is critical for profiling the heterogeneity of cellular systems. We introduce VoPo (https://github. com/stanleyn/VoPo), a machine learning algorithm for predictive modeling and comprehensive visualization of the heterogeneity captured in large single-cell datasets. In three mass cytometry datasets, with the largest measuring hundreds of millions of cells over hundreds of samples, VoPo defines phenotypically and functionally homogeneous cell populations. VoPo further outperforms state-of-the-art machine learning algorithms in classification tasks, and identified immune-correlates of clinically-relevant parameters.
A multitude of clinical, biological, environmental, and demographic factors influence the trajectory of a pregnancy. Maternal genetics, environment, stress, nutrition, medical history, socioeconomic status, and racial and ethnic background all play a role in determining the success of a pregnancy. Diverse data sources are available for the study of pregnancy and prediction of adverse outcomes, including electronic health records (EHRs) and administrative claims data, high-throughput multiomics data for characterizing biological systems, and more complex sources like time series, imaging and video data, and text. Recent advances in multiview, multitask, and deep learning allow joint modeling across data sources as well as across outcomes and demonstrate the vast potential of such integrated approaches.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.