Deep learning has shown impressive diagnostic abilities in Alzheimer’s disease (AD) research in recent years. However, although neuropsychological tests play a crucial role in screening AD and mild cognitive impairment (MCI), there is still a lack of deep learning algorithms only using such basic diagnostic methods. This paper proposes a novel semi-supervised method using neuropsychological test scores and scarce labeled data, which introduces difference regularization and consistency regularization with pseudo-labeling. A total of 188 AD, 402 MCI, and 229 normal controls (NC) were enrolled in the study from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. We first chose the 15 features most associated with the diagnostic outcome by feature selection among the seven neuropsychological tests. Next, we proposed a dual semi-supervised learning (DSSL) framework that uses two encoders to learn two different feature vectors. The diagnosed 60 and 120 subjects were randomly selected as training labels for the model. The experimental results show that DSSL achieves the best accuracy and stability in classifying AD, MCI, and NC (85.47% accuracy for 60 labels and 88.40% accuracy for 120 labels) compared to other semi-supervised methods. DSSL is an excellent semi-supervised method to provide clinical insight for physicians to diagnose AD and MCI.
Analyzing highly individual-specific genomic data to understand genetic interactions in cancer development is still challenging, with significant implications for the discovery of individual biomarkers as well as personalized medicine. With the rapid development of deep learning, graph neural networks (GNNs) have been employed to analyze a wide range of biomolecular networks. However, many neural networks are limited to black box models, which are only capable of making predictions, and they are often challenged to provide reliable biological and clinical insights. In this research, for sample-specific networks, a novel end-to-end hierarchical graph neural network with interpretable modules is proposed, which learns structural features at multiple scales and incorporates a soft mask layer in extracting subgraphs that contribute to classification. The perturbations caused by the input graphs' deductions are used to evaluate key gene clusters, and the samples are then grouped into classes to produce both sample- and stage-level explanations. Experiments on four gene expression datasets from The Cancer Genome Atlas (TCGA) show that the proposed model not only rivals the advanced GNN methods in cancer staging but also identifies key gene clusters that have a great impact on classification confidence, providing potential targets for personalized medicine.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.