Single-cell RNA sequencing (scRNA-seq) provides the expression profiles of individual cells, and it is expected to provide higher cellular differential resolution than traditional bulk RNA sequencing. In scRNA-seq analysis, clustering is crucial for identifying cell types, and can be potentially exploited to understand high-level biological processes. Recently, autoencoder has been successfully applied in scRNAseq clustering problem and achieved promising results. Most existing works focus on characterizing the sparsity of data, and directly utilize the bottleneck feature of the autoencoder for clustering might not be optimal. In this paper, a novel framework named Adversarial AutoEncoder ScRNA-seq Clustering (AAE-SC) is proposed to bring an additional constraint on the bottleneck feature. Specifically, AAE-SC adds an AAE module on top of the bottleneck layer, and constrains the bottleneck feature distribution to be aligned with a consistent distribution. Also, the AAE and the reconstructed modules are jointly optimized to generate a highly discriminative and consistent feature, which is further proceeded for clustering. We find that by using AAE-SC to impose certain constraints on the features of the hidden layer, the performance of clustering can be improved. Experimental results on three real-world datasets demonstrate that the proposed AAE-SC framework outperformed state-of-the-art methods by 2% at least and 5% at most. And AAE-SC shows more robustness than the baseline model for downsampled and unbalanced cluster size datasets. INDEX TERMS Single-cell RNA-seq data, Adversarial autoencoder, Clustering analysis, Unsupervised learning Author et al.: AAE-SC: A scRNA-seq Clustering Framework based on Adversarial Autoencoder Genes Cells Low High Cluster1 Cluster2 Cluster3 Blood Cells Bladder Cells Neurons Unsupervised Clustering Cell Label Identification FIGURE 1. Explanation of scRNA-seq clustering task. Because dropout causes the gene expression level of the original data to be very low, the data is highly sparse, and it brings difficulties to the subsequent clustering. Therefore, special clustering algorithms are required to process this type of data and correctly assign different cell samples to different And identify the cell type. The heat map on the left of the figure is a visual representation of raw scRNA-seq data, and the numbers in the heat map indicate the expression value of each gene in the cell sample. The color bars in the figure indicate the level of gene expression.