Domain-generalized few-shot text classification (DG-FSTC) is a new setting for few-shot text classification (FSTC). In DG-FSTC, the model is meta-trained on a multi-domain dataset, and meta-tested on unseen datasets with different domains. However, previous methods mostly construct semantic representations by learning from words directly, which is limited in domain adaptability. In this study, we enhance the domain adaptability of the model by utilizing the distributional signatures of texts that indicate domain-related features in specific domains. We propose a Multi-level Distributional Signatures based model, namely MultiDS. Firstly, inspired by pretrained language models, we compute distributional signatures from an extra large news corpus, and we denote these as domain-agnostic features. Then we calculate the distributional signatures from texts in the same domain and texts from the same class, respectively. These two kinds of information are regarded as domain-specific and class-specific features, respectively. After that, we fuse and translate these three distributional signatures into word-level attention values, which enables the model to capture informative features as domain changes. In addition, we utilize domain-specific distributional signatures for the calibration of feature representations in specific domains. The calibration vectors produced by the domain-specific distributional signatures and word embeddings help the model adapt to various domains. Extensive experiments are performed on four benchmarks. The results demonstrate that our proposed method beats the state-of-the-art method with an average improvement of 1.41% on four datasets. Compared with five competitive baselines, our method achieves the best average performance. The ablation studies prove the effectiveness of each proposed module.
In recent years, COVID-19 has become the hottest topic. Various issues, such as epidemic transmission routes and preventive measures, have “occupied” several online social media platforms. Many rumors about COVID-19 have also arisen, causing public anxiety and seriously affecting normal social order. Identifying a rumor at its very inception is crucial to reducing the potential harm of its evolution to society as a whole. However, epidemic rumors provide limited signal features in the early stage. In order to identify rumors with data sparsity, we propose a few-shot learning rumor detection model based on capsule networks (CNFRD), utilizing the metric learning framework and the capsule network to detect the rumors posted during unexpected epidemic events. Specifically, we constructively use the capsule network neural layer to summarize the historical rumor data and obtain the generalized class representation based on the historical rumor data samples. Besides, we calculate the distance between the epidemic rumor sample and the historical rumor class-wise representation according to the metric module. Finally, epidemic rumors are discriminated against according to the nearest neighbor principle. The experimental results prove that the proposed method can achieve higher accuracy with fewer epidemic rumor samples. This approach provided 88.92% accuracy on the Chinese rumor dataset and 87.07% accuracy on the English rumor dataset, which improved by 7% to 23% over existing approaches. Therefore, the CNFRD model can identify epidemic rumors in COVID-19 as early as possible and effectively improve the performance of rumor detection.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.