Short text representation is a fundamental task in NLP. Traditional approaches simply merge a bag-of-words model with a topic model, which can leave semantic information ambiguous and topic information sparse. We propose an unsupervised text representation method that fuses weighted word embeddings with extended topic information. Two fusion strategies are designed: static linear fusion and dynamic fusion. The method highlights important semantic information, fuses topic information flexibly, and improves the quality of short text representations. We verify the method on classification and prediction tasks, and the experimental results show that it is effective.
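A minimal sketch of how the two fusion strategies described above could look, assuming the short text has already been reduced to a weighted average of its word embeddings and a topic-probability vector from a topic model. The TF-IDF weighting, the entropy-based gate in the dynamic variant, and the function names are illustrative assumptions, not the authors' exact formulation.

```python
import numpy as np

def weighted_embedding(word_vecs: np.ndarray, tfidf_weights: np.ndarray) -> np.ndarray:
    """TF-IDF-weighted average of the word embeddings of one short text."""
    w = tfidf_weights / (tfidf_weights.sum() + 1e-12)
    return (word_vecs * w[:, None]).sum(axis=0)

def static_linear_fusion(emb_vec, topic_vec, alpha=0.5):
    """Static strategy: fixed linear mix of the embedding part and the topic part."""
    return np.concatenate([alpha * emb_vec, (1.0 - alpha) * topic_vec])

def dynamic_fusion(emb_vec, topic_vec):
    """Dynamic strategy: weight the topic part per document, here by how
    peaked its topic distribution is (low entropy -> trust the topic signal more)."""
    p = topic_vec / (topic_vec.sum() + 1e-12)
    entropy = -(p * np.log(p + 1e-12)).sum()
    gate = 1.0 - entropy / np.log(len(p))  # 1 = very peaked, 0 = near uniform
    return np.concatenate([(1.0 - gate) * emb_vec, gate * topic_vec])

# Example: doc_vec = dynamic_fusion(weighted_embedding(vecs, weights), topic_probs)
```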
Topic recognition technology is commonly applied to identify different categories of news topics in the vast amount of web information, with broad application prospects in online public opinion monitoring, news recommendation, and related fields. However, it is challenging to effectively exploit key features of the text, such as syntax and semantics, to improve topic recognition accuracy. Some researchers have proposed combining topic models with word embedding models, and their results show that this approach can enrich text representation and benefit downstream natural language processing tasks. However, for news topic recognition there is currently no standard way of combining a topic model with a word embedding model. Moreover, existing approaches of this kind tend to be complex and do not consider fusing topic distributions of different granularities with word embedding information. This paper therefore proposes a novel text representation method based on word embedding enhancement and builds it into a full-process topic recognition framework for news text. In contrast to traditional topic recognition methods, the framework uses the probabilistic topic model LDA and the word embedding models Word2vec and GloVe to extract and integrate the topic distribution, semantic knowledge, and syntactic relationships of the text, and then applies popular classifiers to automatically recognize the topic categories of news articles from the resulting text representation vectors. The framework thereby exploits both the document-topic relationship and the context information, which improves expressive power while reducing dimensionality. Experimental results on the two benchmark datasets 20NewsGroup and BBC News verify the effectiveness and superiority of the proposed word-embedding-enhanced method for the news topic recognition problem.
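As a rough illustration of the framework just described (not the authors' exact pipeline), the sketch below computes an LDA topic distribution and an averaged Word2vec embedding per document, concatenates them into one representation vector, and trains an off-the-shelf classifier on the result. The hyperparameters, the plain averaging of word vectors, and the choice of LogisticRegression are assumptions made for the sketch; pretrained GloVe vectors could be loaded in place of the locally trained Word2vec model.

```python
import numpy as np
from gensim.corpora import Dictionary
from gensim.models import LdaModel, Word2Vec
from sklearn.linear_model import LogisticRegression

def build_features(tokenized_docs, num_topics=20, dim=100):
    # Topic side: LDA over the bag-of-words corpus.
    dictionary = Dictionary(tokenized_docs)
    corpus = [dictionary.doc2bow(doc) for doc in tokenized_docs]
    lda = LdaModel(corpus=corpus, id2word=dictionary, num_topics=num_topics)

    # Semantic side: Word2vec trained on the same corpus.
    w2v = Word2Vec(tokenized_docs, vector_size=dim, min_count=1)

    feats = []
    for doc, bow in zip(tokenized_docs, corpus):
        topic_vec = np.zeros(num_topics)
        for tid, prob in lda.get_document_topics(bow, minimum_probability=0.0):
            topic_vec[tid] = prob
        word_vecs = [w2v.wv[w] for w in doc if w in w2v.wv]
        emb_vec = np.mean(word_vecs, axis=0) if word_vecs else np.zeros(dim)
        feats.append(np.concatenate([emb_vec, topic_vec]))  # fused representation
    return np.array(feats)

if __name__ == "__main__":
    # Toy corpus: two finance documents and two sports documents.
    docs = [["stocks", "rise", "market"], ["team", "wins", "match"],
            ["shares", "fall", "trading"], ["coach", "praises", "players"]]
    labels = [0, 1, 0, 1]
    X = build_features(docs, num_topics=2, dim=20)
    clf = LogisticRegression(max_iter=1000).fit(X, labels)
    print(clf.predict(X))
```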