The network public opinion plays an irreplaceable role in today's society. The analysis and processing of data mining can effectively find out the sensitive network public opinion and prevent the intensification of social public opinion. In this paper, we propose an association degree algorithm between personal speech and user real name information in Chinese community. By defining the user's name, address, ID number, phone number, QQ number, e-mail, MSN and other information as keywords and setting different weights. The hot public opinion phrases in user comments are extracted, the word frequency is counted, the text semantic similarity classification model is constructed, the semantic tree of unknown text is automatically constructed, the semantic relevance based on the topic is calculated, and the relevance between the public opinion topic and the specified user information is obtained. The algorithm in this paper uses Chinese segmentation index technology and common algorithms of text de duplication technology. From the technical feasibility, economic feasibility and other aspects of the feasibility analysis; from the system response speed, scalability and security three main levels of the system function requirements analysis. The results show that the system design meets the requirements and improves the accuracy of online public opinion text collection and emotional analysis.