The text similarity detection algorithm has a wide range of applications in the processing of massive natural language text information. Unlike simple and complete repetitive search, the complexity of natural language has caused great difficulties in the calculation of text semantic similarity. The Simhash algorithm does not involve the semantic information of the text, and cannot support the semantic problems of natural language processing such as synonyms and polysemes. Therefore, using the "dimensionality reduction" advantage of animal algorithms in English text processing and the efficiency of the retrieval process, aiming at its inability to recognize semantically similar text content, this paper studies the English text similarity detection algorithm of animal algorithms. Aiming at the shortcomings of simhash in the semantic similarity of text, this paper proposes a semantic code design based on the synonym word forest and context through the study of existing synonym expansion schemes. Based on the comprehensive improvement scheme, a semantic fingerprint generation algorithm incorporating synonym information is proposed, which solves the problem of similar texts that cannot identify replacement synonyms. The experiments in this paper show that after testing the sample data of the algorithm in this paper, under the condition of using k = 3 parameter determination, the accuracy and recall of correct identification are over 77%. In contrast, the two indexes of the traditional simhash algorithm and the word frequency statistical algorithm are only about 70%. This proves that the improved algorithm proposed in this paper has achieved relatively good results for identifying multiple similar modification situations, especially the problem of synonym replacement.