2023
DOI: 10.1504/ijris.2023.128371
|View full text |Cite
|
Sign up to set email alerts
|

Entity extraction based on the combination of information entropy and TF-IDF

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 0 publications
0
7
0
Order By: Relevance
“…Applying TF-IDF and WordVec to the construction of knowledge graphs can significantly enhance the information retrieval capabilities and semantic parsing efficiency of knowledge graphs [69,70]. Using TF-IDF effectively identifies keywords or phrases that often carry crucial information linking different entities and attributes [71]. For example, when building a knowledge graph of legal documents, TF-IDF can help extract key legal terms and related definitions, which are fundamental to constructing entities and relationships.…”
Section: Dataset Preprocessing 321 Standard Preprocessing Methodsmentioning
confidence: 99%
“…Applying TF-IDF and WordVec to the construction of knowledge graphs can significantly enhance the information retrieval capabilities and semantic parsing efficiency of knowledge graphs [69,70]. Using TF-IDF effectively identifies keywords or phrases that often carry crucial information linking different entities and attributes [71]. For example, when building a knowledge graph of legal documents, TF-IDF can help extract key legal terms and related definitions, which are fundamental to constructing entities and relationships.…”
Section: Dataset Preprocessing 321 Standard Preprocessing Methodsmentioning
confidence: 99%
“…TF-IDF method is a commonly used weighting technique [17]. Where TF is the frequency of words appearing in the text, i.e., the word, the greater the frequency indicates the greater the contribution of the word to the text: IDF is the inverted text frequency, which refers to the distribution of the word in the whole set of text, if the word is rarely found in other texts, it indicates that the word has a very good ability to distinguish between categories.…”
Section: Tf-idf Methodologymentioning
confidence: 99%
“…The system adopts a CIS structure, with servers deployed in the ship software status command room, and clients distributed on all ship stations that require detection and maintenance [17]. The server has a visual interface and complete functions such as ship and ship software status detection, fault software repair, software version release and update; The client does not have a visual interface and is a console program that starts automatically in the background.…”
Section: Deployment Designmentioning
confidence: 99%