2023
DOI: 10.1109/access.2023.3266377
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Text Representation and Embedding Techniques in NLP

Abstract: Natural Language Processing (NLP) is a research field where a language in consideration is processed to understand its syntactic, semantic, and sentimental aspects. The advancement in the NLP area has helped solve problems in the domains such as Neural Machine Translation, Name Entity Recognition, Sentiment Analysis, and Chatbots, to name a few. The topic of NLP broadly consists of two main parts: the representation of the input text (raw data) into numerical format (vectors or matrix) and the design of models… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
4
4
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 51 publications
(8 citation statements)
references
References 113 publications
0
8
0
Order By: Relevance
“…(5) Intelligent retrieval Implement intelligent retrieval in different scenarios based on NLP technology and knowledge graph matching techniques [16]. After understanding users' retrieval intentions, conduct precise or fuzzy searches based on their query methods.…”
Section: Research Frameworkmentioning
confidence: 99%
“…(5) Intelligent retrieval Implement intelligent retrieval in different scenarios based on NLP technology and knowledge graph matching techniques [16]. After understanding users' retrieval intentions, conduct precise or fuzzy searches based on their query methods.…”
Section: Research Frameworkmentioning
confidence: 99%
“…TFIDF is a statistical method to evaluate the significance of a word in a document or corpus (33) . Terms with higher TF-IDF scores are more relevant and can be utilised in tasks such as keyword extraction, document ranking, and information retrieval (34) . As https://www.indjst.org/ for TFIDF, the value of IDF must be calculated first before the final calculation can be executed.…”
Section: Term Weighting and Analysismentioning
confidence: 99%
“…Sentence embeddings in Natural Language Processing (NLP) refer to techniques that capture the semantic meaning of entire sentences by representing them as dense numerical vectors, enabling a wide range of downstream tasks such as sentence similarity, paraphrase detection, and text classification. BERT (Bidirectional Encoder Representations from Transformers) [35][36][37] embeddings capture rich contextual information and have revolutionized NLP tasks such as text classification, named entity recognition, and sentiment analysis. BOW and TF-IDF [38] embeddings are simpler but still useful for tasks such as document classification or information retrieval, where word frequency or presence is crucial.…”
Section: Sentence Embeddingsmentioning
confidence: 99%