2023
DOI: 10.20944/preprints202310.0286.v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Natural Language Processing-based Method for Clustering and Analysis of Movie Reviews and Classification by Genre

Fernando González,
Miguel Torres-Ruiz,
Guadalupe Rivera-Torruco
et al.

Abstract: The large quantity of information retrieved from communities, public data repositories, web pages, or data mining can be sparsed and poorly classified. This work shows how to employ unsupervised classification algorithms such as K-means proper to classify user reviews into their closest category, forming a balanced data set. Moreover, we found that the text vectorization technique significantly impacts the clustering formation, comparing TF-IDF and Word2Vec. The value for mapping a cluster with movie genre was… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
references
References 45 publications
0
0
0
Order By: Relevance