2023
DOI: 10.21203/rs.3.rs-3570648/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

pysentimiento: A Python Toolkit for Opinion Mining and Social NLP tasks

Juan Manuel Perez,
Mariela Rajngewerc,
Juan Carlos Giudici
et al.

Abstract: In recent years, the extraction of opinions and information from user-generated text has attracted a lot of interest, largely due to the unprecedented volume of content in Social Media. However, social researchers face some issues in adopting cutting-edge tools for these tasks, as they are usually behind commercial APIs, unavailable for other languages than English, or very complex to use for non-experts. To address these issues, we present pysentimiento, a comprehensive multilingual Python toolkit designed fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 41 publications
0
2
0
Order By: Relevance
“…Pysentimiento is an open-source Python library that includes models for sentiment analysis and social NLP tasks, such as hate speech detection, irony detection, emotion analysis, named entity recognition, and part-of-speech tagging, in several languages such as English, Spanish, Portuguese, and Italian [101,102]. The English model for sentiment analysis is based on BERTweet [103], a RoBERTa model, trained on English tweets and also fine-tuned on the SemEval 2017 sentiment analysis data set [91].…”
Section: Pysentimientomentioning
confidence: 99%
“…Pysentimiento is an open-source Python library that includes models for sentiment analysis and social NLP tasks, such as hate speech detection, irony detection, emotion analysis, named entity recognition, and part-of-speech tagging, in several languages such as English, Spanish, Portuguese, and Italian [101,102]. The English model for sentiment analysis is based on BERTweet [103], a RoBERTa model, trained on English tweets and also fine-tuned on the SemEval 2017 sentiment analysis data set [91].…”
Section: Pysentimientomentioning
confidence: 99%
“…-Sentiment (English [17], Italian [18], Spanish [17], German [19]) -Emotions (English [20], Italian [18], Spanish [21]) -Hate speech (English, Italian, German [22]) -Fake news (English [23], German [24]) -Irony (English [25]) -Sexism (English [26])…”
Section: Natural Language Processing and Machine Learning Apimentioning
confidence: 99%