2018
DOI: 10.1007/978-3-030-05532-5_51
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Categorization of Tweets on the Political Electoral Theme Using Supervised Classification Algorithms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
1
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 3 publications
0
2
0
Order By: Relevance
“…For knowing the real performance of a model before applying it, there is a very common solution, which is to use the training set to train and validate at the same time, known as cross-validation; it consists of dividing the training set in k equal parts, if k = 5 then the model is trained with the first four parts and tested with the fifth, then it is trained with the first three and the fifth and tested with the fourth, this is repeated k times, always leaving out one part for the test, then the performance in each one is averaged and thus the expected performance is obtained [5]. For the validations, it is considered to use k = 10 (number of folds), which is the number of parts into which the dataset is divided to train and evaluate the models.…”
Section: Phase 4: Model Development and Hypothesis Buildingmentioning
confidence: 99%
“…For knowing the real performance of a model before applying it, there is a very common solution, which is to use the training set to train and validate at the same time, known as cross-validation; it consists of dividing the training set in k equal parts, if k = 5 then the model is trained with the first four parts and tested with the fifth, then it is trained with the first three and the fifth and tested with the fourth, this is repeated k times, always leaving out one part for the test, then the performance in each one is averaged and thus the expected performance is obtained [5]. For the validations, it is considered to use k = 10 (number of folds), which is the number of parts into which the dataset is divided to train and evaluate the models.…”
Section: Phase 4: Model Development and Hypothesis Buildingmentioning
confidence: 99%
“…In the development of this article, the objectives that allow the classification of the tweets, which are extracted from a reliable repository such as Twitter, are established. Data mining techniques are also implemented in the data sets obtained in order to extract the relevant information for their subsequent classification [11].…”
Section: Introductionmentioning
confidence: 99%