2021
DOI: 10.1177/0165551521998636

Improvements for research data repositories: The case of text spam

Abstract: Current research has evolved in such a way that scientists must not only adequately describe the algorithms they introduce and the results of their application, but also ensure the possibility of reproducing the results and comparing them with those obtained through other approximations. In this context, public data sets (sometimes shared through repositories) are one of the most important elements for the development of experimental protocols and test benches. This study has analysed a significant number of CS/ML …

Citations: cited by 2 publications (1 citation statement)
References: 23 publications (33 reference statements)
“…As stated in recent works ( Vélez de Mendizabal et al, 2020 ; Novo-Lourés et al, 2020 ; Vázquez et al, 2021 ), there are a large number of public available corpora that can be used for testing new spam classification proposals. After reviewing the datasets reported in them, we have selected the Youtube Spam Collection dataset ( Alberto & Lochter, 2017 ) which was also the one chosen for the preceding SDRS study.…”
Section: Experimental Design
Citation type: mentioning
confidence: 99%