Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval 2015
DOI: 10.1145/2766462.2767701
|View full text |Cite
|
Sign up to set email alerts
|

HSpam14

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 65 publications
(11 citation statements)
references
References 27 publications
0
11
0
Order By: Relevance
“…We combine MinHash clustering and incremental clustering to group tweets collected by keywords TC within a time window into clusters. It has been reported that the MinHash algorithm is effective in grouping near‐duplicate tweets (Sedhai & Sun, ). However, tweets with a different minimum hash value could also be similar.…”
Section: Proposed Method: Almikmentioning
confidence: 99%
“…We combine MinHash clustering and incremental clustering to group tweets collected by keywords TC within a time window into clusters. It has been reported that the MinHash algorithm is effective in grouping near‐duplicate tweets (Sedhai & Sun, ). However, tweets with a different minimum hash value could also be similar.…”
Section: Proposed Method: Almikmentioning
confidence: 99%
“…Before conducting analysis on the collection of tweets, we briefly describe the dataset and the data annotation process. A detailed explanation and discussion related to the annotation process were reported in our earlier work (Sedhai & Sun, ).…”
Section: Overview Of Hspam14mentioning
confidence: 99%
“…The collected tweets contain 20.21 million hashtags and 6.97 million hyperlinks. Among the collected tweets, 14.07 million tweets are in English and were labeled spam and ham to generate the HSpam14 dataset, as discussed previously (Sedhai & Sun, ).…”
Section: Overview Of Hspam14mentioning
confidence: 99%
See 2 more Smart Citations