“…Bayesian as the naive bayes (NB) [17,8], support vector machines (SVM) [17,8], and ensemble learning, which is marked () in the last column of the [20] Twitter from [30,31] racism, sexism characters, words, and both CNN 2018 Zimmerman et al [21] Twitter from [30] racism, sexism embedding deep learning 2018 Pitsilis et al [22] Twitter from [30] racism, sexism defined by the authors LSTM 2018 Montani and Schuller [18] GermEval 2018 1 general TFIDF, Word2Vec, n-gram LR, RF, ET 2019 Zhang and Luo [16] Twitter from [17,30] [17]: race ethnicity, religion [30]: racism, sexism Word2Vec CNN 2019 Liu et al [32] Twitter from [17] race ethnicity, religion embedding, LDA fuzzy ensemble 2019 Ramakrishnan et al [19] OffensEval [33] general n-gram, GloVe, others LR, RF, XG 2020 Paschalides et al [23] Twitter from [8] racism, sexism, homophobia The most common social media used to extract information to compose a dataset for hate speech detection is Twitter. Despite English being the most used language, there are datasets from many other languages, such as the Arabic-Twitter dataset [26] and Hindi-English Twitter dataset [27]. However, independently of the language, the construction of such datasets is a time-consuming task, not only because the number of hateful instances in online communities is relatively low compared to regular instances, but also because of the labeling process, i.e., which indicates if a sentence is hateful or not, is not a trivial piece of work.…”