“…First, two widely used data sets were used as the benchmark, as follows: The Reuters 21578 Distribution 1.0 data set (Reuters) consists of 12,902 articles and 90 topic categories from the Reuters newswire (Aphinyanaphongs et al, 2014;Debole & Sebastiani, 2003;Gliozzo et al, 2005;Ke, 2012;Sun, Lim, & Ng, 2003;Sun, Lim, & Liu, 2009;Yang & Liu, 1999;Yu et al, 2003). Following other studies by Nigam (2001) and Joachims (1998), we built binary classifiers for each class to identify the news topic.…”