“…A diverse range of techniques have been applied to the AND problem such as supervised approaches (support vector machines and naive Bayes: Han, Giles, Zha, Li, and Tsioutsiouliklis []), unsupervised approaches (Ferreira, Veloso, Gonçalves, & Laender, ; Khabsa, Treeratpituk, & Giles, ), graph‐based models (Markov random field: Tang, Fong, Wang, and Zhang []; factor graph model: Wang, Tang, Cheng, and Philip []), heuristic‐based solutions (Cota, Ferreira, Nascimento, Gonçalves, & Laender, ; Santana, Gonçalves, Laender, & Ferreira, ). Ferreira et al (2014), Liu, Li, Huang, and Fang (), and Cota et al (2010) have proposed grouping/clustering the records using coauthors, title, and venue.…”