“…Most distant supervision research focuses on addressing the disadvantages of heuristic labelling, namely reducing false positive training data (Hoffmann et al, 2011; Surdeanu et al, 2012; Riedel et al, 2013; Alfonseca et al, 2012; Roth and Klakow, 2013; Takamatsu et al, 2012; Xu et al, 2013) and dealing with false negatives due to missing entries in the knowledge base (Min et al, 2013), as well as combining distant supervision with active learning (Angeli et al, 2014). Distant supervision has been researched for different domains, including newswire (Riedel et al, 2013), Wikipedia (Mintz et al, 2009; Nguyen and Moschitti, 2011), the biomedical domain (Craven and Kumlien, 1999; Roller and Stevenson, 2014), the architecture domain (Vlachos and Clark, 2014) and the Web (Xin et al, 2014; Augenstein et al, 2014; Augenstein et al, 2015). To date, there is very little research on improving NERC for distant supervision to extract relations between non-standard entities such as musical artists and albums.…”