“…Our methods are also reminiscent of active learning methods (Settles, 2009;Peris and Casacuberta, 2018;P.V.S and Meyer, 2019), such as uncertainty sampling (Lewis and Gale, 1994) which selects (unlabeled) data points, which a model trained on a small labeled subset, has least confidence in, or predicts as farthest (in vector space, based on cosine similarity) (Sener and Savarese, 2018;Wolf, 2011). Our approach uses labeled data for selection, similar to core-set selection approaches (Wei et al, 2013).…”