“…The recent years have seen much interest, as well as progress, in training directly with task-specific performance measures in the field of classification and ranking. Some notable works include those of [10,15] that investigate the statistical properties of plug-in classifiers for various non-decomposable objectives including F-measure, and [7,8,12,13] which propose stochastic gradient-style algorithms for optimizing non-decomposable performance measures such as F-measure, KL-divergence, area under the ROC curve (AUC), precision recall curve (AUCPR), recall at fixed precision (R@P), etc. However, all the works cited above focus only on training linear models.…”