“…On the Portuguese Corref-PT dataset, 923,566 mention-pairs were generated, 45,659 of which were positive learning instances and 877,907 were negative learning instances, corresponding to a 4.9%/95.1% split. This class imbalance problem has been extensively studied in literature, and is usually tackled by using random undersampling of the majority class [20], [31]. We chose to perform our own study using one of the proposed architectures, aiming to identify which undersampling percentage is able to maximize performance on specific coreference metrics.…”