“…While there are some prior data engineering solutions to "model patching", including augmentation (Sennrich et al, 2015;Wei and Zou, 2019;Kaushik et al, 2019;Goel et al, 2021a), weak labeling (Ratner et al, 2017;Chen et al, 2020), and synthetic data generation (Murty et al, 2020), due to the noise in WIKIPEDIA, we repurpose BOOTLEGSPORT using weak labeling to modify training labels and correct for this noise. Our weak labeling technique works as follows: any existing mention from strong-sport-cues that is labeled as a country is relabeled as a national sports team for that country.…”