Background: To stop tuberculosis (TB), the leading infectious cause of death globally, we need to better understand transmission risk factors. Although many studies have identified associations between individual-level covariates and pathogen genetic relatedness, few have identified characteristics of transmission pairs or explored how closely covariates associated with genetic relatedness mirror those associated with transmission. Methods: We simulated a TB-like outbreak with pathogen genetic data and estimated odds ratios (ORs) to correlate each covariate and genetic relatedness. We used a naive Bayes approach to modify the genetic links and nonlinks to resemble the true links and nonlinks more closely and estimated modified ORs with this approach. We compared these two sets of ORs with the true ORs for transmission. Finally, we applied this method to TB data in Hamburg, Germany, and Massachusetts, USA, to find pair-level covariates associated with transmission.Results: Using simulations, we found that associations between covariates and genetic relatedness had the same relative magnitudes and directions as the true associations with transmission, but biased absolute magnitudes. Modifying the genetic links and nonlinks reduced the bias and increased the confidence interval widths, more accurately capturing error. In Hamburg and Massachusetts, pairs were more likely to be probable transmission links if they lived in closer proximity, had a shorter time between observations, or had shared ethnicity, social risk factors, drug resistance, or genotypes. Conclusions: We developed a method to improve the use of genetic relatedness as a proxy for transmission, and aid in understanding TB transmission dynamics in low-burden settings.