Dairy farmers are motivated to ensure cows become pregnant in an optimal and timely manner. Although timed artificial insemination (TAI) is a successful management tool in dairy cattle, it masks an animal's innate fertility performance, likely reducing the accuracy of genetic evaluations for fertility traits. Therefore, separating fertility traits based on the recorded management technique involved in the breeding process or adding the breeding protocol as an effect to the model can be viable approaches to address the potential bias caused by such management decisions. Nevertheless, there is a lack of specificity and uniformity in the recording of breeding protocol descriptions by dairy farmers. Therefore, this study investigated the use of 8 supervised machine learning algorithms to classify 1,835 unique breeding protocol descriptions from 981 herds into the following 2 classes: TAI or other than TAI. Our results showed that models that used a stacking classifier algorithm had the highest Matthews correlation coefficient (0.94 ± 0.04, mean ± SD) and maximized precision and recall (F1-score = 0.96 ± 0.03) on test data. Nonetheless, their F1-scores on test data were not different from 5 out of the other 7 algorithms considered. Altogether, results presented herein suggest machine learning algorithms can be used to produce robust models that correctly identify TAI protocols from dairy cattle breeding records, thus opening the opportunity for unbiased genetic evaluation of animals based on their natural fertility.
In the dairy industry, mate allocation is dependent on the producer’s breeding goals and the parents’ breeding values. The probability of pregnancy differs among sire-dam combinations, and the compatibility of a pair may vary due to the combination of gametic haplotypes. Under the hypothesis that incomplete incompatibility would reduce the odds of fertilization, and complete incompatibility would lead to a non-fertilizing or lethal combination, deviation from Mendelian inheritance expectations would be observed for incompatible pairs. By adding an interaction to a transmission ratio distortion (TRD) model, which detects departure from the Mendelian expectations, genomic regions linked to gametic incompatibility can be identified. This study aimed to determine the genetic background of gametic incompatibility in Holstein cattle. A total of 283,817 genotyped Holstein trios were used in a TRD analysis, resulting in 422 significant regions, which contained 2075 positional genes further investigated for network, overrepresentation, and guilt-by-association analyses. The identified biological pathways were associated with immunology and cellular communication and a total of 16 functional candidate genes were identified. Further investigation of gametic incompatibility will provide opportunities to improve mate allocation for the dairy cattle industry.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.