Symposium review: Big data, big predictions: Utilizing milk Fourier-transform infrared and genomics to improve hyperketonemia management

Breeding objectives in the dairy industry have shifted from being solely focused on production to including fertility, animal health, and environmental impact. Increased serum concentrations of candidate biomarkers of health and fertility, such as β-hydroxybutyric acid (BHB), fatty acids, and urea are difficult and costly to measure, and thus limit the number of records. Accurate genomic prediction requires a large reference population. The inclusion of milk mid-infrared (MIR) spectroscopic predictions of biomarkers may increase genomic prediction accuracy of these traits. Our objectives were to (1) estimate the heritability of, and genetic correlations between, selected serum biomarkers and their respective MIR predictions, and (2) evaluate genomic prediction accuracies of either only measured serum traits, or serum traits plus MIR-predicted traits. The MIR-predicted traits were either fitted in a single trait model, assuming the measured trait and predicted trait were the same trait, or in a multitrait model, where measured and predicted trait were assumed to be correlated traits. We performed all analyses using relationship matrices constructed from pedigree (A matrix), genotypes (G matrix), or both pedigree and genotypes (H matrix). Our data set comprised up to 2,198 and 9,657 Holstein cows with records for serum biomarkers and MIR-predicted traits, respectively. Heritabilities of measured serum traits ranged from 0.04 to 0.07 for BHB, from 0.13 to 0.21 for fatty acids, and from 0.10 to 0.12 for urea. Heritabilities for MIR-predicted traits were not significantly different from those for the measured traits. Genetic correlations between measured traits and MIR-predicted traits were close to 1 for urea. For BHB and fatty acids, genetic correlations were lower and had large standard errors. The inclusion of MIR predicted urea substantially increased prediction accuracy for urea. For BHB, including MIR-predicted BHB reduced the genomic prediction accuracy, whereas for fatty acids, prediction accuracies were similar with either measured fatty acids, MIR-predicted fatty acids, or both. The high genetic correlation between urea and MIR-predicted urea, in combination with the increased prediction accuracy, demonstrated the potential of using MIR-predicted urea for genomic prediction of urea. For BHB and fatty acids, further studies with larger data sets are required to obtain more accurate estimates of genetic correlations.

Section: Many Authors Have Reported Promising Accuracies Whenmentioning

confidence: 99%

The use of milk mid-infrared spectroscopy to improve genomic prediction accuracy of serum biomarkers

Berg

Luke

et al. 2021

“…Such problems not only impair farm profitability, they also directly increase veterinary and reproductive costs (Hogeveen et al, 2011;Shalloo et al, 2014), environmental losses (Bell et al, 2013), and affect animal welfare outcomes (Oltenacu and Broom, 2010). Given the high incidence and the costs of these disorders, there has been growing interest in predicting the metabolic status of dairy cows in the early stages of lactation (see the recent review of Pralle and White, 2020). This information could either be used to help farmers make informed interventions to prevent the development of these diseases, or to generate novel phenotypes for genetic improvement purposes, most likely through genomic selection.…”

Section: Introductionmentioning

confidence: 99%

“…Furthermore, many studies have reported promising accuracies from using milk MIR to predict serum metabolic profiles, with the R 2 obtained through random cross-validation ranging between 0.21 and 0.92 (Belay et al, 2017;Grelet et al, 2018;Pralle et al, 2018;Benedet et al, 2019;Luke et al, 2019b). Although the initial results are promising, the models should be properly validated, preferably through external validation, before they can be implemented for farmers to use as management tools (Pralle and White, 2020). This is because random cross validation is often overly optimistic when compared with the more stringent external validation (i.e., using data from a different herd; Wang and Bovenhuis, 2019).…”

Section: Introductionmentioning

confidence: 99%

Validation of milk mid-infrared spectroscopy for predicting the metabolic status of lactating dairy cows in Australia

Luke

Pryce

2021

Increased concentrations of some serum biomarkers are known to be associated with impaired health of dairy cows. Therefore, being able to predict these biomarkers, especially in the early stage of lactation, would enable preventive management decision. Some health biomarkers may also be used as phenotypes for genetic improvement for improved animal health. In this study, we validated the accuracy and robustness of models for predicting serum concentrations of β-hydroxybutyrate (BHB), fatty acids, and urea nitrogen, using milk midinfrared (MIR) spectroscopy. The data included 3,262 blood samples of 3,027 lactating Holstein-Friesian cows from 19 dairy herds in Southeastern Australia, collected in the period from July 2017 to April 2020. The models were developed using partial least squares regression and were validated using 10-fold random cross-validation, herd-year by herd-year external validation, and year by year validation. The coefficients of determination (R 2 ) for prediction of serum BHB, fatty acids, and urea obtained through random cross-validation were 0.60, 0.42, and 0.87, respectively. For the herd-year by herd-year external validation, the prediction accuracies held up comparatively well, with R 2 values of 0.49, 0.33, and 0.67 for of serum BHB, fatty acids, and urea, respectively. When the models were developed using data from a single year to predict data collected in future years, the R 2 remained comparable, however, the root mean squared errors increased substantially (4-10 times larger than compared with that of herd-year by herd-year external validation) which could be due to machine differences in spectral response, the change in spectral response of individual machines over time, or other differences associated with farm management between seasons. In conclusion, the mid-infrared equations for predicting serum BHB, fatty acids, and urea have been validated. The prediction equations could be used to help farmers detect cows with metabolic disorders in early lactation in addition to generating novel phenotypes for genetic improvement purposes.

“…We did further investigations on other factors, such as differences in the interval between calving to first service and breed between herd-years, but could not find any systematic patterns. Possibly, collecting more data from a diverse portfolio of herds and production systems would improve the robustness of the models (Pralle and White, 2020). In the meantime, we recommend cautious practical application of this model until this issue is resolved.…”

Section: Resultsmentioning

confidence: 99%

Predicting the likelihood of conception to first insemination of dairy cows using milk mid-infrared spectroscopy

Pryce

2020

The objective of this study was to examine the ability of milk mid-infrared (MIR) spectroscopy and other onfarm data, such as milk yield, milk composition, stage of lactation, calving age, days in milk at insemination, and somatic cell count, to identify cows that were most or least likely to conceive to first insemination. A total of 16,628 spectral and milk production records of 7,040 cows from 29 commercial dairy herds across 3 Australian states were used. Three models, comprising different explanatory variables, were tested. Model 1 included features that are readily available on farms participating in milk recording, such as milk yield, milk composition, somatic cell count, days from calving to insemination, and calving season. Days in milk and age at calving were incorporated into model 1 to form model 2. In model 3, MIR was added to model 2, but to avoid double counting, milk composition traits of model 2 were removed. The models were first trained on extreme data [i.e., including cows that (1) conceived to first insemination and (2) cows with no conception event recorded and with only 1 insemination]. Then, the models were validated in a fresh data set with all cows regardless of conception outcomes present to test for their ability to identify cows that conceived or did not conceive to first insemination. To do this, we ranked the predicted probability of all cows in the validation set and then selected the top and bottom records in varying proportions from 5 to 40% (i.e., where the model predicted the highest versus lowest likelihood of conception to first insemination, respectively) and compared with the actual values. The model's performance was evaluated through herd-year by herd-year external validation and measured as the proportion of selected records being correct. The results show that when more cows are selected (i.e., descending confidence), the accuracy of the models was reduced, and selecting the 10% of cows with the highest confidence of predictions produces optimal accuracy. Irrespective of the proportions, none of the models could predict cows that conceived to first insemination, with an accuracy around 0.48. When attempting to predict the bottom 10% of cows, which had the least likelihood of conception to first insemination, model 1 had prediction accuracy around 0.64. Compared with model 1, the addition of days in milk and calving age (model 2) resulted in a negligible improvement in prediction accuracy (0.01 to 0.03). Model 3 had the highest prediction accuracy (0.76), which implies that in the models tested, MIR is of primary importance in the prediction of fertility of dairy cows. In conclusion, this study indicates that MIR and other milk recording data could be used to identify cows with potential difficulty in getting pregnant to first insemination with promising accuracy.