Search citation statements
Paper Sections
Citation Types
Year Published
Publication Types
Relationship
Authors
Journals
Data cleaning is a core process when it comes to using data from dairy sensor technologies. This article presents guidelines for sensor data cleaning with a specific focus on dairy herd management and breeding applications. Prior to any data cleaning steps, context and purpose of the data use must be considered. Recommendations for data cleaning are provided in five distinct steps: 1) validate the data merging process, 2) get to know the data, 3) check completeness of the data, 4) evaluate the plausibility of sensor measures and detect outliers, and 5) check for technology related noise. Whenever necessary, the recommendations are supported by examples of different sensor types (bolus, accelerometer) collected in an international project (D4Dairy) or supported by relevant literature. To ensure quality and reproducibility, data users are required to document their approach throughout the process. The target group for these guidelines are professionals involved in the process of collecting, managing, and analyzing sensor data from dairy herds. Providing guidelines for data cleaning could help to ensure that the data used for analysis is accurate, consistent, and reliable, ultimately leading to more informed management decisions and better breeding outcomes for dairy herds.
Data cleaning is a core process when it comes to using data from dairy sensor technologies. This article presents guidelines for sensor data cleaning with a specific focus on dairy herd management and breeding applications. Prior to any data cleaning steps, context and purpose of the data use must be considered. Recommendations for data cleaning are provided in five distinct steps: 1) validate the data merging process, 2) get to know the data, 3) check completeness of the data, 4) evaluate the plausibility of sensor measures and detect outliers, and 5) check for technology related noise. Whenever necessary, the recommendations are supported by examples of different sensor types (bolus, accelerometer) collected in an international project (D4Dairy) or supported by relevant literature. To ensure quality and reproducibility, data users are required to document their approach throughout the process. The target group for these guidelines are professionals involved in the process of collecting, managing, and analyzing sensor data from dairy herds. Providing guidelines for data cleaning could help to ensure that the data used for analysis is accurate, consistent, and reliable, ultimately leading to more informed management decisions and better breeding outcomes for dairy herds.
The sustainability of the salmon farming industry is being challenged by increased mortality rates. Scotland’s open-source salmon production data provides the possibility of developing an industry-wide mortality monitoring model, valuable for identifying and addressing unexpected increases in mortality without needing data sharing agreements across different companies. This study aimed to utilize these data to develop a hierarchical dynamic linear model (DLM) for monitoring monthly mortality of maricultured Atlantic salmon in Scotland. We evaluated whether considering the hierarchical structure present in the data (country, region, and site) would improve mortality predictions when compared to the production cycle level DLMs developed in a previous study. Our findings demonstrated that the hierarchical DLM outperformed the production cycle level DLMs, confirming the value of this more complex modelling approach. Nevertheless, the hierarchical model, like the production cycle level DLMs, exhibited some uncertainty in the mortality predictions. When mortality is higher than expected, site level warnings are generated, which can encourage producers and inspectors to further investigate the cause. Between 2015 and 2020, approximately 25% of the production cycles and 50% of the sites encountered at least one warning, with most warnings happening in the summer and autumn months. Additionally, the hierarchical model enabled monitoring mortality at multiple levels. This information is useful for various stakeholders as part of a monitoring system, offering insights into mortality trends at national, regional, and sites levels that may benefit from strategic resource management. Recommendations for model improvements include utilizing shorter data aggregation periods, such as weekly, which are not currently available as open-source data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.