Machine Learning with Known Input Data Uncertainty Measure

Czarnecki, Wojciech Marian; Podolak, Igor T.

doi:10.1007/978-3-642-40925-7_35

Cited by 17 publications

(13 citation statements)

References 9 publications

“…To achieve the desired accuracy, it is essential to generate a reference for selecting the parameters that need to be recorded. The current study considers the use of different datasets as a useful method to ascertain the appropriate data that may have fewer variables and significant implications for predictions indeed [20, 21, 36, 37]. So that the current study adopts the Spearman rank correlation coefficient approach in order to extract the best features, which is a commonly followed method to explore the relationships between attributes.…”

Section: Methods (mentioning)

confidence: 99%
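
The Spearman-based feature screening mentioned in the statement above can be sketched roughly as follows. This is a minimal illustration, not the citing study's implementation: the column names, the sample values, and the 0.5 cutoff are hypothetical assumptions chosen only to show the mechanics (rank both variables, then take the Pearson correlation of the ranks).

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def select_features(features, target, threshold=0.5):
    """Keep features whose |rho| with the target meets the (assumed) cutoff,
    ordered from strongest to weakest association."""
    scores = {name: spearman(col, target) for name, col in features.items()}
    ranked = sorted(scores.items(), key=lambda kv: -abs(kv[1]))
    return [name for name, rho in ranked if abs(rho) >= threshold]


# Hypothetical data: one informative input and one weakly related input.
indoor_temp = [20.1, 21.0, 22.5, 24.0, 25.2]
candidates = {
    "outdoor_temp": [10, 12, 15, 19, 22],
    "noise": [3, 1, 4, 1, 5],
}
selected = select_features(candidates, indoor_temp)
```

Because the coefficient depends only on ranks, it captures any monotonic relationship, not just a linear one; the cutoff would in practice be chosen per dataset.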

“…Determination of input data is the bottom line of any modelling criteria yet crucial consideration in diagnosing the exquisite functional form of ML models. Choosing the right input variables involves improving the accuracy of the algorithm; also, it dominates the calculation speed, training time, training complexity, comprehensibility, and computational effort of the simulation [20, 21, 22]. The present study analyzes the performance of the models with feature-selected datasets and available datasets; it also suggests the optimal input selection to feed the models from the available datasets.…”

Section: Introduction (mentioning)

confidence: 99%

Machine Learning-Based Microclimate Model for Indoor Air Temperature and Relative Humidity Prediction in a Swine Building

Arulmozhi, Basak, Sihalath, et al. 2021. Animals

Indoor air temperature (IAT) and indoor relative humidity (IRH) are the prominent microclimatic variables; still, potential contributors that influence the homeostasis of livestock animals reared in closed barns. Further, predicting IAT and IRH encourages farmers to think ahead actively and to prepare the optimum solutions. Therefore, the primary objective of the current literature is to build and investigate extensive performance analysis between popular ML models in practice used for IAT and IRH predictions. Meanwhile, multiple linear regression (MLR), multilayered perceptron (MLP), random forest regression (RFR), decision tree regression (DTR), and support vector regression (SVR) models were utilized for the prediction. This study used accessible factors such as external environmental data to simulate the models. In addition, three different input datasets named S1, S2, and S3 were used to assess the models. From the results, RFR models performed better results in both IAT (R² = 0.9913; RMSE = 0.476; MAE = 0.3535) and IRH (R² = 0.9594; RMSE = 2.429; MAE = 1.47) prediction among other models particularly with S3 input datasets. In addition, it has been proven that selecting the right features from the given input data builds supportive conditions under which the expected results are available. Overall, the current study demonstrates a better model among other models to predict IAT and IRH of a naturally ventilated swine building containing animals with fewer input attributes.

“…Uncertainty in data, and therefore in all types of data science models, introduces the risk of poor decision outcomes because of biases, drift and lack of precision in individual sensor systems (Wolfert et al, 2017). Further, as the volume and variety of data increases, so do the uncertainties inherent within (Czarnecki and Podolak, 2013; Hariri et al, 2019); big data is often subject to noise, incompleteness, bias and inconsistency (Hariri et al, 2019; Sharifi et al, 2020), and may often be disparate, dynamic, untrustworthy, and inter-related (Wang and Jones, 2017).…”

Section: Limitations and Challenges (mentioning)

confidence: 99%

“…The risk of poor decision outcomes is particularly true in analytics that combine non-traditional information sources such as rapidly arriving data from sensors, process models, qualitative information and user behavior (Wynne, 1992). Using multiple disparate data sources means compounding data uncertainty originating from the data collection, data curation and combination from multiple sources (Czarnecki and Podolak, 2013; Hariri et al, 2019). Communicating uncertainty in data can introduce further complexities, and uncertainties are sometimes ignored, or even explicitly denied (van der Bles et al, 2019). Uncertainty in the data collection, analysis and knowledge extension processes can lead to a lack of confidence in the resulting model outputs and decision made thereof.…”

Section: Approximation and Uncertainty (mentioning)

confidence: 99%

Integrating Digital Technologies to Aid Grassland Productivity and Sustainability

Stevens, Thompson, Johnson, et al. 2021. Front. Sustain. Food Syst.

Digital technologies provide an opportunity to further increase the sustainability and productivity of grasslands and rangelands. Three resources are key to that change. These are the soil on which forage grows, the forages that grow on those soils and the animals that use the forage resource as food. This paper describes elements of technologies to measure and monitor these resources and provides some insights on combining that knowledge and controlling the animal's utilization through virtual fencing. There are many potential challenges to the application of digital technologies to pastoral farming. These often require the calibration of digital signals to define biophysical characteristics. The significant repository of historic data of pasture growth over many geo-climatic regions, for example, provides New Zealand with an opportunity to accelerate that development. Future advances in rangeland use, nutrient deposition, greenhouse gas emissions and the provision and utilization of high quality and quantity will be enabled by the application of digital technologies at scale, under the control of virtual fencing. Digital technologies may provide the means to maintain or enhance ruminant production from grassland in a sustainable operating space into the future.

Generation of Computational Data Sets for Machine Learning Applied to Battery Materials

Bhowmik, Bölle, Castelli, et al. 2021. Atomic‐Scale Modelling of Electrochemical Systems
