2021
DOI: 10.3390/s21124187
|View full text |Cite
|
Sign up to set email alerts
|

Artificial Breath Classification Using XGBoost Algorithm for Diabetes Detection

Abstract: Exhaled breath analysis has become more and more popular as a supplementary tool for medical diagnosis. However, the number of variables that have to be taken into account forces researchers to develop novel algorithms for proper data interpretation. This paper presents a system for analyzing exhaled air with the use of various sensors. Breath simulations with acetone as a diabetes biomarker were performed using the proposed e-nose system. The XGBoost algorithm for diabetes detection based on artificial breath… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
53
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 58 publications
(53 citation statements)
references
References 77 publications
(79 reference statements)
0
53
0
Order By: Relevance
“…Secondly, for high-dimensional scene data exploration (such as medical time-series data), the XGBoost algorithm cannot effectively eliminate noise variables ( 57 ). Therefore, we conducted a grid search to determine the algorithms of optimal dimensionality reduction and added randomness to improve robustness ( 58 ). Additionally, an increasing fraction of the training time in the LSTM model would reduce the number of iterations within the same total training time ( 59 ).…”
Section: Discussionmentioning
confidence: 99%
“…Secondly, for high-dimensional scene data exploration (such as medical time-series data), the XGBoost algorithm cannot effectively eliminate noise variables ( 57 ). Therefore, we conducted a grid search to determine the algorithms of optimal dimensionality reduction and added randomness to improve robustness ( 58 ). Additionally, an increasing fraction of the training time in the LSTM model would reduce the number of iterations within the same total training time ( 59 ).…”
Section: Discussionmentioning
confidence: 99%
“…The Python Data Analysis Library was used as an accurate and flexible data manipulation tool to first analyze the data. Next, a decision-tree-based method—extreme gradient boosting—was adopted [ 21 ]. This method was selected to deal with distributed problems, microwave theoretical analysis, and calculations owing to its advantages of high-speed dynamic response and memory ability.…”
Section: Methodsmentioning
confidence: 99%
“…It can handle both classification and regression problems by selecting different baseline models, and the final result is achieved by adding new models to the existing model until its performance is no longer greatly improved. The main advantage of XGB is its speed, thanks to its parallel and cache‐aware computing [37] . Additionally, it includes a regularization term to prevent over‐fitting and shrinkage and column subsampling to further reduce the risk of over‐fitting.…”
Section: Methodsmentioning
confidence: 99%