k-Fold Cross-Validation Can Significantly Over-Estimate True Classification Accuracy in Common EEG-Based Passive BCI Experimental Designs: An Empirical Investigation

White, Jacob; Power, Sarah

doi:10.3390/s23136077

Cited by 11 publications

(6 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Also, the results of fve-fold cross-validation protocol are given Table 13. As observed, this protocol outperforms the cross-subject cross-validation, while due to the following reasons, the cross-subject protocol is most popular than the k-fold one in BCI applications [32][33][34].…”

Section: Efect Of Data Augmentationmentioning

confidence: 77%

Fusion of Deep Features from 2D-DOST of fNIRS Signals for Subject-Independent Classification of Motor Execution Tasks

Khani,

Solouk,

Kalbkhani

et al. 2023

International Journal of Intelligent Systems

View full text Add to dashboard Cite

Functional near-infrared spectroscopy (fNIRS) is a low-cost and noninvasive method to measure the hemodynamic responses of cortical brain activities and has received great attention in brain-computer interface (BCI) applications. In this paper, we present a method based on deep learning and the time-frequency map (TFM) of fNIRS signals to classify the three motor execution tasks including right-hand tapping, left-hand tapping, and foot tapping. To simultaneously obtain the TFM and consider the correlation among channels, we propose to utilize the two-dimensional discrete orthonormal Stockwell transform (2D-DOST). The TFMs for oxygenated hemoglobin (HbO), reduced hemoglobin (HbR), and two linear combinations of them are obtained and then we propose three fusion schemes for combining their deep information extracted by the convolutional neural network (CNN). Two CNNs, LeNet and MobileNet, are considered and their structures are modified to maximize the accuracy. Due to the lack of enough signals for training CNNs, data augmentation based on the Wasserstein generative adversarial network (WGAN) is performed. Several simulations are performed to assess the performance of the proposed method in three-class and binary scenarios. The results present the efficiency of the proposed method in different scenarios. Also, the proposed method outperforms the recently introduced methods.

show abstract

Section: Efect Of Data Augmentationmentioning

confidence: 77%

Fusion of Deep Features from 2D-DOST of fNIRS Signals for Subject-Independent Classification of Motor Execution Tasks

Khani,

Solouk,

Kalbkhani

et al. 2023

International Journal of Intelligent Systems

View full text Add to dashboard Cite

show abstract

“…However, in machine learning models based on radiomics, k-fold cross-validation risks data leakage and introduces errors. Also, inappropriate k values, imbalanced training and validation set division, and unshuffled data order affect model prediction [ 59 , 60 ]. This study adopted random grouping and k-fold cross-validation for parameter selection during model building.…”

Section: Discussionmentioning

confidence: 99%

CT-based delta-radiomics nomogram to predict pathological complete response after neoadjuvant chemoradiotherapy in esophageal squamous cell carcinoma patients

Fan,

Yang,

Chang

et al. 2024

J Transl Med

View full text Add to dashboard Cite

Background This study developed a nomogram model using CT-based delta-radiomics features and clinical factors to predict pathological complete response (pCR) in esophageal squamous cell carcinoma (ESCC) patients receiving neoadjuvant chemoradiotherapy (nCRT). Methods The study retrospectively analyzed 232 ESCC patients who underwent pretreatment and post-treatment CT scans. Patients were divided into training (n = 186) and validation (n = 46) sets through fivefold cross-validation. 837 radiomics features were extracted from regions of interest (ROIs) delineations on CT images before and after nCRT to calculate delta values. The LASSO algorithm selected delta-radiomics features (DRF) based on classification performance. Logistic regression constructed a nomogram incorporating DRFs and clinical factors. Receiver operating characteristic (ROC) and area under the curve (AUC) analyses evaluated nomogram performance for predicting pCR. Results No significant differences existed between the training and validation datasets. The 4-feature delta-radiomics signature (DRS) demonstrated good predictive accuracy for pCR, with α-binormal-based and empirical AUCs of 0.871 and 0.869. T-stage (p = 0.001) and differentiation degree (p = 0.018) were independent predictors of pCR. The nomogram combined the DRS and clinical factors improved the classification performance in the training dataset (AUCαbin = 0.933 and AUCemp = 0.941). The validation set showed similar performance with AUCs of 0.958 and 0.962. Conclusions The CT-based delta-radiomics nomogram model with clinical factors provided high predictive accuracy for pCR in ESCC patients after nCRT.

show abstract

“…For model evaluation, we employ k-fold cross-validation, indicated by the cv = 5 parameter in the RandomizedSearchCV function. By specifying cv = 5, we perform 5-fold cross-validation, iteratively splitting the training data into five equal-sized folds for training and validation [37]. Utilizing multiple folds enhances performance estimates' robustness and reduces variability compared to a single train-test split.…”

Section: Model Training and Evaluationmentioning

confidence: 99%

Advanced Machine Learning Techniques for Predictive Modeling of Property Prices

Mathotaarachchi,

Hasan,

Mahmood

2024

Information

View full text Add to dashboard Cite

Real estate price prediction is crucial for informed decision making in the dynamic real estate sector. In recent years, machine learning (ML) techniques have emerged as powerful tools for enhancing prediction accuracy and data-driven decision making. However, the existing literature lacks a cohesive synthesis of methodologies, findings, and research gaps in ML-based real estate price prediction. This study addresses this gap through a comprehensive literature review, examining various ML approaches, including neural networks, ensemble methods, and advanced regression techniques. We identify key research gaps, such as the limited exploration of hybrid ML-econometric models and the interpretability of ML predictions. To validate the robustness of regression models, we conduct generalization testing on an independent dataset. Results demonstrate the applicability of regression models in predicting real estate prices across diverse markets. Our findings underscore the importance of addressing research gaps to advance the field and enhance the practical applicability of ML techniques in real estate price prediction. This study contributes to a deeper understanding of ML’s role in real estate forecasting and provides insights for future research and practical implementation in the real estate industry.

show abstract

k-Fold Cross-Validation Can Significantly Over-Estimate True Classification Accuracy in Common EEG-Based Passive BCI Experimental Designs: An Empirical Investigation

Cited by 11 publications

References 17 publications

Fusion of Deep Features from 2D-DOST of fNIRS Signals for Subject-Independent Classification of Motor Execution Tasks

Fusion of Deep Features from 2D-DOST of fNIRS Signals for Subject-Independent Classification of Motor Execution Tasks

CT-based delta-radiomics nomogram to predict pathological complete response after neoadjuvant chemoradiotherapy in esophageal squamous cell carcinoma patients

Advanced Machine Learning Techniques for Predictive Modeling of Property Prices

Contact Info

Product

Resources

About