“…A close look at the existing methods reveals that, firstly, the complex deep models were used with relatively small datasets, which is an indicator of the overfitting problem; the methods in [11,14,15,22,24,25] achieved accuracies of 98.07%, 99%, 99.25%, 99.9%, 96.83%, and 99.5%, respectively. Moreover, some studies [11][12][13][14]16,21,22,24,27] used EEG trials of long temporal lengths in a range between 20 to 60 s, which requires more computation and time. Also, in [23], the authors adopted only two channels, FP1 and FP2, without providing enough information on why they chose them.…”