“…However, some papers simply did not separate training and testing patients, which could lead to biased measurements of their performance. The results reported by these contributions are interesting as preliminary work, but cannot safely be considered representative of real, prospective usage. Others measured their performance by employing a patient-wise validation method [11,12,21,22,26,27,32,29,33,36,47,48,51,54,55,57,58,61,62,63,64,65,66,69,70,72,73,74,75,76,77,78,79,80,81,82]. While this does not ensure the complete absence of data leakage (which could occur, for example, by mixing the validation and testing sets, or by selecting or normalizing features using the whole database), these results can be considered more reliable, and more representative of prospective-usage performance.…”
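
As an illustration of the patient-wise validation described in the excerpt, the following is a minimal sketch rather than the procedure of any cited paper. It assumes each sample carries a patient identifier; the function names (`patient_wise_split`, `fit_standardizer`), the toy data, and the 25% test fraction are hypothetical. Patients, not individual samples, are assigned to the training or testing set, and the normalization statistics are estimated from the training samples only, which avoids the "whole database" leakage mentioned above.

```python
import random
from statistics import mean, pstdev


def patient_wise_split(patient_ids, test_fraction=0.25, seed=0):
    """Return (train_idx, test_idx) such that no patient appears in both sets."""
    unique_patients = sorted(set(patient_ids))
    rng = random.Random(seed)
    rng.shuffle(unique_patients)
    # Hold out whole patients, never individual samples.
    n_test = max(1, round(len(unique_patients) * test_fraction))
    test_patients = set(unique_patients[:n_test])
    train_idx = [i for i, p in enumerate(patient_ids) if p not in test_patients]
    test_idx = [i for i, p in enumerate(patient_ids) if p in test_patients]
    return train_idx, test_idx


def fit_standardizer(values):
    """Estimate mean and standard deviation from the given values only."""
    mu = mean(values)
    sigma = pstdev(values) or 1.0  # guard against a constant feature
    return mu, sigma


# Hypothetical toy data: two samples per patient, one scalar feature each.
patient_ids = ["p1", "p1", "p2", "p2", "p3", "p3", "p4", "p4"]
feature = [0.1, 0.2, 1.1, 1.3, 0.4, 0.5, 2.0, 2.2]

train_idx, test_idx = patient_wise_split(patient_ids)

# Normalization statistics come from the training patients only,
# then the same transform is applied unchanged to the test patients.
mu, sigma = fit_standardizer([feature[i] for i in train_idx])
train_scaled = [(feature[i] - mu) / sigma for i in train_idx]
test_scaled = [(feature[i] - mu) / sigma for i in test_idx]
```

The same principle extends to feature selection and hyperparameter tuning: any step that looks at the data should be fitted on the training patients alone, with a validation set kept separate from the final test set.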