“…A potential solution lies in the ML-driven detection of such deepfakes using, for example, binary classifiers to discriminate between genuine/bonafide and AI-generated speech. The field has witnessed a surge in research, from the creation of extensive datasets [10,11,12,13,14,15,16] to the development of new detection models [17,18,19,20,21,22]. Most notably, initiatives such as ASVspoof [23,24,25] which were launched to benchmark competing detection solutions, seemingly show impressive progress; lower and lower state-of-the-art error rates are reported on a regular basis [21,22,26].…”