“…The earlier ones [24][25][26] were based on small-scale databases, in which only a small number of playback and recording conditions were taken into account. For example, in [24,27], three playback and recording devices were used to collect the database; in [25,28], one recording device and one playback device were used to create the database, which is named the authentic and playback speech database (APSD); in [29], the database was built using four smartphones; and in [26], four devices were used to create the playback utterances in the database, which is named AVspoof 2015 (audio-visual spoofing 2015). In contrast to these databases, the ASVspoof 2017 corpus provided a large common database, obtained using 26 playback devices, 25 recording devices, and 26 environments [1,2,30].…”