“…When evaluated on the EMIDEC training dataset with ground-truth labels, the Attri-VAE approach achieved an accuracy (0.98) equivalent to that of the best challenge participants reporting their performance on the same dataset (1.0 (Lourenço et al, 2021), 0.95 (Shi et al, 2021), 0.94 (Ivantsits et al, 2021) and 0.90 (Sharma et al, 2021)). On the EMIDEC testing dataset (Lalande et al, 2021), the best participant method obtained a lower accuracy (0.82; Lourenço et al, 2021; Girum et al, 2021), increasing to 0.92 for the challenge organizers (Shi et al, 2021). On the ACDC dataset, which was used as an external test database (i.e., it was not used in training), classification accuracy was substantially lower (0.59), worse than the results reported by challenge participants (0.96; Bernard et al, 2018) for classifying between the different pathologies (not only between healthy and myocardial infarction).…”