n-gram models, repeat distribution, and edit distance), and data generated by different stochastic 30 processes (entropy rate and n-grams). However, the string edit (Levenshtein) distance performed 31 consistently and significantly better than all other tested metrics (including entropy, Markov 32 chains, n-grams, mutual information) for all empirical datasets, despite being less commonly used 33 in the field of animal acoustic communication.