“…In a case of tonal bird vocalisations, the use of a sinusoidal detection for segmentation also offers a natural way of representing the segment as a temporal sequence of the frequencies of the detected sinusoid, which we refer to as frequency track. This representation was employed in a few earlier studies [1], [6] and also in our recent works [3], [4], [7], [8], [9], [10]. Among the acoustic modelling approaches, the most commonly used are Gaussian mixture models (GMM) [1], [3], hidden Markov models (HMMs) [1], [4], [6], [11], and decision trees [12].…”