“…First, two single-feature SVM-based systems were developed using two types of compact parameterization: the average of the MFCCs and the average of the modulation spectrogram. The first parameterization, MFCC, is a widely used feature extraction procedure in audio- and speech-related tasks (see, for example, [38,31]), and for this reason it was applied to the task under consideration in our previous work [10]. MFCCs are extracted on a frame-by-frame basis by applying the Discrete Cosine Transform to the log-mel spectrogram of the speech signal (see Subsection 4.2) and retaining the first 13 coefficients.…”
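
The extraction step described in the excerpt (a per-frame DCT of the log-mel spectrogram, truncation to the first 13 coefficients, then averaging over frames) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the function names, the unnormalized DCT-II variant, and the 40-band toy input are assumptions made for illustration, and the log-mel frames are taken as already computed.

```python
import math


def dct_ii(x):
    """Unnormalized DCT-II of a sequence (normalization is an assumption;
    the excerpt does not say which DCT scaling is used)."""
    n = len(x)
    return [
        sum(x[i] * math.cos(math.pi * k * (2 * i + 1) / (2 * n)) for i in range(n))
        for k in range(n)
    ]


def mfcc_from_log_mel(log_mel_frames, n_coeffs=13):
    """Per-frame MFCCs: DCT of each log-mel frame, keeping the first n_coeffs."""
    return [dct_ii(frame)[:n_coeffs] for frame in log_mel_frames]


def average_mfcc(log_mel_frames, n_coeffs=13):
    """Compact parameterization: mean of each retained coefficient over frames."""
    mfccs = mfcc_from_log_mel(log_mel_frames, n_coeffs)
    n_frames = len(mfccs)
    n_kept = len(mfccs[0])  # may be < n_coeffs if a frame has fewer bands
    return [sum(frame[k] for frame in mfccs) / n_frames for k in range(n_kept)]


# Toy input: two frames of 40 log-mel band energies (band count is hypothetical).
toy_frames = [[1.0] * 40, [3.0] * 40]
feature_vector = average_mfcc(toy_frames)
print(len(feature_vector))
```

The result is one fixed-length vector per utterance regardless of its duration, which is the kind of input a single-feature SVM system can take. In practice a library routine such as a vectorized DCT would replace the explicit loops.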