Discrimination of speech from nonspeeech in broadcast news based on modulation frequency features

Markaki, Maria; Stylianou, Yannis

doi:10.1016/j.specom.2010.08.007

Cited by 18 publications

(12 citation statements)

References 23 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The HOSVD has been applied in numerous application domains [29], such as image processing [9,32,39,61], pattern recognition [49,50,59,60,62], data mining and machine learning [33,34,52,53], signal processing [12,19,36,37,38,45], psychometrics [54,55,56], chemometrics [5], and biomedicine [16,40,41]. Aside from its use in applications, the HOSVD is also of considerable theoretical importance.…”

mentioning

confidence: 99%

A New Truncation Strategy for the Higher-Order Singular Value Decomposition

Vannieuwenhoven¹,

Vandebril²,

Meerbergen³

2012

SIAM J. Sci. Comput.

249

237

View full text Add to dashboard Cite

We present an alternative strategy to truncate the higher-order singular value decomposition (T-HOSVD). An error expression for an approximate Tucker decomposition with orthogonal factor matrices is presented, leading us to propose a novel truncation strategy for the HOSVD, which we refer to as the sequentially truncated higher-order singular value decomposition (ST-HOSVD). This decomposition retains several favorable properties of the T-HOSVD, while reducing the number of operations to compute the decomposition and practically always improving the approximation error. Three applications are presented, demonstrating the effectiveness of ST-HOSVD. In the first application, ST-HOSVD, T-HOSVD and Higher-Order Orthogonal Iteration (HOOI) are employed to compress a database of images of faces. On average, the ST-HOSVD approximation was only 0.1% worse than the optimum computed by HOOI, while cutting the execution time by a factor 20. In the second application, classification of handwritten digits, ST-HOSVD achieved a speedup of 50 over T-HOSVD during the training phase, reduced the classification time and storage costs, while not significantly affecting the classification error. The third application demonstrates the effectiveness of ST-HOSVD in compressing results from a numerical simulation of a partial differential equation. In such problems, ST-HOSVD inevitably can greatly improve the running time. We present an example wherein the 2 hour 45 minute calculation of T-HOSVD was reduced to just over one minute by ST-HOSVD, representing a speedup of 133, while even improving the memory consumption.

show abstract

mentioning

confidence: 99%

A New Truncation Strategy for the Higher-Order Singular Value Decomposition

Vannieuwenhoven¹,

Vandebril²,

Meerbergen³

2012

SIAM J. Sci. Comput.

249

237

View full text Add to dashboard Cite

show abstract

“…These characteristics are referred in the literature as segment-based features [29,30]. For example, in [31], a content-based speech discrimination algorithm is designed to exploit the long-term information inherent in the modulation spectrum; and in [32], authors propose two segment-based features: the variance of the spectrum flux (VSF) and the variance of the zero crossing rate (VZCR).…”

Section: General Description Of Audio Segmentation Systemsmentioning

confidence: 99%

Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains

Castán

Tavarez

López-Otero

et al. 2015

J AUDIO SPEECH MUSIC PROC.

View full text Add to dashboard Cite

Audio segmentation is important as a pre-processing task to improve the performance of many speech technology tasks and, therefore, it has an undoubted research interest. This paper describes the database, the metric, the systems and the results for the Albayzín-2014 audio segmentation campaign. In contrast to previous evaluations where the task was the segmentation of non-overlapping classes, Albayzín-2014 evaluation proposes the delimitation of the presence of speech, music and/or noise that can be found simultaneously. The database used in the evaluation was created by fusing different media and noises in order to increase the difficulty of the task. Seven segmentation systems from four different research groups were evaluated and combined. Their experimental results were analyzed and compared with the aim of providing a benchmark and showing up the promising directions in this field.

show abstract

“…Some of the widely used are: (i) multi-class problem, 10 (ii) binary-classes problem, 37 (iii) hierarchical structure of the classes problem, 11 (iv) two-groups or multi-group of classes problem 28 and (v) detection of a class over the other classes problem. 19,48 In this work, we present a broadcast news sound recognition methodology based on widely known and used audio features. The implemented framework clusters the audio feature space to subspaces, based on data-driven criteria.…”

Section: Introductionmentioning

confidence: 99%

Data-Driven Audio Feature Space Clustering for Automatic Sound Recognition in Radio Broadcast News

Theodorou

Mporas

Lazaridis

et al. 2017

Int. J. Artif. Intell. Tools

View full text Add to dashboard Cite

Aiming to an automatic sound recognizer for radio broadcasting events, a methodology of clustering the audio feature space using the discrimination ability of the audio descriptors as a criterion, is investigated in this work. From a given and close set of audio events, commonly found in broadcast news transmissions, a large set of audio descriptors is extracted and their data-driven ranking of relevance is clustered, providing a more robust feature selection. The clusters of the feature space are feeding machine learning algorithms implemented as classification models during the experimental evaluation. This methodology showed that support vector machines provide significantly good results, considering the achieved accuracy due to their ability of coping well in high dimensionality experimental conditions.

show abstract

Discrimination of speech from nonspeeech in broadcast news based on modulation frequency features

Cited by 18 publications

References 23 publications

A New Truncation Strategy for the Higher-Order Singular Value Decomposition

A New Truncation Strategy for the Higher-Order Singular Value Decomposition

Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains

Data-Driven Audio Feature Space Clustering for Automatic Sound Recognition in Radio Broadcast News

Contact Info

Product

Resources

About