2012 35th International Conference on Telecommunications and Signal Processing (TSP) 2012
DOI: 10.1109/tsp.2012.6256338
|View full text |Cite
|
Sign up to set email alerts
|

Broadcast news audio classification using SVM binary trees

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 13 publications
0
5
0
Order By: Relevance
“…Temporal Audio Features: these features are directly calculated from the audio samples. The most known timedomain features are Short-term energy [98] [99], energy entropy and zero crossing rate [98]. These features will be utilized in the feature extraction stage of our technique because they guarantee simple and good mean for audio signals analysis.…”
Section: ) Proposed Scheme For Content Characterization Using Audio Watermarkingmentioning
confidence: 99%
See 1 more Smart Citation
“…Temporal Audio Features: these features are directly calculated from the audio samples. The most known timedomain features are Short-term energy [98] [99], energy entropy and zero crossing rate [98]. These features will be utilized in the feature extraction stage of our technique because they guarantee simple and good mean for audio signals analysis.…”
Section: ) Proposed Scheme For Content Characterization Using Audio Watermarkingmentioning
confidence: 99%
“…These metrics are computed using Discrete Fourier Transform (DFT) coefficients of the designed audio frame. The most known spectral-domain features are spectral flux , spectral centroid, [100], spectral roll off, Mel-Frequency Cepstrum Coefficients (MFCCs) [98,100], chroma vector [101] and Relative Spectral Analysis-Perceptual Linear Prediction (Rasta PLP) [102].…”
Section: ) Proposed Scheme For Content Characterization Using Audio Watermarkingmentioning
confidence: 99%
“…Since the broadcasting FM channels demand the news content classification of broadcasting context, Vavrek, J. et al [20] also proposed a hierarchical tree to address the news content classification problem. This hierarchical classification strategy is used as a particular feature set for each SVM binary classifier.…”
Section: Same As Neural Network Approaches the Hidden Markovmentioning
confidence: 99%
“…In addition to that, 16000 Hz and 22050 Hz were also used as the sample rates in previous works related to radio broadcasting classification. Weeratunga et al [15] proposed 22050 Hz as the sample rate, John Saunders [16] and Vavrek, J. et al [20] proposed 16000 Hz as the sample rates for their studies. Therefore, we evaluate the models with sample rates 16000 Hz and 22050 Hz, and 44100 Hz to find the most reliable sample rate for the research.…”
Section: B Different Frame Sizesmentioning
confidence: 99%
“…2. Methods based on the template matching of features extracted from the query/utterance speech signal [7,8,[17][18][19][20][21][22][23][24][25][26][27][28][29]. They usually borrow the idea from dynamic time warping (DTW)-based speech recognition and were found to outperform phonetic transcription-based techniques on QbE STD [18].…”
Section: Introductionmentioning
confidence: 99%