2014
DOI: 10.3906/elk-1207-74
|View full text |Cite
|
Sign up to set email alerts
|

Source microphone identification from speech recordings based on a Gaussian mixture model

Abstract: Abstract:Microphone identification is a specific type of media forensics that investigates whether it is possible to identify the source microphone from speech recordings. The main aim of this study is to find out which of the several feature extraction techniques are best suited to the source microphone identification systems. We perform microphone identification experiments with 16 different microphones using 3 datasets. In order to improve the results on the datasets, we also investigate the important param… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(6 citation statements)
references
References 26 publications
0
6
0
Order By: Relevance
“…In this approach, voice samples were modeled as a weighted sum of multivariate Gaussian probability density functions. In the GMM parameter estimation, the distribution of features is modeled by the mean vectors , covariance matrices ∑ i , and mixture weights c i which is denoted by the notation Θ = { c i , μ i , ∑ i }, i = 1,2,…, K , where K is the number of mixture components [ 39 ]. These model parameters (Θ) are commonly determined using expectation maximization (EM) algorithm.…”
Section: Methodsmentioning
confidence: 99%
“…In this approach, voice samples were modeled as a weighted sum of multivariate Gaussian probability density functions. In the GMM parameter estimation, the distribution of features is modeled by the mean vectors , covariance matrices ∑ i , and mixture weights c i which is denoted by the notation Θ = { c i , μ i , ∑ i }, i = 1,2,…, K , where K is the number of mixture components [ 39 ]. These model parameters (Θ) are commonly determined using expectation maximization (EM) algorithm.…”
Section: Methodsmentioning
confidence: 99%
“…Several studies employed MFCC for microphone and environment classification [7], [19], [24], [25]. Several other frame-level features have also been evaluated for microphone recognition, such as Multi-taper MFCC [9], or Linear Prediction Cepstral Coefficient (LPCC) [10]. The researchers in [10] classified 16 different microphones recorded in one silent room using a Gaussian Mixture Model (GMM) classifier.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Several other frame-level features have also been evaluated for microphone recognition, such as Multi-taper MFCC [9], or Linear Prediction Cepstral Coefficient (LPCC) [10]. The researchers in [10] classified 16 different microphones recorded in one silent room using a Gaussian Mixture Model (GMM) classifier. They studied 3 feature representations: LPCC, MFCC, and Perceptually-based Linear Predictive Coefficients (PLPCs).…”
Section: Literature Reviewmentioning
confidence: 99%
“…Hanilci et al [18] explored microphone identification problem of speech recording from a mobile phone where audio recorded from 14 models of mobile phones are classified using vector quantization and SVM-based classifier. Moreover, Eskidere [19] reported his recent work in microphone identification on 16 microphone models using GMM-based modeling technique along with three different features called LPCC, PLPC and MFCC.…”
Section: Introductionmentioning
confidence: 99%