2015 IEEE 13th International Conference on Industrial Informatics (INDIN) 2015
DOI: 10.1109/indin.2015.7281886
|View full text |Cite
|
Sign up to set email alerts
|

A system for semantic information extraction from mixed soundtracks deploying MARSYAS framework

Abstract: Ever increasing volumes of media content and the desire to extract information from media archives motivate the studies into semantic audio information mining. Much research in this filed concerns development of bespoke systems, in which soundtracks are exclusively classified and segmented, and a specific type of sound is recognized and analyzed. This approach however is detrimental to the complete extraction of all relevant semantic information and audio scene analysis. The current study addresses the issues … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 19 publications
0
7
0
Order By: Relevance
“…An audio mixer was developed using the MATLAB code, which blends pre-recorded speech segments with noise according to their signal intensity. The mixing strategy used was empirically verified and published to emulate the best mix of soundtracks [25], [26]. The sound mixer procedure described is being as: firstly, the issue of normalization is addressed in such a way that speech and noise are added in the wanted proportion to avoid misinterpretation.…”
Section: Noisy Datamentioning
confidence: 99%
“…An audio mixer was developed using the MATLAB code, which blends pre-recorded speech segments with noise according to their signal intensity. The mixing strategy used was empirically verified and published to emulate the best mix of soundtracks [25], [26]. The sound mixer procedure described is being as: firstly, the issue of normalization is addressed in such a way that speech and noise are added in the wanted proportion to avoid misinterpretation.…”
Section: Noisy Datamentioning
confidence: 99%
“…103 Semantic IE from audio is capable to extract music score and text information through classification and segmentation which are helpful to update and insert music or speech occurrence and analyze arbitrary soundtracks. 104 Recently, the exponential growth of unstructured big data and computational power, ASR is moving toward more advanced and challenging applications such as mobile interaction with voice, voice control in smart systems, and communicative assistance. 105 Limitations of IE techniques for audio content.…”
Section: Audio Content Ie Techniquesmentioning
confidence: 99%
“…The field of acoustic IE is facing challenges such as more accurate feature selection, 97,99 classification of nonexclusive sound and content overlapping. 104 Call centers use audio data for analysis and IE in the form of conversations with clients, music, monitoring, and processing of conversations. Background noise, words overlapping, considering one single voice in crowd, and language ambiguities are open challenges in this domain.…”
Section: Audio Content Ie Techniquesmentioning
confidence: 99%
“…Semantic IE to extract music score and text information using segmentation and classification is achieved by analyzing arbitrary soundtracks and timestamp the occurrence of music and speech [35]. An IE approach is required for integrated detection and verification from speech which can be useful for speech analysis, speech recognition, speaker and language recognition [36].…”
Section: Information Extraction From Audio Datamentioning
confidence: 99%