2016
DOI: 10.1587/transinf.2015mup0009
|View full text |Cite
|
Sign up to set email alerts
|

Reversible Audio Data Hiding Based on Variable Error-Expansion of Linear Prediction for Segmental Audio and G.711 Speech

Abstract: SUMMARY Reversible data hiding is a technique in which hidden data are embedded in host data such that the consistency of the host is perfectly preserved and its data are restored during extraction of the hidden data. In this paper, a linear prediction technique for reversible data hiding of audio waveforms is improved. The proposed variable expansion method is able to control the payload size through varying the expansion factor. The proposed technique is combined with the prediction error expansion method. R… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 10 publications
0
3
0
Order By: Relevance
“…The work in [40] proposed a blind audio watermarking algorithm by discrete wavelet transform, which has a capacity of around 100 bps when SNR = 21 [dB], and the work in [41] proposed a robust audio watermarking scheme based on fractional Charlier moment transform, achieving SNR = 32 [dB], with capacity around 500 bps. As a popular criterion, MOS-LQO scores are used in [42], which refers to a reversible watermarking method based on variable error expansion of linear prediction applied to G711 µ-law-coded speech signals. The MOS-LQO is 4.13 for a capacity of 711 bps, 3.44 for 1253 bps, and 2.84 for 1995 bps.…”
Section: Comparison Of Different Frame Lengths and Embedding Capacitymentioning
confidence: 99%
“…The work in [40] proposed a blind audio watermarking algorithm by discrete wavelet transform, which has a capacity of around 100 bps when SNR = 21 [dB], and the work in [41] proposed a robust audio watermarking scheme based on fractional Charlier moment transform, achieving SNR = 32 [dB], with capacity around 500 bps. As a popular criterion, MOS-LQO scores are used in [42], which refers to a reversible watermarking method based on variable error expansion of linear prediction applied to G711 µ-law-coded speech signals. The MOS-LQO is 4.13 for a capacity of 711 bps, 3.44 for 1253 bps, and 2.84 for 1995 bps.…”
Section: Comparison Of Different Frame Lengths and Embedding Capacitymentioning
confidence: 99%
“…[12,13]. Additionally, RDH methods are applied not only to images, but also to video signals [14,15] and sound signals [16,17]. A comprehensive review of the literature on RDH methods from 1997-2016 is presented in [3].…”
Section: Related Workmentioning
confidence: 99%
“…For example, due to the popularity over social networks and the ease of handling, digital imagery is still one of the most popular sources for RDH. Other cover sources such as video sequences [21], speech signals [22] and texts [23] are also of increasing interest to researchers recently due to the fast development of multimedia technologies and social networking services. From the viewpoint of the data embedding mechanism, most advanced RDH algorithms use the so-called prediction errors (PEs) of the elements in the cover to be embedded to realize RDH.…”
Section: Introductionmentioning
confidence: 99%