2019
DOI: 10.48550/arxiv.1910.13212
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Privacy Enhanced Multimodal Neural Representations for Emotion Recognition

Abstract: Many mobile applications and virtual conversational agents now aim to recognize and adapt to emotions. To enable this, data are transmitted from users' devices and stored on central servers. Yet, these data contain sensitive information that could be used by mobile applications without user's consent or, maliciously, by an eavesdropping adversary. In this work, we show how multimodal representations trained for a primary task, here emotion recognition, can unintentionally leak demographic information, which co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 15 publications
0
5
0
Order By: Relevance
“…Most of the proposed works focus on protecting/anonymizing speaker identity using voice conversion (VC) mechanisms [11,81,94,99]. Beyond speaker identity, various works propose to protect speaker gender [49] and emotion [13], wherein an edge-based system is proposed to filter affect patterns from a user's voice before sharing it with cloud services for further analysis. Another direction is protecting users' privacy by ensuring that sensitive data is not unnecessarily transmitted to service providers [15].…”
Section: Privacy Guardmentioning
confidence: 99%
See 1 more Smart Citation
“…Most of the proposed works focus on protecting/anonymizing speaker identity using voice conversion (VC) mechanisms [11,81,94,99]. Beyond speaker identity, various works propose to protect speaker gender [49] and emotion [13], wherein an edge-based system is proposed to filter affect patterns from a user's voice before sharing it with cloud services for further analysis. Another direction is protecting users' privacy by ensuring that sensitive data is not unnecessarily transmitted to service providers [15].…”
Section: Privacy Guardmentioning
confidence: 99%
“…The word error rate (WER; lower is better) and realtime factor (RTF; lower RTF is more computationally efficient). 49. 18.45 7.35 7.14 2.11 5.95 1.33 5.36 2.20 Private Data 23.21 51.41 85.00 2.87 32.74 1.39 27.38 1.04 11.31 2.06…”
mentioning
confidence: 99%
“…It has been found that when the attacker has complete knowledge of the VC scheme and target speaker mapping, none of the existing VC methods can protect speaker identity. Beyond speaker identity, various works propose to protect speaker gender [12] and emotion [13], wherein an edge-based system is proposed to filter affect patterns from a user's voice before sharing it with cloud services for further analysis.…”
Section: Privacy-preserving Voice Analyticsmentioning
confidence: 99%
“…Privacy preserved representation learning is a relatively unexplored research topic. Recently, researchers have started to utilise privacy-preserving representation learning models to protect speaker identity [383], gender identity [384]. To preserve users' privacy, federated learning [385] is another alternative setting where the training of a shared global model is performed using multiple participating computing devices.…”
Section: F Privacy Preserving Representationsmentioning
confidence: 99%