Life after Speech Recognition: Fuzzing Semantic Misinterpretation for Voice Assistant Applications

Zhang, Yangyong; Xu, Lei; Mendoza, Abner; Yang, Guangliang; Chinprutthiwong, Phakpoom; Gu, Guofei

doi:10.14722/ndss.2019.23525

Cited by 49 publications

(26 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We highlight ways in which the backend code can be updated to trigger dormant intents, which can deceive users into giving up sensitive data -something that has not been previously discussed or demonstrated. Zhang et al [56] state that an attacker can swap backend audio files without providing concise details, whereas we demonstrate (by publishing a skill) how an attacker can register dormant intents of sensitive data types (Section V-C). We also showcase how an attacker can register skills using well-known developer names (e.g., Ring, Withings, Samsung) to deceive users into enabling phishing skills (Section V).…”

Section: Related Workmentioning

confidence: 83%

“…This attack is based on the observation that Alexa favors the longest matching skill name when processing voice commands. In another concurrent work, Zhang et al [56] design a linguistic-model-guided fuzzing tool to systematically discover the semantic inconsistencies in Alexa skills. They state that the developer controlled backend can be abused by the developer, for example by swapping legitimate audio files with malicious audio files.…”

Section: Related Workmentioning

confidence: 99%

“…In 2017, Alhadlaq et al [8] performed a small analysis on Alexa skills (around 10,000 skills at the time) and found that 75 % of the skills did not have a privacy policy and 70 % of the existing policies did not mention anything specific to Alexa. [56] et al [55] et al [35] et al [8] Backend change Developer registration Squatting Activation criteria Privacy policy Permission check…”

Section: Related Workmentioning

confidence: 99%

“…If the user's request does not match a skill's invocation name, Alexa automatically tries to fulfill the request by presenting the user with a list of probable skills to choose from [14]. Existing studies [56], [55], [35] have highlighted the existence of many duplicate skills, however, none of them have thoroughly analyzed how Alexa prioritizes among skills sharing the same invocation name.…”

Section: A Duplicate Skill Invocation Namesmentioning

confidence: 99%

“…Research shows that participants feel uncomfortable knowing that information from their private home has been shared or disclosed to third parties [40], [16], [36]. Moreover, recent studies continue to show increasingly sophisticated attacks on automated speech recognition systems [46], [20], [21] and on Alexa skills [56]. When Alexa integrates with other smart home IoT devices such as smart locks or smart cars, 1 security implications arise.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Hey Alexa, is this Skill Safe?: Taking a Closer Look at the Alexa Skill Ecosystem

Lentzsch

Shah²,

Andow

et al. 2021

Proceedings 2021 Network and Distributed System Security Symposium

View full text Add to dashboard Cite

Amazon's voice-based assistant, Alexa, enables users to directly interact with various web services through natural language dialogues. It provides developers with the option to create third-party applications (known as Skills) to run on top of Alexa. While such applications ease users' interaction with smart devices and bolster a number of additional services, they also raise security and privacy concerns due to the personal setting they operate in. This paper aims to perform a systematic analysis of the Alexa skill ecosystem. We perform the first largescale analysis of Alexa skills, obtained from seven different skill stores totaling to 90,194 unique skills. Our analysis reveals several limitations that exist in the current skill vetting process. We show that not only can a malicious user publish a skill under any arbitrary developer/company name, but she can also make backend code changes after approval to coax users into revealing unwanted information. We, next, formalize the different skillsquatting techniques and evaluate the efficacy of such techniques. We find that while certain approaches are more favorable than others, there is no substantial abuse of skill squatting in the real world. Lastly, we study the prevalence of privacy policies across different categories of skill, and more importantly the policy content of skills that use the Alexa permission model to access sensitive user data. We find that around 23.3 % of such skills do not fully disclose the data types associated with the permissions requested. We conclude by providing some suggestions for strengthening the overall ecosystem, and thereby enhance transparency for end-users.

show abstract

Section: Related Workmentioning

confidence: 83%

Section: Related Workmentioning

confidence: 99%