

Title: Towards Vulnerability Analysis of Voice-Driven Interfaces and Countermeasures for Replay Attacks
Fake audio detection is expected to become an important research area in the field of smart speakers such as Google Home, Amazon Echo, and the chatbots developed for these platforms. This paper examines the replay-attack vulnerability of voice-driven interfaces and proposes a countermeasure to detect replay attacks on these platforms. It introduces a novel framework to model replay-attack distortion and then uses a non-learning-based method for replay-attack detection on smart speakers. The replay-attack distortion is modeled as a higher-order nonlinearity in the replayed audio, and higher-order spectral analysis (HOSA) is used to capture the characteristic distortions it introduces. The replay recordings are successfully injected into a Google Home device via Amazon Alexa using the drop-in conferencing feature. The effectiveness of the proposed HOSA-based scheme is evaluated using originally recorded speech as well as the corresponding recordings played back to the Google Home via Amazon Alexa's drop-in conferencing feature.
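As a rough illustration of the HOSA idea, the bicoherence (a normalized bispectrum) rises when a signal contains quadratic phase coupling of the kind a replay chain's loudspeaker can introduce. The sketch below is not the paper's method: the signal parameters, noise level, and decision margin are illustrative assumptions.

```python
import numpy as np

def bicoherence(x, nfft=256, step=128):
    """Segment-averaged bicoherence estimate b(f1, f2) in [0, 1].

    Quadratic (higher-order) nonlinearity, such as loudspeaker
    distortion in a replayed recording, produces phase coupling
    between f1, f2, and f1 + f2, which raises the bicoherence.
    """
    win = np.hanning(nfft)
    half = nfft // 2
    f = np.arange(half)
    num = np.zeros((half, half), dtype=complex)
    d1 = np.zeros((half, half))
    d2 = np.zeros((half, half))
    for i in range(0, len(x) - nfft + 1, step):
        seg = x[i:i + nfft]
        X = np.fft.fft(win * (seg - seg.mean()))
        F1 = X[f][:, None]                 # X(f1)
        F2 = X[f][None, :]                 # X(f2)
        F12 = X[f[:, None] + f[None, :]]   # X(f1 + f2)
        num += F1 * F2 * np.conj(F12)
        d1 += np.abs(F1 * F2) ** 2
        d2 += np.abs(F12) ** 2
    return np.abs(num) ** 2 / (d1 * d2 + 1e-12)

# Toy check: a 500 Hz tone through a quadratic nonlinearity,
# a crude stand-in for replay-chain distortion.
rng = np.random.default_rng(0)
fs, n = 8000, 16384
t = np.arange(n) / fs
tone = np.sin(2 * np.pi * 500 * t)
clean = tone + 0.05 * rng.standard_normal(n)
replayed = tone + 0.3 * tone ** 2 + 0.05 * rng.standard_normal(n)

k = round(500 * 256 / fs)  # FFT bin of the 500 Hz tone (k = 16)
b_clean, b_replay = bicoherence(clean), bicoherence(replayed)
# Phase coupling at (500 Hz, 500 Hz) is strong only in the replayed signal.
```

In this toy setup the replayed signal's bicoherence at the coupled frequency pair approaches 1, while the clean signal's stays near the noise floor, which is the separation a HOSA-based detector exploits.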
Award ID(s):
1815724 1816019
NSF-PAR ID:
10097312
Journal Name:
IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)
Page Range / eLocation ID:
523 to 528
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Voice-controlled interactive smart speakers, such as Google Home, Amazon Echo, and Apple HomePod, are becoming commonplace in today's homes. These devices listen continually for user commands, which are triggered by special keywords such as "Alexa" and "Hey Siri". Recent research has shown that these devices are vulnerable to attacks through malicious voice commands from nearby devices. The commands can easily be sent during unoccupied periods, so the user may be unaware of such attacks. We present EchoSafe, a user-friendly sonar-based defense against these attacks. When the user sends a critical command to the smart speaker, EchoSafe emits an audio pulse and post-processes the returned echoes to determine whether the user is present in the room. We can detect the user's presence during critical commands with 93.13% accuracy, and our solution can be extended to defend against other attack scenarios as well.
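The core sonar step in the abstract above can be sketched as emitting a probe pulse and cross-correlating the microphone recording against it to locate echoes. The pulse design, sampling rate, and simulated room below are invented for illustration; EchoSafe's actual pipeline is not specified in this summary.

```python
import numpy as np

fs = 44100
rng = np.random.default_rng(0)

# Probe: a short near-ultrasonic tone burst, tapered to limit clicks.
t = np.arange(int(0.005 * fs)) / fs
pulse = np.sin(2 * np.pi * 19000 * t) * np.hanning(t.size)

def first_echo_delay(pulse, recording, fs):
    """Return the lag (seconds) of the strongest pulse echo,
    found by cross-correlating the recording with the probe."""
    corr = np.correlate(recording, pulse, mode="valid")
    return int(np.argmax(np.abs(corr))) / fs

# Simulated room: an attenuated copy of the pulse returns after 10 ms
# (about a 1.7 m round trip at 343 m/s), e.g. off a person or wall.
delay = int(0.010 * fs)
recording = 0.01 * rng.standard_normal(fs // 10)  # mic noise floor
recording[delay:delay + pulse.size] += 0.4 * pulse
```

A presence detector would compare the resulting echo profile against a baseline of the empty room rather than read off a single delay, but the correlation step is the same.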
  2. Smart speaker voice assistants (VAs) such as Amazon Echo and Google Home have been widely adopted due to their seamless integration with smart home devices and Internet of Things (IoT) technologies. These VA services raise privacy concerns, especially due to their access to our speech. This work considers one such use case: the unaccountable and unauthorized surveillance of a user's emotion via speech emotion recognition (SER). This paper presents DARE-GP, a solution that creates additive noise to mask users' emotional information while preserving the transcription-relevant portions of their speech. DARE-GP does this by using a constrained genetic programming approach to learn the spectral frequency traits that depict target users' emotional content, and then generating a universal adversarial audio perturbation that provides this privacy protection. Unlike existing works, DARE-GP provides: a) real-time protection of previously unheard utterances, b) against previously unseen black-box SER classifiers, c) while protecting speech transcription, and d) does so in a realistic acoustic environment. Further, this evasion is robust against defenses employed by a knowledgeable adversary. The evaluations in this work culminate in acoustic tests against two off-the-shelf commercial smart speakers, using a small-form-factor computer (Raspberry Pi) integrated with a wake-word system, to evaluate the efficacy of real-world, real-time deployment.
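The shape of such an evolutionary search for a masking perturbation can be illustrated with a toy: everything below is a hypothetical stand-in (a band-energy "emotion cue", made-up tone frequencies, a minimal mutate-and-select loop), not DARE-GP's genetic programming or its black-box SER targets.

```python
import numpy as np

rng = np.random.default_rng(2)
fs, n = 8000, 2048
t = np.arange(n) / fs

# Toy stand-in for an SER cue: relative spectral energy in 300-600 Hz.
freqs = np.fft.rfftfreq(n, 1 / fs)
band = (freqs >= 300) & (freqs < 600)

def emotion_score(x):
    mag = np.abs(np.fft.rfft(x))
    return mag[band].sum() / (mag.sum() + 1e-12)

# A "genome" holds amplitudes for four fixed masking tones placed
# outside the cue band (frequencies are invented for this demo).
tone_freqs = np.array([150.0, 900.0, 1500.0, 2300.0])
tones = np.sin(2 * np.pi * tone_freqs[:, None] * t[None, :])

def perturb(x, genome):
    return x + genome @ tones   # additive universal perturbation

speech = np.sin(2 * np.pi * 440 * t)  # toy "utterance" with in-band energy
base = emotion_score(speech)

def fitness(genome):
    # Lower the emotion cue while penalizing loud perturbations.
    return emotion_score(perturb(speech, genome)) + 0.01 * np.abs(genome).sum()

# Minimal mutate-and-select evolutionary loop.
pop = 0.05 * rng.random((16, 4))
for _ in range(30):
    children = np.clip(pop + 0.02 * rng.standard_normal(pop.shape), 0, 0.5)
    everyone = np.vstack([pop, children])
    pop = everyone[np.argsort([fitness(g) for g in everyone])][:16]

best = pop[0]
masked_score = emotion_score(perturb(speech, best))
```

The evolved perturbation lowers the toy emotion cue relative to the unperturbed utterance; the real system additionally constrains the search so transcription survives, which this sketch does not model.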

  3. The platformization of households is increasingly possible with the introduction of “intelligent personal assistants” (IPAs) embedded in smart, always-listening speakers and screens, such as Google Home and the Amazon Echo. These devices exemplify Zuboff’s “surveillance capitalism” by commodifying familial and social spaces and funneling data into corporate networks. However, the motivations driving the development of these platforms—and the dataveillance they afford—vary: Amazon appears focused on collecting user data to drive personalized sales across its shopping platform, while Google relies on its vast dataveillance infrastructure to build its AI-driven targeted advertising platform. This paper draws on cross-cultural focus groups regarding IPAs in the Netherlands and the United States. It reveals how respondents in these two countries articulate divergent ways of negotiating the dataveillance affordances and privacy concerns of these IPA platforms. These findings suggest the need for a nuanced approach to combating and limiting the potential harms of these home devices, which may otherwise be seen as equivalents. 
  4. Smart speakers come with always-on microphones to facilitate voice-based interaction. To address user privacy concerns, existing devices come with a number of privacy features: e.g., mute buttons and local trigger-word detection modules. But it is difficult for users to trust that these manufacturer-provided privacy features actually work given that there is a misalignment of incentives: Google, Meta, and Amazon benefit from collecting personal data and users know it. What’s needed is perceptible assurance — privacy features that users can, through physical perception, verify actually work. To that end, we introduce, implement, and evaluate the idea of “intentionally-powered” microphones to provide users with perceptible assurance of privacy with smart speakers. We employed an iterative-design process to develop Candid Mic, a battery-free, wireless microphone that can only be powered by harvesting energy from intentional user interactions. Moreover, users can visually inspect the (dis)connection between the energy harvesting module and the microphone. Through a within-subjects experiment, we found that Candid Mic provides users with perceptible assurance about whether the microphone is capturing audio or not, and improves user trust in using smart speakers relative to mute button interfaces. 
  5. The proliferation of the Internet of Things has increased reliance on voice-controlled devices to perform everyday tasks. Although these devices rely on accurate speech recognition for correct functionality, many users experience frequent misinterpretations in normal use. In this work, we conduct an empirical analysis of interpretation errors made by Amazon Alexa, the speech-recognition engine that powers the Amazon Echo family of devices. We leverage a dataset of 11,460 speech samples containing English words spoken by American speakers and identify where Alexa misinterprets the audio inputs, how often, and why. We find that certain misinterpretations appear consistently in repeated trials and are systematic. Next, we present and validate a new attack, called skill squatting. In skill squatting, an attacker leverages systematic errors to route a user to a malicious application without their knowledge. In a variant of the attack we call spear skill squatting, we further demonstrate that this attack can be targeted at specific demographic groups. We conclude with a discussion of the security implications of speech interpretation errors, countermeasures, and future work.
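The routing logic that skill squatting abuses can be reduced to a few lines: if a recognizer systematically maps an utterance to a wrong transcript, a skill registered under that wrong transcript captures the request. The word pair and skill names below are invented for illustration, not examples from the paper's dataset.

```python
# Hypothetical skill registry: the attacker registers a skill under
# the transcript the recognizer systematically produces by mistake.
legit_skills = {"fish facts": "LegitFishFacts"}
squatted = {"phish facts": "MaliciousSkill"}

def route(transcript):
    """Dispatch a transcript to whichever skill claims that name."""
    return legit_skills.get(transcript) or squatted.get(transcript, "no skill")

def asr(utterance):
    # Toy systematic error: the recognizer always hears "fish" as "phish".
    return utterance.replace("fish", "phish")
```

Because the error is systematic rather than random, the attacker captures the request on every attempt, which is what makes the attack practical.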