Auto Annotation of Linguistic Features for Audio Deepfake Discernment

Nwosu, Kifekachukwu; Evered, Chloe; Khanjani, Zahra; Bhalli, Noshaba; Davis, Lavon; Mallinson, Christine; Janeja, Vandana P.

doi:10.1609/aaaiss.v2i1.27682

We present an innovative approach to auto-annotate Expert Defined Linguistic Features (EDLFs) as subsequences in audio time series to improve audio deepfake discernment. In our prior work, these linguistic features – namely pitch, pause, breath, consonant release bursts, and overall audio quality, labeled by experts on the entire audio signal – have been shown to improve detection of audio deepfakes with AI algorithms. We now expand our approach to pilot a way to auto-annotate the subsequences in the time series that correspond to each EDLF. We developed an ensemble of discords, i.e., anomalies in time series, detected using matrix profiles across multiple discord lengths to identify multiple types of EDLFs. Working closely with linguistic experts, we evaluated where discords overlapped with EDLFs in the audio signal data. Our ensemble method, which detects discords across multiple discord lengths, achieves much higher accuracy in detecting EDLFs than any individual discord length. With this approach and domain validation, we establish the feasibility of using time series subsequences to capture EDLFs to supplement annotation by domain experts, for improved audio deepfake detection.
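The discord ensemble described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how discords from different lengths are combined, so the code below simply collects the top discord for each window length and checks whether it overlaps an expert-annotated span. The function names (`matrix_profile`, `top_discord`, `ensemble_discords`, `overlaps`), the exclusion-zone width of half the window, and the synthetic signal are all assumptions made for illustration, and the brute-force matrix profile is only practical for short series.

```python
import numpy as np


def matrix_profile(ts, m):
    """Brute-force matrix profile: for every length-m subsequence, the
    z-normalized Euclidean distance to its nearest non-trivial neighbor."""
    ts = np.asarray(ts, dtype=float)
    subs = np.lib.stride_tricks.sliding_window_view(ts, m)
    n = subs.shape[0]
    mu = subs.mean(axis=1, keepdims=True)
    sd = subs.std(axis=1, keepdims=True)
    sd[sd == 0] = 1.0  # flat windows: avoid division by zero
    z = (subs - mu) / sd
    # For z-normalized windows, d^2 = 2m(1 - correlation).
    corr = (z @ z.T) / m
    dist = np.sqrt(np.clip(2.0 * m * (1.0 - corr), 0.0, None))
    # Exclusion zone (assumed m // 2): ignore trivial matches with
    # heavily overlapping neighbors, including the window itself.
    idx = np.arange(n)
    excl = max(1, m // 2)
    dist[np.abs(idx[:, None] - idx[None, :]) < excl] = np.inf
    return dist.min(axis=1)


def top_discord(ts, m):
    """Return (start, end) of the subsequence farthest from any neighbor."""
    start = int(np.argmax(matrix_profile(ts, m)))
    return start, start + m


def ensemble_discords(ts, lengths):
    """Collect the top discord for each candidate discord length."""
    return {m: top_discord(ts, m) for m in lengths}


def overlaps(span, annotation):
    """True if two half-open [start, end) intervals share any sample."""
    return span[0] < annotation[1] and annotation[0] < span[1]


if __name__ == "__main__":
    # Synthetic stand-in for an audio feature track: a periodic signal
    # with a short flat stretch playing the role of an annotated pause.
    t = np.arange(1000)
    signal = np.sin(2 * np.pi * t / 50)
    signal[500:520] = 0.0
    expert_span = (500, 520)  # hypothetical expert annotation
    for m, span in ensemble_discords(signal, [25, 50, 100]).items():
        print(m, span, overlaps(span, expert_span))
```

Using several window lengths matters because EDLFs differ in duration: a short consonant release burst and a longer pause are unlikely to be captured well by one fixed discord length.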

Award ID(s): 2210011

PAR ID: 10502122

Author(s) / Creator(s): Nwosu, Kifekachukwu; Evered, Chloe; Khanjani, Zahra; Bhalli, Noshaba; Davis, Lavon; Mallinson, Christine; Janeja, Vandana P.

Publisher / Repository: Association for the Advancement of Artificial Intelligence

Date Published: 2024-01-22

Journal Name: Proceedings of the AAAI Symposium Series

Volume: 2

Issue: 1

ISSN: 2994-4317

Page Range / eLocation ID: 242 to 244

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.1609/aaaiss.v2i1.27682
