Scribe: Simultaneous Voice and Handwriting Interface

Bai, Yang; Shahid, Irtaza; Takawale, Harshvardhan; Roy, Nirupam

doi:10.1145/3631411

Citation Details

Scribe: Simultaneous Voice and Handwriting Interface

This paper presents the design and implementation of Scribe, a comprehensive voice processing and handwriting interface for voice assistants. Distinct from prior works, Scribe is a precise tracking interface that can co-exist with the voice interface on low sampling rate voice assistants. Scribe can be used for 3D free-form drawing, writing, and motion tracking for gaming. Taking handwriting as a specific application, it can also capture natural strokes and the individualized style of writing while occupying only a single frequency. The core technique includes an accurate acoustic ranging method called Cross Frequency Continuous Wave (CFCW) sonar, enabling voice assistants to use ultrasound as a ranging signal while using the regular microphone system of voice assistants as a receiver. We also design a new optimization algorithm that only requires a single frequency for time difference of arrival. Scribe prototype achieves 73 μm of median error for 1D ranging and 1.4 mm of median error in 3D tracking of an acoustic beacon using the microphone array used in voice assistants. Our implementation of an in-air handwriting interface achieves 94.1% accuracy with automatic handwriting-to-text software, similar to writing on paper (96.6%). At the same time, the error rate of voice-based user authentication only increases from 6.26% to 8.28%. more »

Award ID(s):: 2238433

PAR ID:: 10514753

Author(s) / Creator(s):: Bai, Yang; Shahid, Irtaza; Takawale, Harshvardhan; Roy, Nirupam

Publisher / Repository:: ACM

Date Published:: 2023-12-19

Journal Name:: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies

Volume:: 7

Issue:: 4

ISSN:: 2474-9567

Page Range / eLocation ID:: 1 to 31

Subject(s) / Keyword(s):: Handwriting tracking voice assistants cross frequency continuous wave sonar

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3631411

More Like this