NSF PAR Search | NSF Public Access Repository


PlanGenLLMs: A Modern Survey of LLM Planning Capabilities

Wei, Hui; Zhang, Zihao; He, Shenghua; Xia, Tian; Pan, Shijia; Liu, Fei (January 2025, Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025))

Full Text Available
Enabling Accessible and Ubiquitous Interaction in Next-Generation Wearables: An Unvoiced Speech Approach

https://doi.org/10.1145/3636534.3695908

Srivastava, Tanmay; Khanna, Prerna; Pan, Shijia; Nguyen, Vp; Jain, Shubham (December 2024, ACM)

Full Text Available
Unvoiced: Designing an LLM-assisted Unvoiced User Interface using Earables

https://doi.org/10.1145/3666025.3699374

Srivastava, Tanmay; Khanna, Prerna; Pan, Shijia; Nguyen, Phuc; Jain, Shubham (November 2024, ACM)

Full Text Available
Jawthenticate: Microphone-free Speech-based Authentication using Jaw Motion and Facial Vibrations

Srivastava, Tanmay; Pan, Shijia; Nguyen, Phuc; Jain, Shubham (November 2023, ACM)

In this paper, we present Jawthenticate, an earable system that authenticates a user using audible or inaudible speech without using a microphone. This system can overcome the shortcomings of traditional voice-based authentication systems like unreliability in noisy conditions and spoofing using microphone-based replay attacks. Jawthenticate derives distinctive speech-related features from the jaw motion and associated facial vibrations. This combination of features makes Jawthenticate resilient to vocal imitations as well as camera-based spoofing. We use these features to train a two-class SVM classifier for each user. Our system is invariant to the content and language of speech. In a study conducted with 41 subjects, who speak different native languages, Jawthenticate achieves a Balanced Accuracy (BAC) of 97.07%, True Positive Rate (TPR) of 97.75%, and True Negative Rate (TNR) of 96.4% with just 3 seconds of speech data.
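The three metrics in the abstract above are related: under the standard definition, balanced accuracy is the mean of the true positive rate and the true negative rate. The short sketch below checks that relationship with the reported rates. The function name `balanced_accuracy` is a hypothetical helper, not something taken from the paper, and the paper may average the metric per user, which could explain a small difference from the reported 97.07%.

```python
def balanced_accuracy(tpr: float, tnr: float) -> float:
    """Return the mean of the true positive rate and the true negative rate.

    This is the standard definition of balanced accuracy; whether the paper
    aggregates it per user or over all trials is an assumption left open here.
    """
    return (tpr + tnr) / 2.0


# Rates reported in the abstract, in percent.
reported_tpr = 97.75
reported_tnr = 96.4

bac = balanced_accuracy(reported_tpr, reported_tnr)
print(f"Balanced accuracy from reported rates: {bac:.3f}%")
```

The result is close to the reported BAC, which suggests the abstract's figure was derived from these two rates or from per-user values that average to them.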
Full Text Available
MuteIt: Jaw Motion Based Unvoiced Command Recognition Using Earable

https://doi.org/10.1145/3550281

Srivastava, Tanmay; Khanna, Prerna; Pan, Shijia; Nguyen, Phuc; Jain, Shubham (September 2022, Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies)

In this paper, we present MuteIt, an ear-worn system for recognizing unvoiced human commands. MuteIt presents an intuitive alternative to voice-based interactions that can be unreliable in noisy environments, disruptive to those around us, and compromise our privacy. We propose a twin-IMU setup to track the user's jaw motion and cancel motion artifacts caused by head and body movements. MuteIt processes jaw motion during word articulation to break each word signal into its constituent syllables, and further each syllable into phonemes (vowels, visemes, and plosives). Recognizing unvoiced commands by only tracking jaw motion is challenging. As a secondary articulator, jaw motion is not distinctive enough for unvoiced speech recognition. MuteIt combines IMU data with the anatomy of jaw movement as well as principles from linguistics, to model the task of word recognition as an estimation problem. Rather than employing machine learning to train a word classifier, we reconstruct each word as a sequence of phonemes using a bi-directional particle filter, enabling the system to be easily scaled to a large set of words. We validate MuteIt for 20 subjects with diverse speech accents to recognize 100 common command words. MuteIt achieves a mean word recognition accuracy of 94.8% in noise-free conditions. When compared with common voice assistants, MuteIt outperforms them in noisy acoustic environments, achieving higher than 90% recognition accuracy. Even in the presence of motion artifacts, such as head movement, walking, and riding in a moving vehicle, MuteIt achieves mean word recognition accuracy of 91% over all scenarios.
Full Text Available
FinePose: Fine-Grained Postural Muscle Profiling via Haptic Vibration Signals

https://doi.org/10.1145/3539489.3539590

Rohal, Shubham; Shriram, Shreya; Nguyen, VP; Pan, Shijia (June 2022, Proceedings of the 2022 Workshop on Body-Centric Computing Systems)

Full Text Available
Leveraging earables for unvoiced command recognition

https://doi.org/10.1145/3498361.3538665

Srivastava, Tanmay; Khanna, Prerna; Pan, Shijia; Nguyen, Phuc; Jain, Shubham (June 2022, MobiSys '22: Proceedings of the 20th Annual International Conference on Mobile Systems, Applications and Services)

Full Text Available
Obstruction-invariant occupant localization using footstep-induced structural vibrations

https://doi.org/10.1016/j.ymssp.2020.107499

Mirshekari, Mostafa; Fagert, Jonathon; Pan, Shijia; Zhang, Pei; Noh, Hae Young (May 2021, Mechanical Systems and Signal Processing)
Full Text Available
Structure- and Sampling-Adaptive Gait Balance Symmetry Estimation Using Footstep-Induced Structural Floor Vibrations

https://doi.org/10.1061/(ASCE)EM.1943-7889.0001889

Fagert, Jonathon; Mirshekari, Mostafa; Pan, Shijia; Lowes, Linda; Iammarino, Megan; Zhang, Pei; Noh, Hae Young (February 2021, Journal of Engineering Mechanics)
Full Text Available
PigNet: Failure-Tolerant Pig Activity Monitoring System Using Structural Vibration

https://doi.org/10.1145/3412382.3458902

Bonde, Amelie; Codling, Jesse R.; Naruethep, Kanittha; Dong, Yiwen; Siripaktanakon, Wachirawich; Ariyadech, Sripong; Sangpetch, Akkarit; Sangpetch, Orathai; Pan, Shijia; Noh, Hae Young; et al (May 2021, The 20th ACM/IEEE International Conference on Information Processing in Sensor Networks)
Full Text Available
