NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Active acoustic sensing for determining touch location on an elastic surface

https://doi.org/10.1016/j.jsv.2024.118667

Thompson, Benjamin R; DiPassio, Tre; Rutowski, Jenna; Bocko, Mark F; Heilemann, Michael C (January 2025, Journal of Sound and Vibration)

Free, publicly-accessible full text available January 1, 2026
University of Rochester room impulse response dataset

https://doi.org/10.60593/ur.d.26801089.v3

Rutowski, Jenna; DiPassio, Tre; Thompson, Benjamin R; Heilemann, Michael C; Bocko, Mark F (October 2024, University of Rochester)

The dataset includes impulse responses recorded from 14 different rooms. Each room has unique acoustic properties, providing a wide range of RT60, clarity, and EDT values. The recordings are in 48kHz, 32bit, mono WAV files. The dataset is organized by room, with each subfolder containing the impulse responses specific to that room, as well as a general layout of each room and plots of acoustic data.This dataset supports Estimating direction of arrival in reverberant environments for wake-word detection using a single structural vibration sensor, published in the Journal of the Acoustical Society of America, Vol. 156, Iss. 4, October, 2024.If you plan to download this dataset, we would appreciate it very much if you could fill out the Google form at https://forms.gle/jnuP2dYRK3CPmXQG6. This will help us understand the usage and impacts of this dataset. Your feedback will also help us improve any future extensions of this work.
more » « less
Estimating direction of arrival in reverberant environments for wake-word detection using a single structural vibration sensor

https://doi.org/10.1121/10.0032367

Rutowski, Jenna; DiPassio, Tre; Thompson, Benjamin R; Bocko, Mark F; Heilemann, Michael C (October 2024, The Journal of the Acoustical Society of America)

The vibrational response of an elastic panel to incident acoustic waves is determined by the direction-of-arrival (DOA) of the waves relative to the spatial structure of the panel's bending modes. By monitoring the relative modal excitations of a panel immersed in a sound field, the DOA of the source may be inferred. In reverberant environments, early acoustic reflections and the late diffuse acoustic field may obscure the DOA of incoming sound waves. Panel microphones may be especially susceptible to the effects of reverberation due to their large surface areas and long-decaying impulse responses. An investigation into the effect of reverberation on the accuracy of DOA estimation with panel microphones was made by recording wake-word utterances in eight spaces with reverberation times (RT60s) ranging from 0.27 to 3.00 s. The responses were used to train neural networks to estimate the DOA. Within ±5°, DOA estimation reliability was measured at 95.00% in the least reverberant space, decreasing to 78.33% in the most reverberant space, suggesting an inverse relationship between RT60 and DOA accuracy. Experimental results suggest that a system for estimating DOA with panel microphones can generalize to new acoustic environments by cross-training the system with data from multiple spaces with different RT60s.
more » « less
Full Text Available
Smart Speaker Command Dataset

https://doi.org/10.60593/ur.d.26417548.v1

DiPassio, Tre; Heilemann, Michael; Rutowski, Jenna; Sedlacek, Paula; Thompson, Benjamin; Wen, Yutong (January 2024, University of Rochester)

This dataset contains a collection of voice commands for a smart speaker, each beginning with the common wake-word "Hey Alexa". The commands cover a range of tasks such as music control, smart home management, information requests, reminders, shopping, entertainment, and communication. The dataset reflects natural language usage from a diverse group of speakers, capturing various phrasings, inflections, and contexts. It includes contributions from both male and female voices and features speakers with different native languages.If you plan to download this dataset, we would appreciate it very much if you could fill out the Google form at https://forms.gle/dixQ4mkZ4xbXtXRDA. This will help us understand the usage and impacts of this dataset. Your feedback will also help us improve any future extensions of this work.
more » « less
Estimating the Direction of Arrival of a Spoken Wake Word Using a Single Sensor on an Elastic Panel

https://doi.org/10.1109/WASPAA58266.2023.10248068

DiPassio, Tre; Heilemann, Michael C.; Thompson, Benjamin; Bocko, Mark F. (October 2023, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics)

Full Text Available
Estimating Acoustic Direction of Arrival Using a Single Structural Sensor on a Resonant Surface

https://doi.org/10.1109/ICASSP49357.2023.10095986

DiPassio, Tre; Heilemann, Michael C.; Thompson, Benjamin; Bocko, Mark F. (June 2023, ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))

The direction of arrival (DOA) of an acoustic source is a signal characteristic used by smart audio devices to enable signal enhancement algorithms. Though DOA estimations are traditionally made using a multi-microphone array, we propose that the resonant modes of a surface excited by acoustic waves contain sufficient spatial information that DOA may be estimated using a singular structural vibration sensor. In this work, sensors are affixed to an acrylic panel and used to record acoustic noise signals at various angles of incidence. From these recordings, feature vectors containing the sums of the energies in the panel’s isolated modal regions are extracted and used to train deep neural networks to estimate DOA. Experimental results show that when all 13 of the acrylic panel’s isolated modal bands are utilized, the DOA of incident acoustic waves for a broadband noise signal may be estimated by a single structural sensor to within ±5° with a reliability of 98.4%. The size of the feature set may be reduced by eliminating the resonant modes that do not have strong spatial coupling to the incident acoustic wave. Reducing the feature set to the 7 modal bands that provide the most spatial information produces a reliability of 89.7% for DOA estimates within ±5° using a single sensor.
more » « less
Full Text Available
Direction of arrival estimation of an acoustic wave using a single structural vibration sensor

https://doi.org/10.1016/j.jsv.2023.117671

DiPassio, Tre; Heilemann, Michael C.; Bocko, Mark F. (June 2023, Journal of Sound and Vibration)

Full Text Available
Audio Capture Using Piezoelectric Sensors on Vibrating Panel Surfaces

DiPassio, Tre; Heilemann, Michael C.; Thompson, Benjamin; Bocko, Mark F. (May 2023, 154th Convention of the Audio Engineering Society)

The microphone systems employed by smart devices such as cellphones and tablets require case penetrations that leave them vulnerable to environmental damage. A structural sensor mounted on the back of the display screen can be employed to record audio by capturing the bending vibration signals induced in the display panel by an incident acoustic wave - enabling a functional microphone on a fully sealed device. Distributed piezoelectric sensing elements and low-noise accelerometers were bonded to the surfaces of several different panels and used to record acoustic speech signals. The quality of the recorded signals was assessed using the speech transmission index, and the recordings were transcribed to text using an automatic speech recognition system. Although the quality of the speech signals recorded by the piezoelectric sensors was reduced compared to the quality of speech recorded by the accelerometers, the word-error-rate of each transcription increased only by approximately 2% on average, suggesting that distributed piezoelectric sensors can be used as a low-cost surface microphone for smart devices that employ automatic speech recognition. A method of crosstalk cancellation was also implemented to enable the simultaneous recording and playback of audio signals by an array of piezoelectric elements and evaluated by the measured improvement in the recording’s signal-to-interference ratio.
more » « less
Full Text Available
Audio Capture Using Structural Sensors on Vibrating Panel Surfaces

https://doi.org/10.17743/jaes.2022.0049

Dipassio, Tre; Heilemann, Michael C.; Bocko, Mark F. (December 2022, Journal of the Audio Engineering Society)

Full Text Available
Audio-Source Rendering on Flat-Panel Loudspeakers with Non-Uniform Boundary Conditions

Heilemann, Michael C.; DiPassio, Tre; Bocko, Mark F. (October 2021, 151st Convention of the Audio Engineering Society)

Devices from smartphones to televisions are beginning to employ dual purpose displays, where the display serves as both a video screen and a loudspeaker. In this paper we demonstrate a method to generate localized sound-radiating regions on a flat-panel display. An array of force actuators affixed to the back of the panel is driven by appropriately filtered audio signals so the total response of the panel due to the actuator array approximates a target spatial acceleration profile. The response of the panel to each actuator individually is initially measured via a laser vibrometer, and the required actuator filters for each source position are determined by an optimization procedure that minimizes the mean squared error between the reconstructed and targeted acceleration profiles. Since the single-actuator panel responses are determined empirically, the method does not require analytical or numerical models of the system’s modal response, and thus is well-suited to panels having the complex boundary conditions typical of television screens, mobile devices, and tablets. The method is demonstrated on two panels with differing boundary conditions. When integrated with display technology, the localized audio source rendering method may transform traditional displays into multimodal audio-visual interfaces by colocating localized audio sources and objects in the video stream.
more » « less
Full Text Available

Search for: All records