

Search for: All records

Creators/Authors contains: "Narayanan, Shrikanth"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full-text articles may not yet be available free of charge during the embargo period.

  1. Free, publicly-accessible full text available January 1, 2026
  2. Free, publicly-accessible full text available September 1, 2025
  3. Variability in speech pronunciation is widely observed across different linguistic backgrounds, which impacts modern automatic speech recognition performance. Here, we evaluate the performance of a self-supervised speech model in phoneme recognition using direct articulatory evidence. Findings indicate significant differences in phoneme recognition, especially in front vowels, between American English and Indian English speakers. To gain a deeper understanding of these differences, we conduct real-time MRI-based articulatory analysis, revealing distinct velar region patterns during the production of specific front vowels. This underscores the need to deepen the scientific understanding of self-supervised speech model variances to advance robust and inclusive speech technology.

     
    Free, publicly-accessible full text available November 1, 2025
  4. Free, publicly-accessible full text available September 1, 2025
  5. We introduce an open-source platform for annotating body-worn video (BWV) footage, aimed at enhancing transparency and accountability in policing. Despite the widespread adoption of BWVs in police departments, analyzing the vast amount of footage generated has presented significant challenges, primarily due to resource constraints, the sensitive nature of the data (which limits widespread access), and, consequently, a lack of annotations for training machine learning models. Our platform, called CVAT-BWV, offers a secure, locally hosted annotation environment that integrates several AI tools to assist in annotating multimodal data. With features such as automatic speech recognition, speaker diarization, object detection, and face recognition, CVAT-BWV aims to reduce the manual annotation workload, improve annotation quality, and capture perspectives from a diverse population of annotators. By streamlining the collection of annotations and the building of models, the tool enhances the use of BWV data for oversight and for uncovering insights into police-civilian interactions.

     
    Free, publicly-accessible full text available August 1, 2025
  6. Stress experiences can have dire consequences for worker performance and well-being, and the social environment of the workplace is a key contributor to worker experience. This study investigated the relationship between hybrid workers’ self-ratings of productivity, mood, and stress and their perceptions of positive (eustress) and negative (distress) stress states. We hypothesized that self-ratings would vary across combinations of eustress and distress experiences, and that these associations would vary with the social context. Ecological momentary assessments (EMA) were used to obtain ecologically valid data at four data points each workday across a 4-month study period in a cohort of seven office workers. Findings aligned with the Yerkes–Dodson law, such that higher states of arousal were associated with greater self-perceived productivity, and higher stress magnitudes were found when distress was present. Compared to other states, eustress was associated with higher productivity in work-related activities and better mood across all activity types.

     
  7. Individuals who have undergone treatment for oral cancer oftentimes exhibit compensatory behavior in consonant production. This pilot study investigates whether compensatory mechanisms utilized in the production of speech sounds with a given target constriction location vary systematically depending on target manner of articulation. The data reveal that compensatory strategies used to produce target alveolar segments vary systematically as a function of target manner of articulation in subtle yet meaningful ways. When target constriction degree at a particular constriction location cannot be preserved, individuals may leverage their ability to finely modulate constriction degree at multiple constriction locations along the vocal tract. 
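The per-phoneme evaluation described in entry 3 can be sketched as a simple tally over time-aligned (reference, hypothesis) phoneme pairs, computed separately for each speaker group. This is a minimal illustration of the analysis shape only: the function name, the ARPABET-style labels, and the toy pairs below are hypothetical, not data or code from the study, which used a self-supervised speech model and real-time MRI.

```python
from collections import defaultdict

def per_phoneme_error_rates(aligned_pairs):
    """Error rate per reference phoneme from aligned (ref, hyp) pairs."""
    totals = defaultdict(int)
    errors = defaultdict(int)
    for ref, hyp in aligned_pairs:
        totals[ref] += 1
        if ref != hyp:
            errors[ref] += 1
    return {p: errors[p] / totals[p] for p in totals}

# Hypothetical aligned pairs for two speaker groups; front vowels
# (IY, EH, AE) are where the abstract reports the largest differences.
group_a = [("IY", "IY"), ("IY", "IY"), ("EH", "EH"), ("AE", "AE")]
group_b = [("IY", "IH"), ("IY", "IY"), ("EH", "AE"), ("AE", "AE")]

print(per_phoneme_error_rates(group_a))
print(per_phoneme_error_rates(group_b))
```

Comparing the two dictionaries phoneme by phoneme surfaces exactly the kind of group-level, vowel-specific gap the abstract reports.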
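Entry 5 combines automatic speech recognition with speaker diarization; a common way to join the two output streams is to label each transcript segment with the diarization turn that overlaps it most in time. The sketch below shows that interval-matching idea only; the function names, segment schema, and speaker labels are hypothetical and are not taken from CVAT-BWV's actual implementation.

```python
def overlap(a, b):
    """Length of temporal overlap between two (start, end) intervals."""
    return max(0.0, min(a[1], b[1]) - max(a[0], b[0]))

def attribute_speakers(asr_segments, diarization_turns):
    """Label each ASR segment with the most-overlapping speaker turn."""
    labeled = []
    for seg in asr_segments:
        span = (seg["start"], seg["end"])
        best, best_ov = None, 0.0
        for turn in diarization_turns:
            ov = overlap(span, (turn["start"], turn["end"]))
            if ov > best_ov:
                best, best_ov = turn["speaker"], ov
        labeled.append({**seg, "speaker": best})  # None if no overlap
    return labeled

asr = [{"start": 0.0, "end": 2.0, "text": "segment one"},
       {"start": 2.5, "end": 4.0, "text": "segment two"}]
turns = [{"start": 0.0, "end": 2.2, "speaker": "spk0"},
         {"start": 2.2, "end": 4.5, "speaker": "spk1"}]
print(attribute_speakers(asr, turns))
```

Pre-filling annotations this way is what lets human annotators correct machine output rather than label footage from scratch.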
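The joint eustress/distress framing in entry 6 implies a four-way partition of EMA responses (eustress only, distress only, both, neither), with self-ratings then aggregated within each state. The sketch below illustrates that partition-and-aggregate step; the 1-5 rating scale, the cutoff, and the field names are hypothetical assumptions, not the study's instrument or thresholds.

```python
from statistics import mean

def stress_state(eustress, distress, cutoff=3):
    """Map eustress/distress ratings (assumed 1-5 scale) to a joint state."""
    hi_e, hi_d = eustress >= cutoff, distress >= cutoff
    if hi_e and hi_d:
        return "both"
    if hi_e:
        return "eustress"
    if hi_d:
        return "distress"
    return "neither"

def mean_productivity_by_state(responses):
    """Average self-rated productivity within each joint stress state."""
    groups = {}
    for r in responses:
        state = stress_state(r["eustress"], r["distress"])
        groups.setdefault(state, []).append(r["productivity"])
    return {state: mean(vals) for state, vals in groups.items()}
```

Comparing the per-state means is the kind of contrast behind the abstract's finding that eustress-only moments coincided with higher self-rated productivity.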