skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Methods for PoLaR Explorations with Machine Learning: Grammatical Analysis of Intonation without Grammatical Labels
This study provides a proof-of-concept for a new method for analyzing intonational form and meaning, demonstrated by analysis of mirative utterances in American English. Here, K-means clustering using measures derived from PoLaR labels (i.e., TCoG) revealed emergent clusters of pitch accents that are suggestive of familiar phonological categories (e.g., MAE_ToBI L+H*). A Random Forest analysis then classified utterance-level meaning based on measures from both smaller granularity (related to individual pitch accents) and larger granularity (related to utterance level meaning), showing >85% correct categorization of exclamative vs filler sentences. This work has implications for how to model mappings between prosody and meaning, especially where existing phonological categories alone don’t identify semantic/pragmatic categories.  more » « less
Award ID(s):
2042702
PAR ID:
10436610
Author(s) / Creator(s):
Date Published:
Journal Name:
International Congress of Phonetic Sciences
ISSN:
2412-0669
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Radnick, S.; Volin, J. (Ed.)
    What is the minimal mathematical model that can generate the F0 trajectories for a system of pitch accents? In this work, we propose a nonlinear coupled dynamical systems theory of American English pitch accents with a single basic parameter. As that parameter increases, F0 profiles for different pitch accents are generated. The terms in the differential equation are based on a novel dynamical analysis of a large database of F0 productions in terms of measurements of F0 peak, peak velocity, and the time to achieve peak velocity. We describe the basic dynamical properties of pitch accents in our database and argue for the proposed model as the simplest one that realizes all the major dynamic F0 properties of the pitch accent system. We argue that the proposed model describes both abstract phonological and concrete phonetic aspects of the system. 
    more » « less
  2. Goldwater, M; Angora, F; Hayes, B; Ong, D (Ed.)
    This study investigated how observing pitch gestures conveying lexical tones and representational gestures conveying word meanings when learning L2 Mandarin words differing in lexical tone affects their subsequent semantic and phonological processing in L1 English speakers using the N400 event-related potential (ERP). Larger N400s for English target words mismatching vs. matching Mandarin prime words in meaning were observed for words learned with pitch and representational gesture, but not no gesture. Additionally, larger N400s for Mandarin target words mismatching vs. matching Mandarin prime words in lexical tone were observed for words learned with pitch gesture, but not representational or no gesture. These findings provide the first ERP evidence that observing gestures conveying phonological and semantic information during L2 word learning enhances subsequent phonological and semantic processing of learned L2 words. 
    more » « less
  3. This study employed the N400 event-related potential (ERP) to investigate how observing different types of gestures at learning affects the subsequent processing of L2 Mandarin words differing in lexical tone by L1 English speakers. The effects of pitch gestures conveying lexical tones (e.g., upwards diagonal movements for rising tone), semantic gestures conveying word meanings (e.g., waving goodbye for to wave), and no gesture were compared. In a lexical tone discrimination task, larger N400s for Mandarin target words mismatching vs. matching Mandarin prime words in lexical tone were observed for words learned with pitch gesture. In a meaning discrimination task, larger N400s for English target words mismatching vs. matching Mandarin prime words in meaning were observed for words learned with pitch and semantic gesture. These findings provide the first neural evidence that observing gestures during L2 word learning enhances subsequent phonological and semantic processing of learned L2 words. 
    more » « less
  4. This paper applies the Autosegmental Metrical (AM) model of intonation phonology and the Spanish Tones and Break Indices (Sp_ToBI) annotation conventions to compare the intonational contours of declarative sentences in two varieties of Puerto Rican Spanish: (1) San Juan Spanish, spoken in the capital city of San Juan, and (2) Loíza Spanish, an Afro-Hispanic vernacular spoken in Loíza. The geographical proximity between these two municipalities entails constant contact within a shared linguistic space. However, speakers from San Juan perceive Loíza as a municipality that has its own peculiar way of speaking. The acoustic and phonological analysis was carried out with PRAAT to verify whether pitch accents coincide in the spontaneous speech of the two analyzed varieties. The data we examined contain an overall predominance of the bitonal pitch accents L*+H and L+ 
    more » « less
  5. The Autosegmental-Metrical model of American English distinguishes three pitch accents with rising F0 trajectories (H*, L+H*, L*+H), differing in peak alignment and presence vs. absence of a low pitch marking the rise onset. Empirical studies report additional distinctions in the dynamics and scaling of the F0 rise, raising the question of which properties best capture variation among accents. We use functional principal components analysis (FPCA) to examine dynamic properties of accentual F0 trajectories in data from an intonation imitation experiment. F0 trajectories from 70 speakers producing rising accents on the phrase-final (nuclear) accented word were submitted to FPCA. The first three PCs account for 95% of variation in F0 trajectories and each shows significant differences between the three rising accents. Variation in PC1 primarily relates to differences in the overall F0 level of the trajectory, PC2 captures differences in rise shape (scooped vs. domed rise) and PC3 captures fine variation from a following Low phrase accent. Alignment distinctions are distributed across all three PCs. Examination of individual speakers shows all use PC1 and PC2 to some degree to distinguish rising accents, with no trading relations. Rises are variously implemented through level or shape distinctions, to varying degrees across individuals 
    more » « less