skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Statistical learning dynamically shapes auditory perception
Abstract Humans implicitly pick up on probabilities of stimuli and events, yet it remains unclear how statistical learning builds expectations that affect perception. Across 29 experiments, we examine the influence of task-irrelevant distributions—defined across acoustic frequency—on both tone detection in noise and tone duration judgments. The shape and range of the frequency distributions impact suppression and enhancement effects, as does a given tone's position within the range. Perception adapts quickly to changing distributions, but past distributions influence future judgments. Massed exposure to a single frequency impacts perception along a range of subsequently encountered frequencies. A novel bias emerges as well: lower frequencies are perceived as longer and higher ones as shorter. Probability-driven learning dynamically shapes perception, driven by interacting influences of sensory processing, distributional learning, and selective attention that sculpt a gain function involving modest enhancement of more-likely stimuli, and robust suppression of less-likely stimuli.  more » « less
Award ID(s):
2414066 2420979
PAR ID:
10607976
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
npj Science of Learning
Volume:
10
Issue:
1
ISSN:
2056-7936
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Perception changes rapidly and implicitly as a function of passive exposure to speech that samples different acoustic distributions. Past research has shown that this statistical learning generalizes across talkers and, to some extent, new items, but these studies involved listeners’ active engagement in processing statistics-bearing stimuli. In this study, we manipulated the relationship between voice onset time (VOT) and fundamental frequency (F0) to establish distributional regularities either aligned with American English or reversed to create a subtle foreign accent. We then tested whether statistical learning across passive exposure to these distributions generalized to new items never experienced in the accent. Experiment 1 showed statistical learning across passive exposure but no generalization of learning when exposure and test items shared the same initial consonant but differed in vowels (bear/pear → beer/pier) or when they differed in initial consonant but shared distributional regularities across VOT and F0 dimensions (deer/tear → beer/pier). Experiment 2 showed generalization to stimuli that shared the statistics-bearing phoneme (bear/pear → beer/pier), but only when the response set included tokens from both exposure and generalization stimuli. Moreover, statistical learning transferred to influence the subtle acoustics of listeners’ own speech productions but did not generalize to influence productions of stimuli not heard in the accent. In sum, passive exposure is thus sufficient to support statistical learning and its generalization, but task demands modulate this dynamic. Moreover, production does not simply mirror perception: generalization in perception was not accompanied by transfer to production. 
    more » « less
  2. Period-doubled voice consists of two alternating periods with multiple frequencies and is often perceived as rough with an indeterminate pitch. Past pitch-matching studies in period-doubled voice found that the perceived pitch was lower as the degree of amplitude and frequency modulation between the two alternating periods increased. The perceptual outcome also differed across f0s and modulation types: a lower f0 prompted earlier identification of a lower pitch, and the matched pitch dropped more quickly in frequency- than amplitude-modulated tokens (Sun & Xu, 2002; Bergan & Titze, 2001). However, it is unclear how listeners perceive period doubling when identifying linguistic tones. In an artificial language learning paradigm, this study used resynthesized stimuli with alternating amplitudes and/or frequencies of varying degrees, based on a production study of period-doubled voice (Huang, 2022). Listeners were native speakers of English and Mandarin. We confirm the positive relationship between the modulation degree and the proportion of low tones heard, and find that frequency modulation biased listeners to choose more low-tone options than amplitude modulation. However, a higher f0 (300 Hz) leads to a low-tone percept in more amplitude-modulated tokens than a lower f0 (200 Hz). Both English and Mandarin listeners behaved similarly, suggesting that pitch perception during period doubling is not language-specific. Furthermore, period doubling is predicted to signal low tones in languages, even when the f0 is high. 
    more » « less
  3. Abstract The present study uncovers the fine structures of magnetosonic waves by investigating the EFW waveforms measured by Van Allen Probes. We show that each harmonic of the magnetosonic wave may consist of a series of elementary rising‐tone emissions, implying a nonlinear mechanism for the wave generation. By investigating an elementary rising‐tone magnetosonic wave that spans a wide frequency range, we show that the frequency sweep rate is likely proportional to the wave frequency. We studied compound rising‐tone magnetosonic waves, and found that they typically consist of multiple harmonics in the source region, and may gradually become continuous in frequency as they propagate away from source. Both elementary and compound rising‐tone magnetosonic waves last for ∼1 min which is close to the bounce period of the ring proton distribution, but their relation is not fully understood. 
    more » « less
  4. Male frogs court females from within crowded choruses, selecting for mechanisms allowing them to call at favourable times relative to the calls of rivals and background chorus noise. To accomplish this, males must continuously evaluate the fluctuating acoustic scene generated by their competitors for opportune times to call. Túngara frogs produce highly frequency- and amplitude-modulated calls from within dense choruses. We used similarly frequency- and amplitude-modulated playback tones to investigate the sensory basis of their call-timing decisions. Results revealed that different frequencies present throughout this species’ call differed in their degree of call inhibition, and that lower-amplitude tones were less inhibitory. Call-timing decisions were then driven by fluctuations in inhibition arising from underlying frequency- and amplitude-modulation patterns, with tone transitions that produced steeper decreases in inhibition having higher probabilities of triggering calls. Interactions between the varied behavioural sensitivities to different conspecific call frequencies revealed here, and the stereotyped amplitude- and frequency-modulation patterns present in this species’ calls, can explain previously surprising patterns observed in túngara frog choruses. This highlights the importance of understanding the specific sensory drivers underpinning conspecific signalling interactions, and reveals how sensory systems can mediate the interplay between signal perception and production to facilitate adaptive communication strategies. 
    more » « less
  5. null (Ed.)
    In studying visual perception, we seek to develop models of processing that accurately predict perceptual judgments. Much of this work is focused on judgments of discrimination, and there is a large literature concerning models of visual discrimination. There are, however, non-threshold visual judgments, such as judgments of the magnitude of differences between visual stimuli, that provide a means to bridge the gap between threshold and appearance. We describe two such models of suprathreshold judgments, maximum likelihood difference scaling and maximum likelihood conjoint measurement, and review recent literature that has exploited them. 
    more » « less