<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Individual Differences in Lexical Contributions to Speech Perception</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>03/17/2021</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10280766</idno>
					<idno type="doi">10.1044/2020_JSLHR-20-00283</idno>
					<title level='j'>Journal of Speech, Language, and Hearing Research</title>
<idno>1092-4388</idno>
<biblScope unit="volume">64</biblScope>
<biblScope unit="issue">3</biblScope>					

					<author>Nikole Giovannone</author><author>Rachel M. Theodore</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Purpose              The extant literature suggests that individual differences in speech perception can be linked to broad receptive language phenotype. For example, a recent study found that individuals with a smaller receptive vocabulary showed diminished lexically guided perceptual learning compared to individuals with a larger receptive vocabulary. Here, we examined (a) whether such individual differences stem from variation in reliance on lexical information or variation in perceptual learning itself and (b) whether a relationship exists between lexical recruitment and lexically guided perceptual learning more broadly, as predicted by current models of lexically guided perceptual learning.                                      Method                              In Experiment 1, adult participants (                n                = 70) completed measures of receptive and expressive language ability, lexical recruitment, and lexically guided perceptual learning. In Experiment 2, adult participants (                n                = 120) completed the same lexical recruitment and lexically guided perceptual learning tasks to provide a high-powered replication of the primary findings from Experiment 1.                                                    Results                              In Experiment 1, individuals with weaker receptive language ability showed                increased                lexical recruitment relative to individuals with higher receptive language ability; however, receptive language ability did not predict the magnitude of lexically guided perceptual learning. Moreover, the results of both experiments converged to show no evidence indicating a relationship between lexical recruitment and lexically guided perceptual learning.                                                    Conclusion              The current findings suggest that (a) individuals with weaker language ability demonstrate increased reliance on lexical information for speech perception compared to those with stronger receptive language ability; (b) individuals with weaker language ability maintain an intact perceptual learning mechanism; and, (c) to the degree that the measures used here accurately capture individual differences in lexical recruitment and lexically guided perceptual learning, there is no graded relationship between these two constructs.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Introduction</head><p>In speech perception, listeners must accommodate for the fact that there is no one-to-one mapping between speech acoustics and any given consonant or vowel. Despite this lack of invariance, phonemes are perceived categorically <ref type="bibr">(Liberman et al., 1957)</ref> and their representations exhibit a rich internal structure that reflects typicality of speech input <ref type="bibr">(Miller, 1994)</ref>. The mapping between speech acoustics and speech sounds can be dynamically modified by both bottom-up <ref type="bibr">(Clayards et al., 2008;</ref><ref type="bibr">Kleinschmidt &amp; Jaeger, 2015)</ref> and top-down learning mechanisms <ref type="bibr">(Ganong, 1980;</ref><ref type="bibr">Norris et al., 2003)</ref>.</p><p>Indeed, it has long been known that listeners use lexical information to facilitate speech perception <ref type="bibr">(Ganong, 1980)</ref>. When presented with a potentially ambiguous acoustic variant such as a voice-onset-time (VOT) value ambiguous between /g/ and /k/, listeners are more likely to perceive the variant as a member of the category that is consistent with lexical knowledge <ref type="bibr">(Ganong, 1980)</ref>. For example, when the variant precedes /&#618;s/, listeners are more likely to perceive the variant as /k/ than /g/, consistent with the interpretation that yields the real word kiss as opposed to the nonword giss. However, when the same variant precedes /&#618;ft/, listeners are more likely to perceive the variant as /g/, as gift is a real word and kift is a nonword. This lexical influence on speech perception (also known as the Ganong effect) can be exploited for lexically guided perceptual learning <ref type="bibr">(Norris et al., 2003;</ref><ref type="bibr">Samuel &amp; Kraljic, 2009)</ref>, in which repeated exposure to ambiguous input in lexically biasing contexts leads to persistent changes in the mapping between acoustics and speech sounds, even when lexical context is subsequently removed. For example, after repeated exposure to an ambiguous fricative (i.e., spectral energy ambiguous between /s/ and /&#643;/) in place of /s/ in lexical contexts (e.g., in place of /s/ in pencil), individuals will categorize a continuum of sounds ranging from /s/ to /&#643;/ as having more /s/ than /&#643;/ tokens. However, if individuals instead receive exposure to the ambiguous fricative in place of /&#643;/ in lexical contexts (e.g., in place of /&#643;/ in ambition), then they will categorize the same continuum of sounds as having more /&#643;/ than /s/ tokens. Thus, lexically guided perceptual learning allows listeners to dynamically modify the mapping between speech acoustics and speech sound categories, even when disambiguating lexical context is subsequently removed. Learning in this paradigm is robust; it extends beyond the boundary region to facilitate a comprehensive reorganization of phonetic category structure <ref type="bibr">(Drouin et al., 2016;</ref><ref type="bibr">Xie et al., 2017)</ref>. Moreover, learning can persist over time <ref type="bibr">(Eisner &amp; McQueen, 2006;</ref><ref type="bibr">Kraljic &amp; Samuel, 2005)</ref>. Lexically guided perceptual learning is often assessed using a between-subjects design in which one group of listeners receives an exposure block biased towards /s/ perception followed by a test block, while the other receives an exposure block biased towards /&#643;/ perception before test. In the absence of any additional input from exposure talker, learning can be observed following both short and long delays between exposure and test <ref type="bibr">(Kraljic &amp; Samuel, 2005;</ref><ref type="bibr">Eisner &amp; McQueen, 2006)</ref>. However, if listeners hear a second, opposite exposure block from the same speaker, listeners can rapidly retune to the talker's new input, as has been shown when lexically guided perceptual learning is assessed using a within-subjects design <ref type="bibr">(Saltzman &amp; Myers, 2018)</ref>. <ref type="foot">1</ref>Though lexically guided perceptual learning is a robust phenomenon when assessed at the group level, individual differences in the degree to which adults learn have been observed. A growing body of research suggests that individual differences in lexically guided perceptual learning may reflect individual variation in the relative weighting of phonetic and lexical information for speech perception. For example, <ref type="bibr">Scharenborg, Weber, and Janse (2015)</ref> found that older adults with higher attention-switching capability showed decreased lexically guided perceptual learning compared to those with lower attention-switching capability. Attentionswitching was measured via the Trail-Making Test, a standardized measure in which participants must connect alternating letters and numbers in sequence <ref type="bibr">(Reitan, 1958)</ref>. The authors suggested that this finding may reflect individuals with higher attention-switching ability relying more on phonetic information, whereas individuals with lower attention-switching ability instead rely more on lexical information. Because this study tested older adults, future research is needed in order to determine whether such relationships will also be observed in different populations.</p><p>Recent findings from <ref type="bibr">Colby, Clayards, and Baum (2018)</ref> lend additional support to the hypothesis that differences in lexical access contribute to individual differences in lexically guided perceptual learning. Specifically, individuals with a lower receptive vocabulary showed diminished lexically guided perceptual learning compared to individuals with a higher receptive vocabulary. Colby and colleagues assessed individual differences in both distributional learning and lexically guided perceptual learning in two age groups (younger and older adults) using a between-subjects design. In both age groups, receptive language ability (as measured by the Peabody Picture Vocabulary Test ; <ref type="bibr">Dunn &amp; Dunn, 1997)</ref> predicted perceptual learning such that individuals with lower PPVT scores demonstrated less learning-consistent responses in both the distributional learning and lexically guided perceptual learning tasks. Similar to the hypothesis of <ref type="bibr">Scharenborg and colleagues (2015)</ref>, <ref type="bibr">Colby et al. (2018)</ref> suggested that this pattern may reflect individuals with a larger vocabulary relying on lexical information to a greater degree than those with a smaller vocabulary.</p><p>While the hypothesis that lexical recruitment modulates lexically guided perceptual learning was not directly tested in these studies, it is consistent with work demonstrating that individuals do in fact differ in the degree to which they rely on lexical information for speech perception <ref type="bibr">(Ishida et al., 2016)</ref>. Ishida and colleagues tested listeners on two tasks. In the first task, listeners made same-different judgements for pairs of stimuli consisting of a natural speech token and a locally time-reversed speech token. Stimuli consisted of both word and nonword items, and the lexical effect was quantified as the difference in sensitivity (d') between word and nonword items. The second task was phonemic restoration, in which listeners judged the phonetic similarity between acoustically modified and unmodified versions of word and nonword items. In the phonemic restoration task, the lexical effect was quantified as the difference in phonemic restoration between word and nonword items. Ishida and colleagues found that (1) individuals varied in the degree to which lexical status influenced performance and</p><p>(2) individual differences in lexical reliance were stable across tasks such that individuals who showed a stronger effect of lexical status on the perceived intelligibility of locally time-reversed speech stimuli also showed a stronger effect of lexical status on phonemic restoration. These findings suggest that some adults rely more heavily on lexical information than others, and that lexical reliance is not dependent on a specific task.</p><p>Moreover, the hypothesis that lexical recruitment is directly related to lexically guided perceptual learning <ref type="bibr">(Colby et al., 2018;</ref><ref type="bibr">Scharenborg et al., 2015)</ref> is consistent with current leading models of speech perception that account for lexically guided perceptual learning. Two classes of speech perception models have accounted for the process by which lexical information influences speech perception, as shown in Figure <ref type="figure">1</ref>. Interactive theories such as TRACE <ref type="bibr">(McClelland &amp; Elman, 1986)</ref> posit that lexical information guides perception online -feedback from the lexicon can influence perception itself. In the TRACE model, acoustic input activates feature information, which feeds forward to the phoneme level and then to the lexical level. A defining aspect of the TRACE architecture is that activation can feed backward from the lexical level to the phoneme level and from the phoneme level to the feature level. Phonemic decisions, such as identifying which of two phonemes are heard during the standard lexically guided perceptual learning test task, are modeled as the node with the highest level of activation in the phoneme layer. A Hebbian learning dynamic in the TRACE model allows lexical feedback to strengthen the bidirectional connections between lexical, phoneme, and feature levels based on prior exposure, leading to perceptual learning (i.e., an adjusted connection between the initially ambiguous input and phonemes) even in nonword contexts <ref type="bibr">(Mirman et al., 2006)</ref>. The TRACE model suggests that lexical recruitment, which is often measured in the form of lexical effects (such as the Ganong effect), necessarily contribute to the phenomenon of lexically guided perceptual learning <ref type="bibr">(Mirman et al., 2006)</ref>. Modular (i.e., feed-forward) theories such as Merge <ref type="bibr">(Norris et al., 2000)</ref> posit that lexical information does not modify online processing, but instead guides processing at a later decisionlevel stage. In the Merge model, acoustic input activates nodes at a prelexical (phoneme) level, which feeds forward to the lexical level to facilitate word recognition. Unlike TRACE, there is no feedback from the lexical level to earlier processing levels. To model phonemic decisions, Merge posits that information from both the phoneme and lexical levels feeds to separate decision nodes, which are responsible for determining phonetic categorization. In this way, the decisions made during speech perception are influenced by both phonemic and lexical information, but without lexical information directly feeding back to the phonemic level.</p><p>Learning in this model occurs when activation from the phoneme and lexical levels is mismatched at the decision level. As a result, a training signal from the decision level modifies prelexical representations, thus modeling a learning effect that can generalize across words. As in TRACE, learning is contingent on lexical recruitment in the Merge model. Though these models differ in whether lexical information directly feeds back to phonemic representations, they converge on three points for modeling individual differences in lexically guided perceptual learning. First, activation of units within the phonemic and lexical levels is probabilistic, meaning that a specific phoneme/word may be activated with high probability (e.g., 0.9) while other phonemes/words may also be activated for the same input but with a low probability (e.g., 0.1). Probabilistic activation of representational units is fully consistent a wide body of literature for spoken word recognition (e.g., <ref type="bibr">Allopenna et al., 1998;</ref><ref type="bibr">McClelland &amp; Elman, 1986)</ref>. Second, both models posit that lexically guided perceptual learning cannot occur without lexical activation, an assumption that is supported by findings demonstrating that lexically guided perceptual learning does not occur when exposure consists of ambiguous sounds embedded in nonwords <ref type="bibr">(Norris et al., 2003)</ref>. Third, both models dissociate online lexical processing from learning within their architectures. That is, though lexical recruitment is necessary for lexically guided perceptual learning to occur, it is not sufficient for learning; lexical information needs to be passed to an intact learning mechanism that modifies the mapping between acoustics and phonemes. Within these frameworks, individual differences in lexically guided perceptual learning can be modeled in at least two ways. First, individual differences in learning could be accommodated by positing that the degree to which information from the lexical level contributes to reaching a lexical decision (whether online or post-perceptually) has the potential to vary on an individual level. Such differences may reflect the relative availability of acoustic and lexical information, leading some individuals to weight acoustic information more highly than lexical information, or vice versa. These differences may then feed into the learning mechanism, influencing the degree to which individuals dynamically adapt to variation in the speech signal. Second, individual differences in learning could be modeled by variability in the learning mechanism itself. Specifically, both models allow for the possibility that the learning mechanism itself can be selectively impaired, without impairment in lexical recruitment; thus, an individual who demonstrates strong lexical recruitment (for example, a strong Ganong effect) may not necessarily demonstrate an equivalently strong learning effect. Recall that <ref type="bibr">Colby and colleagues (2018)</ref> hypothesized that lexically guided perceptual learning was diminshed in those with weaker receptive language due to weaker use of lexical information during speech perception. This hypothesis is fully consistent with the models described above, but is potentially at odds with findings examining lexical recuitment in children with specific language impairment (SLI). <ref type="bibr">Schwartz et al. (2013)</ref> measured the magnitude of the Ganong effect in children with and without SLI and observed a larger Ganong effect in children with SLI compared to their typically developing peers. This finding suggests that individuals with weaker receptive language ability may show increased reliance on lexical information for speech perception, in opposition to Colby and colleagues' suggestion that weaker receptive language ability is associated with decreased reliance on the lexicon. While these seemingly contrary findings may have arisen from any of the methodological differences between these studies, it is theoretically possible that weaker receptive language ability can be associated with both an increased reliance on the lexicon and a deficit in lexically guided perceptual learning created by impairment to the learning mechanism. Consistent with the TRACE and Merge frameworks, an impaired learning mechanism in individuals with weaker receptive language ability would result in deficits to lexically guided perceptual learning despite intact (or even stronger) lexical recruitment.</p><p>Within this context, the goal of the current investigation is twofold. First, we examine whether the relationship between receptive language ability and lexically guided perceptual learning can be attributed to individual differences in lexical reliance in individuals with lower language ability, or whether they are attributable to variation in the learning mechanism itself.</p><p>Second, we examine whether there is a relationship between lexical recruitment and lexically guided perceptual learning, as is predicted by both the TRACE and Merge models. To do so, participants in Experiment 1 completed four subtests of the Clinical Evaluation of Language Fundamentals -5 th Edition (CELF; <ref type="bibr">Wiig, Semel, &amp; Secord, 2013)</ref> in addition to tasks assessing the Ganong effect and lexically guided perceptual learning. CELF subtests consisted of two that assess expressive language (Formulated Sentences, Recalling Sentences) and two that assess receptive language (Understanding Spoken Paragraphs, Semantic Relationships). Separate expressive and receptive language profiles were obtained in order to examine potential specificity of these constructs as contributors to individual differences in lexical recruitment and perceptual learning; two measures for expressive and receptive language were collected in order to assess convergence in results between the measures assessing each of these broad constructs.</p><p>Participants in Experiment 2 completed the same Ganong and lexically guided perceptual leaning tasks as for Experiment 1 but did not complete the CELF measures.</p><p>If individual variation in lexically guided perceptual learning is due to individual differences in lexical recruitment, then individuals with weaker receptive language ability should show a diminished Ganong effect in addition to diminished perceptual learning. Alternatively, if weaker perceptual learning in individuals with weaker receptive langauge abiltiy is attributable to impairment in the learning mechanism itself, consistent with the procedural deficit <ref type="bibr">(Ullman &amp; Pierpont, 2005)</ref> and statistical learning deficit <ref type="bibr">(Hsu &amp; Bishop, 2014)</ref> hypotheses of language impairment, then individuals with weaker language ability will show diminished lexically guided perceptual learning regardless of the degree of lexical recruitment. Independent of language ability, if lexical recruitment modulates lexically guided perceptual learning -as predicted by both the TRACE and Merge models, then performance on the Ganong task will predict performance on the lexically guided perceptual learning task such that increased lexical recruitment is associated with increased perceptual learning.</p><p>Experiment 1</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Method</head><p>Participants. The participants were 70 native speakers of American English (20 men, 50 women) between 18 and 26 years of age (mean = 20, SD = 2) who were recruited from the University of Connecticut community. Thirty-one participants had experience with a second language, with self-reported proficiency of novice (n = 18), intermediate (n = 11), or advanced (n = 2). All participants passed a pure tone hearing screen administered at 25 dB for octave frequencies between 500 and 4000 Hz and had nonverbal intelligence within normal limits (range = 86 -122, mean = 103, SD = 9) as assessed using the standard score of the Test of Nonverbal Intelligence -4 th Edition (TONI; <ref type="bibr">Brown, Sherbenou, &amp; Johnsen, 2010)</ref>. The TONI is normed to reflect a population mean of 100 (SD = 15). All participants completed Ganong and lexically guided perceptual learning tasks (described below) in addition to assessments of expressive and receptive language ability. Language ability was assessed using the standard score 2 of four subtests from the Clinical Evaluation of Language Fundamentals -5 th Edition 2 Twelve of the 70 participants were beyond the oldest age (21 years) provided for the standard score conversion of the CELF-5. Calculation of standard scores for these participants was made using the oldest age provided for the conversion, which is sensible given that this age bracket represents a maturational end-state. However, all analyses conducted with standard scores were also conducted with raw scores, and qualitatively similar results were observed in all cases.</p><p>(CELF; <ref type="bibr">Wiig et al., 2013)</ref>; scoring (e.g., trial-level scoring, calculation of standard score) was performed as outlined in the administration manual.</p><p>Expressive language was assessed using the Formulated Sentences and Recalling Sentences subtests. For Formulated Sentences, participants are asked to generate a sentence to describe a specific picture that contains one (or two) words provided by the experimenter.</p><p>Responses are scored based on the appropriateness of the sentence in the context of the stimulus picture. For Recalling Sentences, participants are required to repeat verbatim a sentence provided by the experimenter. Though the Recalling Sentences task requires contributions from perception and memory in order to be completed successfully, this subtest is characterized as an expressive language measure in the CELF manual. Receptive language was assessed using the Understanding Spoken Paragraphs and Semantic Relationships subtests. For Understanding Spoken Paragraphs, participants hear a series of short passages read by the experimenter and answer comprehension questions for each passage. For Semantic Relationships, participants are asked to solve short word problems that probe semantic knowledge by selecting the two correct items from a set of four items following a spoken prompt. An example problem is hearing "Jan saw Pedro. Pedro saw Francis. Who was seen?" and being shown Jan, Dwayne, Pedro, and Francis as possible response items (with the correct answers being Pedro and Francis). Due to an error in implementing the reversal rule during CELF-5 administration, the number of participants that could be accurately scored for a given subtest varied slightly across the four subtests (Formulated Sentences, n = 54; Recalling Sentences; n = 58; Understanding Spoken These analyses can be viewed by executing the supplemental analysis script provided in the OSF repository: <ref type="url">https://osf.io/r5sp9/</ref>.</p><p>Paragraphs, n = 70; Semantic Relationships, n = 63). 3 Figure <ref type="figure">2</ref> shows the distribution of standard scores for each of the four CELF subtests; standard scores for the CELF subtests reflect a population mean of 10 (SD = 3).</p><p>Stimuli: Ganong task. Stimuli for the Ganong task were two eight-step voice-onset-time (VOT) continua that perceptually ranged from giss to kiss and gift to kift, respectively. Both continua were created using the Praat software <ref type="bibr">(Boersma, 2002)</ref> from tokens produced by a native male speaker of American English. Drawing from recorded productions that were free of acoustic artifact, a single /&#618;s/ portion was selected for the giss-kiss continuum and a single /&#618;ft/ portion was selected for the gift-kift continuum such that duration of the /&#618;s/ (374 ms) and /&#618;ft/ (371 ms) portions were equivalent. To create the VOT portion (cueing the initial consonant), eight different VOTs <ref type="bibr">(17,</ref><ref type="bibr">21,</ref><ref type="bibr">27,</ref><ref type="bibr">37,</ref><ref type="bibr">46,</ref><ref type="bibr">51,</ref><ref type="bibr">59</ref>, and 71 ms) were created by successively removing energy from the aspiration region of a natural kiss production. The first step contained the burst plus the first quasi-periodic pitch period; subsequent steps contained this burst in addition to aspiration energy that increased across continuum steps. These eight VOTs were then spliced to the selected /&#618;s/ and /&#618;ft/ portions. With this procedure, the only difference among steps within a given continuum was VOT duration and the only difference between continua for a given step was lexical context (cued by the /&#618;s/ or /&#618;ft/ context). All stimuli were normalized for 3 The nature of the administration error for the Formulated Sentences, Recalling Sentences, and Semantic Relationships subtests, discovered after data collection was completed in conjunction with a double-check of the scoring data, is as follows. For all three subtests, participants began the test at the item appropriate for their age. In some cases, the participant did not meet the reversal rule (i.e., perfect score on the first two consecutive items) and the administrator failed to go back to the first item and test forward to the initial start point. As a consequence, the raw score for the affected participants may be higher than what would have been obtained if the reversal rule had been implemented correctly. As described in the main text, affected participants were removed from specific subtest analyses in light of this error. We note that even with their removal, the sample size for each subtest analysis (n &#8805; 54 in all cases) remains large relative to similar recent investigations (e.g., n = 31 for the young adult sample in <ref type="bibr">Colby et al., 2018)</ref>. peak amplitude.</p><p>As described above, the VOT portion cueing the initial consonant was identical between the two continua, with the coda portion (i.e., /s/ and /ft/) providing the critical lexical context required to elicit a Ganong effect. A preliminary experiment was conducted in order to ensure that perception of VOT was indeed equivalent between the two continua in the absence of lexical context. This study was hosted online by the Gorilla platform following procedures described for Experiment 2. Participants (n = 20 monolingual English speakers between 20 and 34 years of age with no history of language disorders) were recruited from the Prolific participant pool and passed the headphone screen of Woods and colleagues <ref type="bibr">(Woods, Siegel, &amp; McDermott, 2017)</ref> at the beginning of the experiment. Stimuli for this preliminary study consisted of those described for the Ganong task (2 continua x 8 steps) in addition to two parallel continua that were created by removing the coda portion from each of the 16 stimuli, thus creating two "control" continua that each perceptually ranged from /g&#618;/ to /k&#618;/. All participants completed two blocks of phonetic categorization, one for the control stimuli and one for the Ganong stimuli; block order was fixed across participants (control block followed by Ganong block). Each block consisted of 10 repetitions of the 16 stimuli appropriate for each block presented in randomized order. On each trial, participants were asked to identify the initial sound as either /g/ or /k/ by pressing an appropriately labeled key on the keyboard.</p><p>Mean proportion /k/ responses for each continuum in each block are shown in Figure <ref type="figure">3</ref>.</p><p>Visual inspection suggests that a Ganong effect was indeed observed in the Ganong block, reflecting more /k/ responses for the giss-kiss continuum compared to the gift-kift continuum. In contrast, /k/ responses appear equivalent between the two continua in the control block.</p><p>To confirm this pattern statistically, trial-level responses (/g/ = 0, /k/ = 1) were submitted to a generalized linear mixed effects model as implemented in the lme4 <ref type="bibr">(Bates et al., 2015)</ref> package in R. The fixed effects included VOT (scaled/centered around the mean), continuum, block, and all interactions. Continuum and block were sum-coded (continuum: gift-kift = -0.5, giss-kiss = 0.5; block: control = -0.5, Ganong = 0.5). The random effects structure consisted of random intercepts by participant and random slopes for VOT, continuum, and block by participant. An interaction between continuum and block was observed (&#120573; = 2.248, SE = 0.174, z = 12.927, p &lt; .001). To explicate the nature of the interaction, separate models with the fixed effects of VOT, continuum, and their interaction were constructed for each block. The main effect of continuum was significant in the Ganong block (&#120573; = 1.836, SE = 0.467, z = 3.936, p &lt; .001) but not in the control block (&#120573; = -0.175, SE = 0.159, z = -1.100, p = .271). This preliminary study confirms that the stimuli developed for the Ganong task are appropriate for use in the primary experiment.</p><p>Stimuli: Perceptual learning task. Stimuli for the lexically guided perceptual learning task were those in <ref type="bibr">Myers and Mesite (2014)</ref> to which the reader is referred for comprehensive details on stimulus creation. We used this stimulus set given that it has been shown to successfully elicit lexically guided perceptual learning across numerous samples <ref type="bibr">(Drouin et al., 2016;</ref><ref type="bibr">Drouin &amp; Theodore, 2018;</ref><ref type="bibr">Myers &amp; Mesite, 2014;</ref><ref type="bibr">Saltzman &amp; Myers, 2018)</ref>. In brief, there were two sets of exposure stimuli (one for the /s/-bias block and one for the /&#643;/-bias block) and one set of test stimuli, all produced by a single female native speaker of American English.</p><p>The exposure sets each consisted of 200 auditory items (100 words and 100 nonwords). For word items, 20 were critical /s/ items (e.g., pencil), 20 were critical /&#643;/ items (e.g., ambition), and 60 were filler items that contained no instances of /s/ or /&#643;/. For the /s/-bias set, the medial /s/ of the critical /s/ items was replaced with an ambiguous fricative (consisting of a 50:50 blend of /s/ and /&#643;/ sounds). For the /&#643;/-bias set, the medial /&#643;/ of the critical /&#643;/ items was replaced with an ambiguous fricative (i.e., a 50:50 blend of /s/ and /&#643;/ sounds). Test stimuli consisted of a sevenstep continuum that perceptually ranged from shine to sign. The continuum was created by blending the initial fricatives from natural productions of sign and shine in different proportions ranging from 20:80 (20% /s/ and 80% /&#643;/, the shine end of the continuum) to 80:20 (80% /s/ and 20% /&#643;/, the sign end of the continuum) in 10% steps. All stimuli were normalized for peak amplitude using Praat <ref type="bibr">(Boersma, 2002)</ref>.</p><p>Procedure. Participants were tested individually in a sound-attenuated booth. Stimuli were presented via headphones (Sony MDR-7506) at a comfortable listening level held constant across participants. Responses were made via button box (Cedrus RB-740). Stimulus presentation and response collection were controlled using SuperLab (version 4.5) running on a Mac OS X operating system. For both tasks, participants were directed to respond as quickly and accurately as possible and to guess if they were unsure.</p><p>The Ganong task consisted of 160 trials, formed by 10 repetitions of the eight continuum steps for each of the giss-kiss and gift-kift continua; items were presented in randomized order (ISI = 1500 ms). On each trial, participants indicated whether the initial sound was either /g/ or /k/. Participants then completed two blocks of lexically guided perceptual learning. All participants first received /s/-bias exposure (followed by test) and then received /&#643;/-bias exposure (followed by test). During exposure, the 200 items appropriate for the specific exposure block were presented in randomized order (ISI = 2000 ms). On each trial, participants indicated whether each item was a word or nonword. During test, eight repetitions of the seven test stimuli were presented in randomized order (ISI = 2000 ms); participants were asked to categorize each item as either sign or shine.</p><p>All participants completed the Ganong task before the lexically guided perceptual learning task in order to mitigate the possibility that the Ganong effect would be inflated due to possible carryover effects from the LGPL task. Specifically, the LGPL task requires listeners to make lexical decisions during the exposure phase, but the Ganong task requires listeners to make phonetic decisions. If listeners had completed the LGPL task first, then they may have been primed to approach the Ganong task as a lexical decision task instead of a phonetic categorization task. Participants were given a brief break in between the two tasks and received monetary compensation or partial course credit for their participation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head><p>Ganong task. Trial-level data and a script (in R) to reproduce all analyses presented in this manuscript can be retrieved at: <ref type="url">https://osf.io/r5sp9/</ref>. Responses on the Ganong task were coded as either /g/ (0) or /k/ (1). Trials for which no response was provided were excluded (&lt; 1% of the total trials). To visualize performance in the aggregate, mean proportion /k/ responses was calculated for each participant for each step of the two continua. Responses were then averaged across participants and are shown Figure <ref type="figure">4</ref> To examine this pattern statistically, trial-level responses (0 = /g/, 1 = /k/) were fit to a generalized linear mixed-effects model (GLMM) using the glmer() function with the binomial response family (i.e., a logistic regression) as implemented in the lme4 package <ref type="bibr">(Bates et al., 2015)</ref> in R. The fixed effects included VOT, continuum, and their interaction. VOT was entered into the model as continuous variable, scaled and centered around the mean. Continuum was sum-coded (giss-kiss = 0.5, gift-kift = -0.5). The random effects structure consisted of random intercepts by participant and random slopes by participant for VOT, continuum, and their interaction. As expected, the model showed a significant effect of VOT (&#120573; = 3.463, SE = 0.153, z = 22.698, p &lt; .001), indicating that /k/ responses increased as VOT increased. There was a significant effect of continuum (&#120573; = 1.265, SE = 0.177, z = 7.141, p &lt; .001), with the direction of the beta estimate indicating increased /k/ responses in the giss-kiss compared to the gift-kift continuum. There was also an interaction between continuum and VOT (&#120573; = 1.097, SE = 0.187, z = 5.882, p &lt; .001), indicating that the magnitude of the Ganong effect was not equivalent across continuum steps. Thus, the results of this model confirm the presence of a Ganong effect for participants in the aggregate.</p><p>The next set of analyses were conducted in order to examine whether the magnitude of the Ganong effect was linked to performance on the receptive and expressive language measures.</p><p>To do so, trial-level data (0 = /g/ response, 1 = /k/ response) were fit to a series of mixed effects models, one for each CELF subtest. Subtests were tested in separate models due to collinearity among predictors. The fixed effects in each model consisted of VOT, continuum, the CELF subtest, and all interactions among the three factors. Continuum was sum-coded as described for the aggregate model; VOT and CELF subtest were entered into the model as continuous variables (scaled/centered around the mean). The random effects structure consisted of random intercepts by participant and random slopes by participant for VOT, continuum, and their interaction. In all models, evidence of a link between subtest performance and the Ganong effect would manifest as an interaction between continuum and subtest.</p><p>The results of the four models are shown in Table <ref type="table">1</ref>. There was no interaction between continuum and subtest for Formulated Sentences (p = .911), an expressive language measure. However, the continuum by subtest interaction was significant for the expressive Recalling Sentences measure (p = .020) and both receptive language measures (Understanding Spoken Paragraphs, p = .008; Semantic Relationships, p = .007). The (negative) direction of the beta estimate for the significant interactions indicates a larger Ganong effect (i.e., difference between the giss-kiss and gift-kift continua) for those with weaker receptive language scores. Figure <ref type="figure">5</ref>, panel A shows the beta estimate and 95% confidence interval for the continuum by subtest interaction for all four subtests. To illustrate the nature of the interaction, Figure <ref type="figure">5</ref>, panel B shows performance on the Ganong task according to a median split of participants based on USP score; though both groups show a Ganong effect, the magnitude of this effect is larger in those with weaker receptive language as indexed by USP score. The same qualitative pattern -a larger Ganong effect for those with weaker compared to stronger language scores -was present for the other two significant interactions (i.e., Recalling Sentences, Semantic Relationships) as indicated by the negative beta estimate for each of the interaction terms.</p><p>In addition, single-order correlations between the magnitude of the Ganong effect (quantified as the difference in proportion /k/ responses between the giss-kiss and gift-kift continua) and subtest standard scores were run to facilitate comparison with the extant literature.</p><p>These results are presented in Table <ref type="table">2</ref>. In all cases, qualitatively similar results to those demonstrated by the GLMM analyses were found.</p><p>Perceptual learning task. Accuracy (proportion correct) on the lexical decision task during the exposure phase was near ceiling (mean = 0.95, SD = 0.03, range = 0.86 -0.99). 4 4 Recall that <ref type="bibr">Colby et al. (2018)</ref> found that individuals with stronger receptive vocabulary (as measured by the PPVT) showed increased lexically guided perceptual learning compared to those with weaker receptive vocabulary. Though Experiment 1 did not include a standardized measure of receptive vocabulary, performance on the lexical decision task provides an indirect measure of vocabulary. A series of exploratory analyses was conducted for the lexical decision data. Mean accuracy during exposure was not correlated with any of the four CELF subtests, and mean accuracy during exposure was not a significant predictor of the magnitude of perceptual Responses at test for the perceptual learning task were coded as either /&#643;/ (0) or /s/ (1). Trials for which no response was provided were excluded (&lt; 1% of the total trials). To visualize performance in the aggregate (Figure <ref type="figure">6</ref>, panel A), mean proportion sign responses was first calculated by participant in each half of the two test blocks (block 1 = /s/-bias, block 2 = /&#643;/-bias) at each step of the test continuum. Performance at test is considered over time (i.e., first half vs. second half) given research showing that lexically guided perceptual learning is attenuated throughout the test period <ref type="bibr">(Liu &amp; Jaeger, 2018</ref><ref type="bibr">, 2019)</ref>. That is, recent findings have shown that exposure to the flat frequency distributions at test (e.g., eight repetitions of each of the seven test stimuli) promotes unlearning of the biased input during exposure presumably due to distributional learning that occurs throughout the test period <ref type="bibr">(Liu &amp; Jaeger, 2018</ref><ref type="bibr">, 2019)</ref>. Indeed, visual inspection of Figure <ref type="figure">6</ref>, panel A suggests that the lexically guided perceptual learning effect is present in the first half of the test block, but attenuated in the second half of the test block.</p><p>To examine these patterns statistically, trial-level responses (0 = /&#643;/, 1 = /s/) were fit to a GLMM. The fixed effects included step, bias, half, and all interactions between the three factors.</p><p>Step was entered into the model as a continuous variable (scaled/centered around the mean). Bias and half were sum-coded (/s/-bias = 0.5, /&#643;/-bias = -0.5; first half = 0.5, second half = -0.5). The learning during the test phase. When accuracy on the lexical decision task was measured separately for the four item types presented during exposure (i.e., critical /s/ words, critical /&#643;/ words, filler words, nonwords), there was (1) no correlation between any of the CELF subtests and accuracy for /s/ words or /&#643;/ words, (2) significant, moderate positive correlations between the two receptive language subtests and accuracy for filler words, and (3) a significant but weak positive correlation between the Semantic Relationships subtest and accuracy for nonwords. However, accuracy for either filler words or nonwords was not a significant predictor of the magnitude of perceptual learning during the test phase. These exploratory analyses can be viewed in the supplementary analysis script on the OSF repository for this manuscript: <ref type="url">https://osf.io/r5sp9/</ref>. random effects structure consisted of random intercepts by participant and random slopes by participant for step, bias, and half. The model showed a main effect of step (&#120573; = 3.899, SE = 0.179, z = 21.835, p &lt; .001), with /s/ responses increasing across the test continuum. There was also a main effect of bias (&#120573; = 0.503, SE = 0.191, z = 2.630, p = .009), with more /s/ responses in the /s/-bias block compared to the /&#643;/-bias block, indicative of lexically guided perceptual learning. However, there was a significant interaction between bias and half (&#120573; = 0.913, SE = 0.232 z = 3.938, p &lt; .001). Simple slopes analyses showed a robust effect of bias in the first half of the test block (&#120573; = 0.959, SE = 0.221, z = 4.338, p &lt; .001), but no effect of bias in the second half of the test block (&#120573; = 0.046, SE = 0.226, z = 0.204, p = .839). Thus, a robust perceptual learning effect is observed at test, but it is limited to the first half of the test period in the current data, consistent with research showing that learning in this paradigm is attenuated throughout the test block as a consequence of exposure to the flat frequency distributions presented at test <ref type="bibr">(Liu &amp; Jaeger, 2018</ref><ref type="bibr">, 2019)</ref>. 5    Given that perceptual learning in the aggregate was only observed during the first half of the test period, consistent with past research <ref type="bibr">(Liu &amp; Jaeger, 2018</ref><ref type="bibr">, 2019)</ref> -and that past research has shown that receptive language ability is linked to distributional learning <ref type="bibr">(Colby et al., 2018;</ref><ref type="bibr">Theodore et al., 2019)</ref>, the presumed mechanism responsible for diminished learning during the lexically guided perceptual learning test phase -the next set of analyses tested for links between the language measures and perceptual learning isolating performance to the first half of each test 5 For both Experiment 1 and Experiment 2, an additional analysis examined performance across consecutive test block halves (e.g., first half of /s/-bias test block, second half of /s/-bias test block, first half of /&#643;/-bias test block, second half of /&#643;/-bias test block) using sliding contrast comparisons. The results of these analyses suggest that performance in the second block reflected boundary movement in the opposite direction of the first block instead of simple "unlearning" during the first test block. These analyses can be viewed on the OSF repository for this manuscript: <ref type="url">https://osf.io/r5sp9/</ref>. block. To do so, trial-level data (0 = /&#643;/ response, 1 = /s/ response) were fit to a series of mixed effects models, one for each CELF subtest. Subtests were tested in separate models due to potential collinearity among predictors. The fixed effects in each model consisted of step, bias, the CELF subtest, and all interactions among the three factors. Bias was sum-coded as described for the aggregate model; step and CELF subtest were entered into the model as continuous variables (scaled/centered around the mean). The random effects structure consisted of random intercepts by participant and random slopes by participant for step, continuum, and their interaction.</p><p>In all models, evidence of a link between subtest performance and the perceptual learning effect would manifest as an interaction between bias and subtest. The results of the four mixed effects models are shown in Table <ref type="table">3</ref>. There was no significant interaction between bias and subtest for any of the expressive or receptive language measures. Figure <ref type="figure">7</ref>, panel A shows the beta estimate and 95% confidence interval for the bias by subtest interaction for all four subtests. To illustrate the nature of the (null) interactions, Figure <ref type="figure">7</ref>, panel B shows performance on the perceptual learning task according to a median split of participants based on USP score. Though there is a numerical trend for the learning effect to be larger for those with weaker compared to stronger receptive language (i.e., larger beta estimates for the bias by subtest interactions for the two receptive language measures compared to the expressive language measures), these relationships were not statistically reliable.</p><p>Like for the Ganong task, single-order correlations between the magnitude of the learning effect (quantified as the difference in proportion /s/ responses between the first half of the /s/bias and /&#643;/-bias test blocks) and subtest standard scores were run to facilitate comparison with extant literature. These results are presented in Table <ref type="table">2</ref>. In all cases, qualitatively similar results to those demonstrated by the GLMM analyses were found.</p><p>Relationship between Ganong and perceptual learning tasks. The results presented thus far show that individuals with weaker receptive language showed a larger Ganong effect, consistent with past research <ref type="bibr">(Schwartz et al., 2013)</ref>. The same pattern also held for one of the expressive language measures. However, none of the language measures was a reliable predictor of the magnitude of the perceptual learning effect. This finding contrasts with results from Colby and colleagues <ref type="bibr">(Colby et al., 2018)</ref>, who found that stronger receptive vocabulary was associated with increased learning. A final analysis tested the prediction from both modular and interactive accounts of perceptual learning (Figure <ref type="figure">1</ref>), which posit a positive relationship between lexical recruitment and strength of perceptual learning. For each participant, we (1) quantified the magnitude of the Ganong effect as the difference in proportion /k/ responses between the gisskiss and gift-kift continua and (2) quantified the magnitude of the perceptual learning effect as the difference in proportion /s/ responses between the first half of the /s/-bias and /&#643;/-bias test blocks. In both cases, higher difference scores indicate larger effects. As can be seen in Figure <ref type="figure">8</ref>, panel A, there was no correlation between the two measures (r = -0.08, p = .492). This held even when simply comparing the correlation of rank order between the two measures (&#120588; = -0.07, p = .587).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Experiment 2</head><p>The results of Experiment 1 showed that receptive language ability, as measured by the Understanding Spoken Paragraphs and Semantic Relationships CELF subtests, was inversely associated with lexical recruitment. Compared to individuals with stronger receptive language, individuals with weaker receptive language showed increased reliance on lexical information.</p><p>The same relationship was also observed for the Recalling Sentences subtest, which is specified as a measure of expressive language in the CELF manual, but may in fact reflect receptive language ability given that perception and memory processes are required to successfully complete this task. However, none of the CELF subtests was associated with lexically guided perceptual learning, and no reliable relationship in the magnitude of the lexical recruitment and perceptual learning effects was observed. That is, the magnitude of the Ganong effect did not predict the magnitude of the learning effect, in contrast to predictions made by current theories of lexically guided perceptual learning. Moreover, the lack of a relationship between the two tasks challenges previous interpretations of individual differences in lexically guided perceptual learning.</p><p>Recall that in Experiment 1, bias in the perceptual learning task was manipulated withinsubjects in order to address potential issues with asymmetry in learning across bias conditions.</p><p>Though past research has indeed found evidence that listeners rapidly recalibrate for lexically guided perceptual learning when bias is manipulated within-subjects <ref type="bibr">(Saltzman &amp; Myers, 2018)</ref>, this method of measuring perceptual learning remains nonstandard in this domain. In this context, Experiment 2 was conducted as a replication of <ref type="bibr">Saltzman and Myers (2018)</ref>, who found that listeners rapidly retuned phonetic boundaries when lexically-biased exposure changed within an experimental session. Doing so would confirm the validity of the methodological decision to manipulate bias within-subjects in the current work. In addition, we aimed to replicate the (null) relationship between the magnitude of the Ganong effect and the magnitude of the learning effect that was observed in Experiment 1 with a larger sample size.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Method</head><p>Participants. Participants (n = 120; 61 men, 59 women) were recruited from the Prolific participant pool (<ref type="url">https://www.prolific.co</ref>). All participants were monolingual speakers of American English between 19 and 35 years of age (mean = 26, SD = 5) who were currently residing in the US and had no history of language disorders according to self-report. All participants passed the headphone screen of <ref type="bibr">Woods et al. (2017)</ref>, which is a protocol designed to ensure headphone compliance for web-based studies. Participants were compensated with $5.33 for completing the study. An additional 43 participants were tested but excluded from the study due to failure to pass the headphone screen or failure to meet compliance checks (e.g., participant only pressed one button for the entire experiment); this attrition rate is consistent with other webbased studies <ref type="bibr">(Brown et al., 2018;</ref><ref type="bibr">Thomas &amp; Clifford, 2017;</ref><ref type="bibr">Woods et al., 2017)</ref> Stimuli and procedure. The stimuli were identical to those used in Experiment 1. The procedure was identical to that outlined for Experiment 1 with three exceptions. First, all testing was completed online. The experiment was programmed using the Gorilla platform (<ref type="url">https://gorilla.sc</ref>), which was also used to host the study online. Second, participants were randomly assigned to either the SS-SH order group (n = 60) or the SH-SS order group (n = 60) for the lexically guided perceptual learning task. The SS-SH order group thus provides a replication of the learning task used in Experiment 1, where all listeners received /s/-bias exposure (followed by test) and then /&#643;/-bias exposure (followed by test). The SH-SS order group received the same exposure but first completed the /&#643;/-bias block (exposure followed by test) and then completed the /s/-bias block (exposure followed by test). Third, participants did not complete the CELF battery because it cannot be administered in a web-based format.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head><p>Ganong task. Performance was analyzed as outlined for the aggregate model in Experiment 1 and is displayed in Figure <ref type="figure">4</ref>, panel B. The GLMM showed a significant effect of VOT (&#120573; = 2.874, SE = 0.116, z = 24.752, p &lt; .001), a significant effect of continuum (&#120573; = 1.558, SE = 0.128, z = 12.135, p &lt; .001), and a marginal interaction between VOT and continuum (&#120573; = 0.201, SE = 0.104, z = 1.922, p = .055). These results confirm the presence of a Ganong effect in Experiment 2.</p><p>Perceptual learning task. Accuracy (proportion correct) on the lexical decision exposure task was near ceiling (mean = 0.95, SD = 0.04, range = 0.81 -0.99). For the initial analysis of the test data, performance was analyzed as outlined for the aggregate model in Experiment 1 (thus collapsing across order groups) and is displayed in Figure <ref type="figure">6</ref>, panel B. As in Experiment 1, a significant interaction was observed between bias and half (&#120573; = 0.875, SE = 0.168, z = 5.225, p &lt; .001), with learning attenuated in the second half of the test period compared to the first half of the test period. Given this interaction -and to optimally promote comparison to Experiment 1subsequent analyses were limited to performance in the first half of each test block.</p><p>The second analysis directly compared learning between the two order groups. Trial-level responses (/s/ = 0, /&#643;/ = 1) were submitted to a GLMM with the fixed effects of step, bias, order, and their interactions.</p><p>Step was entered as a continuous variable, bias and order were sum-coded (/s/ = 0.5, /&#643;/ = -0.5; SS-SH = 0.5, SH-SS = -0.5). The random effects structure consisted of random intercepts by participant, and random slopes by participant for step, bias, and their interaction.</p><p>The model showed a main effect of step (&#120573; = 4.340, SE = 0.187, z = 23.178, p &lt; .001) and bias (&#120573; = 1.787, SE = 0.310, z = 5.764, p &lt; .001), and an interaction between step and bias (&#120573; = 1.012, SE = 0.338, z = 2.999, p = .003), the latter indicating that the magnitude of the learning effect differed across continuum steps. There was no main effect of order (&#120573; = 0.026, SE = 0.377, z = 0.069, p = .945), but there was a significant interaction between bias and order (&#120573; = -0.977, SE = 0.492, z = -1.985, p = .047). The three-way interaction between step, bias, and order and all interactions between the three factors.</p><p>Step was entered as a continuous variable and bias was sum-coded as in the aggregate model. Block was also sum-coded (test block 1 = 0.5; test block 2 = -0.5). Random effects included random intercepts by participant, and random slopes by participant for step, bias, and block. No interaction between bias and block was observed (&#120573; = -0.091, SE = 0.739, z = 0.124, p = .901), thus providing no evidence that the magnitude of the between-subjects bias effect differed between the two test blocks.</p><p>Relationship between Ganong and perceptual learning tasks. The magnitude of the Ganong effect and the magnitude of the perceptual learning effect was quantified for each participant as described for Experiment 1. The relationship between the two effects is shown in Figure <ref type="figure">8</ref>, panel B. As for Experiment 1, there was no correlation between the two tasks in terms of either absolute magnitude (r = -0.01, p = .918) or rank order (&#120588; = -0.10, p = .277). The same patterns held when the correlations were performed within each order group separately (SS-SH: r = 0.09, p = .490, &#120588; = -0.07, p = .596; SH-SS: r = -0.06, p = .630, &#120588; = -0.08, p = .553).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Summary</head><p>The goal of the current study was twofold. First, we assessed whether individual differences in lexically guided perceptual learning associated with receptive language ability reflect variation in lexical reliance or variation in perceptual learning itself. Second, we assessed whether there is a relationship between lexical recruitment and lexically guided perceptual learning in general, as predicted by both interactive and modular models of perceptual learning.</p><p>With regard to the first question, the results of Experiment 1 suggest two key findings.</p><p>First, weaker language ability was associated with a larger Ganong effect, indicative of increased reliance on lexical information in these individuals. The magnitude of the Ganong effect was predicted by both of measures of receptive language ability and one measure of expressive language ability. These results are consistent with findings demonstrating that children with specific language impairment, which is associated with receptive language deficits, exhibit a larger Ganong effect compared to typically-developing children <ref type="bibr">(Schwartz et al., 2013)</ref>. Second, we found no evidence of a relationship between our measures of lexically guided perceptual learning and language ability, suggesting that individuals with weaker language ability have an intact perceptual learning mechanism despite their weaknesses in broad language phenotype. These results diverge from those of <ref type="bibr">Colby and colleagues (2018)</ref>, who found that weaker receptive language ability (as measured by receptive vocabulary) was associated with diminished lexically guided perceptual learning. Results were comparable when derived from single-order correlations, which may yield a more transparent measure of effect size, as well as when derived from generalized linear mixed effects models, which specifically model and thus account for individual differences in the identification response function.</p><p>Both experiments offer insight on our second question, which concerned the relationship between lexical recruitment and lexically guided perceptual learning in general. Despite the hypothesized relationship between these two constructs, we observed no evidence to suggest a relationship between lexical recruitment and lexically guided perceptual learning across two experiments that collectively tested 190 participants.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Implications for theory</head><p>Previous research <ref type="bibr">(Colby et al., 2018;</ref><ref type="bibr">Scharenborg et al., 2015)</ref>, as well as both the TRACE and Merge models of speech perception, suggest the existence of a relationship between lexical recruitment and lexically guided perceptual learning. For example, <ref type="bibr">Colby et al. (2018)</ref> found that individuals with lower receptive vocabulary showed attenuated lexically guided perceptual learning, which they hypothesized may reflect a decreased reliance on top-down information for speech perception. In addition, <ref type="bibr">Scharenborg et al. (2015)</ref> suggested that individuals with lower attention-switching capability demonstrate diminished lexically guided perceptual learning effects because they rely on top-down lexical information to a greater degree than those with higher attention-switching capability, who instead rely more highly on bottom-up phonetic information. In the current work, we found no evidence to suggest that individual variation in lexically guided perceptual learning was linked to receptive or expressive language ability; moreover, we found no evidence to suggest a relationship between lexical recruitment and lexically guided perceptual learning more generally.</p><p>Though the interpretation of null results is inherently challenging, the lack of a relationship between lexical recruitment and lexically guided perceptual learning may be treated with some degree of credibility. <ref type="bibr">Zheng and Samuel (2020)</ref> outlined three criteria that could mitigate concern in interpreting null results: adequate power, sufficient between-subjects variability, and stable within-subjects performance. The current experiments clearly meet the first two criteria. Experiment 1 tested 70 participants and Experiment 2 tested 120 participants. These sample sizes are well above those generally tested for studies of lexically guided perceptual learning, and post-hoc sensitivity analyses suggest that they were sufficiently powered (1 -&#223; = 0.80, &#120572; = 0.05) to detect small to moderate effects (r = 0.33 given n = 70, r = 0.25 given n = 120). The samples tested in this study yielded substantial between-subjects variability for all tasks as shown in Figure <ref type="figure">2</ref> (CELF subtest scores) and Figure <ref type="figure">8</ref> (performance in the Ganong and lexically guided perceptual learning tasks). Regarding the third criterion, as noted by <ref type="bibr">Zheng and Samuel (2020)</ref>, the nature of the lexically guided perceptual learning effect makes it very difficult to properly assess its within-subject stability, which we discuss further below.</p><p>As in <ref type="bibr">Colby et al. (2018)</ref> and <ref type="bibr">Scharenborg et al. (2015)</ref>, the current study used the lexically guided perceptual learning paradigm. The discrepancy between past research and the current findings may be related to the specific tasks used to measure individual differences in language ability, lexical recruitment, and/or the specific population being tested. <ref type="bibr">Colby et al. (2018)</ref> measured receptive language ability using the PPVT, whereas receptive language ability in the current work was measured using two subtests of the CELF. Accordingly, Colby and colleagues measured receptive vocabulary in isolation, whereas the language measures used in the current study encompass multiple elements of receptive language. It may be the case that individual differences in perceptual learning reflect contributions from vocabulary size that are dissociable from measures that assess receptive language ability more broadly. Further research directed towards dissociating which aspects of language processing are related to lexically guided perceptual learning should be conducted through the use of more specific measures of language ability. In past studies, reliance on lexical information was hypothesized to be the mediator of observed relationships between individual differences on the PPVT (for younger and older adults) or Trail Making task (for older adults) and lexically guided perceptual learning. In the current study, lexical recruitment was directly measured for younger adults using the Ganong task. The Ganong task is widely accepted as reflecting the contribution of lexical information to speech perception <ref type="bibr">(Ishida et al., 2016;</ref><ref type="bibr">Pitt, 1995)</ref> and is therefore likely to be a valid index of lexical recruitment; however, future research should examine whether the results observed here extend to other measures of lexical recruitment and other populations (e.g., older adults).</p><p>It is also possible that the lack of observed relationship between lexical recruitment and lexically guided perceptual learning is related to a potential threshold effect for a lexical influence on learning. Figure <ref type="figure">1</ref> depicts the relationship predicted by both interactive and modular theories in which lexically guided perceptual learning is contingent on lexical recruitment. A potential explanation to reconcile the discrepancy between our findings and these models of speech perception is that only a certain degree of lexical access (or a certain size of the lexicon) is necessary to cue perceptual learning and that beyond this threshold, additional strength in lexical recruitment does not further contribute to lexically guided perceptual learning. That is, lexical contributions could quickly meet a point of diminishing returns. While we observed wide individual variability in the Ganong effect in our two participant samples, it is possible not enough individuals with lexical recruitment at a level below this threshold were recruited, leading to the observed lack of relationship between lexical recruitment and lexically guided perceptual learning in the current work.</p><p>This explanation may also contribute to the pattern of results we observed regarding the null effect of language ability on lexically guided perceptual learning. Research on developmental language disorder (and specific language impairment) has suggested that higherlevel deficits in receptive language may stem from impairments early in the processing stream, including general auditory processing and global speech perception abilities (e.g., <ref type="bibr">Joanisse &amp; Seidenberg, 2003;</ref><ref type="bibr">McArthur &amp; Bishop, 2004)</ref>. Despite potential deficits in using bottom-up information to guide speech perception, the current study and previous work <ref type="bibr">(Schwartz et al., 2013)</ref> suggest that individuals with lower receptive language ability use top-down lexical information to scaffold speech perception to a higher extent than individuals with higher receptive language ability. It is possible that increased reliance on lexical information is a compensatory mechanism for earlier deficits in speech perception. Compensation of this sort would have benefits not only for online processing, but also for post-perceptual processes. For example, if the relative contribution of lexical information to speech perception is higher in those with weaker receptive language in order to mitigate weaker contributions of phonetic information, then individuals with weaker language ability may surpass the minimal threshold posited above, leading to performance equivalent to those with higher receptive language ability for lexically guided perceptual learning.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Limitations and considerations for future research</head><p>Though the current study supports examination of the relationship between receptive language, lexical recruitment, and perceptual learning, the current work does not support the identification of causal mechanisms. While it is plausible that strengthened lexical recruitment in individuals with weaker receptive language ability could be a compensatory mechanism for courser-grained perceptual analysis, the design of the current study does not bear directly on this possibility. Further research is necessary in order to explicate the mechanisms behind the increased reliance on the lexicon observed in both children <ref type="bibr">(Schwartz et al., 2013)</ref> and adults with weaker language ability.</p><p>As alluded to earlier in the discussion, a problem facing individual differences research in cognitive science more broadly is a lack of knowledge about the degree to which the chosen tasks are stable measures within an individual. While assessments such as the PPVT (used by <ref type="bibr">Colby et al., 2018;</ref><ref type="bibr">Williams &amp; Wang, 1997)</ref> and the Trail Making Test (used by <ref type="bibr">Scharenborg et al., 2015;</ref><ref type="bibr">Giovagnoli et al., 1996;</ref><ref type="bibr">Seo et al., 2006)</ref> are known to have sufficient test-retest reliability, relatively less is known about stability in performance on speech perception tasks, including the ones used here. That is, there is a dearth of evidence regarding whether performance in the Ganong and/or lexically guided perceptual learning tasks is stable over timeand stimuli -at the level of individual participants. For example, will a person who shows a large Ganong effect for a given stimulus set tested on a Monday also show a large Ganong effect for a different stimulus set when tested on a Friday? As the field advances our efforts to understand individual differences in perceptual and cognitive tasks, additional research is needed in order to confirm that our tasks reflect valid (and thus stable) measures of individual differences.</p><p>We acknowledge that this is no mean feat, especially when measuring stability of performance for tasks that assess learning. Recently, <ref type="bibr">Saltzman and Myers (2018)</ref> examined whether the size of a perceptual boundary shift induced by lexically guided perceptual learning was consistent in individuals who completed the same lexically guided perceptual learning task twice (approximately one week apart). At each session, listeners completed both /s/-bias and /&#643;/bias exposure phases, and the boundary shift was measured as the difference in /&#643;/-responses between the two phases. They found no relationship in performance across the two sessions, suggesting low individual consistency for lexically guided perceptual learning. However, this study has been retracted <ref type="bibr">(Saltzman &amp; Myers, 2020</ref>) and thus it is not clear whether these results are stable. Moreover, assessing the test-retest reliability of this learning paradigm introduces substantial challenges related to disassociating effects of short-term learning from more longterm learning introduced by multiple test sessions. For example, if learning in this paradigm persists over more long-term time periods (as suggested by <ref type="bibr">Eisner &amp; McQueen, 2006;</ref><ref type="bibr">cf. Liu &amp; Jaeger, 2018</ref><ref type="bibr">, 2019)</ref>, then the a priori expectation for individuals who learn would be no correlation between the boundary shift across test sessions because learning from the first session would inherently lead to no learning taking place in a second session. In addition, if an extrinsic factor (as opposed to a stable individual factor) were responsible for a lack of learning in the first session (e.g., completed after a lack of sleep), this may not necessarily lead to no learning occurring in the second session (e.g., completed after a good night's rest). Thus, it is impossible to completely rule out insufficient stability of this effect as a contributor to null results.</p><p>Additional research regarding the stability of both of these effects within an individual is warranted not only to explicate the theoretical relationship between lexical recruitment and lexically guided perceptual learning, but also to support clinical use of these tasks. Results from past research <ref type="bibr">(Schwartz et al., 2013)</ref> and the current study have shown that a larger Ganong effect is associated with SLI and weaker receptive language ability, respectively. Previous research has demonstrated that a larger Ganong effect is also associated with weaker speech in noise perception <ref type="bibr">(Lam et al., 2017)</ref>. Given that weaker speech in noise perception has been shown to be predictive of language impairment <ref type="bibr">(Ziegler et al., 2005</ref><ref type="bibr">(Ziegler et al., , 2011) )</ref> and that language impairment is associated with broad receptive language deficits, it is possible that once a better understanding of factors contributing to the individual differences and internal consistency of the Ganong effect is gained, this task could become a valuable, time-effective tool for use in clinical batteries for the assessment of language impairment.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conclusions</head><p>The findings of Experiment 1 are consistent with a theory positing that individuals with weaker language ability demonstrate increased reliance on lexical information for speech perception compared to those with stronger receptive language ability. Increased reliance on lexical information among those with weaker receptive language ability was observed for online lexical recruitment, but no differences in lexically guided perceptual learning as a function of language ability were observed. Individuals with weaker receptive language ability therefore appear to maintain an intact lexically guided perceptual learning mechanism. Further research is needed in order to understand whether the relationship between lexical recruitment and language ability reflects compensation for earlier deficits in speech perception, and if so, where in the speech processing stream these deficits occur.</p><p>To the degree that the chosen measures accurately capture lexical recruitment and lexically guided perceptual learning at the level of individual participants, the findings of both experiments converge to suggest no graded relationship between these two constructs. This result can be accommodated by current theories of speech perception if they are modified to model this relationship as being governed threshold level of lexical recruitment that is necessary and sufficient to cue lexically guided perceptual learning.      </p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>learning, there is no graded relationship between these two constructs.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_1"><p>A retraction notice(Saltzman &amp; Myers,  </p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_2"><p>2020) for this study was issued after the initial submission of the current manuscript. Because the results presented in<ref type="bibr">Saltzman and Myers (2018)</ref> contributed to the scientific premise of the current work, we describe them here so that the introduction is a veridical representation of our understanding of the scientific record as this study was developed.</p></note>
		</body>
		</text>
</TEI>
