skip to main content

Title: The rational use of causal inference to guide reinforcement learning strengthens with age

Beliefs about the controllability of positive or negative events in the environment can shape learning throughout the lifespan. Previous research has shown that adults’ learning is modulated by beliefs about the causal structure of the environment such that they update their value estimates to a lesser extent when the outcomes can be attributed to hidden causes. This study examined whether external causes similarly influenced outcome attributions and learning across development. Ninety participants, ages 7 to 25 years, completed a reinforcement learning task in which they chose between two options with fixed reward probabilities. Choices were made in three distinct environments in which different hidden agents occasionally intervened to generate positive, negative, or random outcomes. Participants’ beliefs about hidden-agent intervention aligned with the true probabilities of the positive, negative, or random outcome manipulation in each of the three environments. Computational modeling of the learning data revealed that while the choices made by both adults (ages 18–25) and adolescents (ages 13–17) were best fit by Bayesian reinforcement learning models that incorporate beliefs about hidden-agent intervention, those of children (ages 7–12) were best fit by a one learning rate model that updates value estimates based on choice outcomes alone. Together, these results suggest that while children demonstrate explicit awareness of the causal structure of the task environment, they do not implicitly use beliefs about the causal structure of the environment to guide reinforcement learning in the same manner as adolescents and adults.

more » « less
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
npj Science of Learning
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Intervening on causal systems can illuminate their underlying structures. Past work has shown that, relative to adults, young children often make intervention decisions that appear to confirm a single hypothesis rather than those that optimally discriminate alternative hypotheses. Here, we investigated how the ability to make informative causal interventions changes across development. Ninety participants between the ages of 7 and 25 completed 40 different puzzles in which they had to intervene on various causal systems to determine their underlying structures. Each puzzle comprised a three‐ or four‐node computer chip with hidden wires. On each trial, participants viewed two possible arrangements of the chip's hidden wires and had to select a single node to activate. After observing the outcome of their intervention, participants selected a wire configuration and rated their confidence in their selection. We characterized participant choices with a Bayesian measurement model that indexed the extent to which participants selected nodes that would best disambiguate the two possible causal structures versus those that had high causal centrality in one of the two causal hypotheses but did not necessarily discriminate between them. Our model estimates revealed that the use of a discriminatory strategy increased through early adolescence. Further, developmental improvements in intervention strategy were related to changes in the ability to accurately judge the strength of evidence that interventions revealed, as indexed by participants' confidence in their selections. Our results suggest that improvements in causal information‐seeking extend into adolescence and may be driven by metacognitive sensitivity to the efficacy of previous interventions in discriminating competing ideas.

    more » « less
  2. Abstract Background

    The language of the science curriculum is complex, even in the early grades. To communicate their scientific observations, children must produce complex syntax, particularly complement clauses (e.g.,I think it will float;We noticed that it vibrates). Complex syntax is often challenging for children with developmental language disorder (DLD), and thus their learning and communication of science may be compromised.


    We asked whether recast therapy delivered in the context of a science curriculum led to gains in complement clause use and scientific content knowledge. To understand the efficacy of recast therapy, we compared changes in science and language knowledge in children who received treatment for complement clauses embedded in a first‐grade science curriculum to two active control conditions (vocabulary + science, phonological awareness + science).

    Methods & Procedures

    This 2‐year single‐site three‐arm parallel randomized controlled trial was conducted in Delaware, USA. Children with DLD, not yet in first grade and with low accuracy on complement clauses, were eligible. Thirty‐three 4–7‐year‐old children participated in the summers of 2018 and 2019 (2020 was cancelled due to COVID‐19). We assigned participants to arms using 1:1:1 pseudo‐random allocation (avoiding placing siblings together). The intervention consisted of 39 small‐group sessions of recast therapy, robust vocabulary instruction or phonological awareness intervention during eight science units over 4 weeks, followed by two science units (1 week) taught without language intervention. Pre‐/post‐measures were collected 3 weeks before and after camp by unmasked assessors.

    Outcomes & Results

    Primary outcome measures were accuracy on a 20‐item probe of complement clause production and performance on ten 10‐item unit tests (eight science + language, two science only). Complete data were available for 31 children (10 grammar, 21 active control); two others were lost to follow‐up. Both groups made similar gains on science unit tests for science + language content (pre versus post,d= 2.9,p< 0.0001; group,p= 0.24). The grammar group performed significantly better at post‐test than the active control group (d= 2.5,p= 0.049) on complement clause probes and marginally better on science‐only unit tests (d= 2.5,p= 0.051).

    Conclusions & Implications

    Children with DLD can benefit from language intervention embedded in curricular content and learn both language and science targets taught simultaneously. Tentative findings suggest that treatment for grammar targets may improve academic outcomes.

    What this paper addsWhat is already known on the subject

    We know that recast therapy focused on morphology is effective but very time consuming. Treatment for complex syntax in young children has preliminary efficacy data available. Prior research provides mixed evidence as to children’s ability to learn language targets in conjunction with other information.

    What this study adds

    This study provides additional data supporting the efficacy of intensive complex syntax recast therapy for children ages 4–7 with Developmental Language Disorder. It also provides data that children can learn language targets and science curricular content simultaneously.

    What are the clinical implications of this work?

    As SLPs, we have to talk about something to deliver language therapy; we should consider talking about curricular content. Recast therapy focused on syntactic frames is effective with young children.

    more » « less
  3. null (Ed.)
    Background: Online challenges, phenomena that are very familiar to adolescents and young adults who spend large portions of time on social media, range from minimally harmful behaviors intended to support philanthropic endeavors to significantly harmful behaviors that may culminate in injury or death. Objective: This study investigated the beliefs that lead adolescents and young adults to participate in these activities by analyzing the Amyotrophic Lateral Sclerosis (ALS) Ice Bucket Challenge (IBC) to represent the former and the Cinnamon Challenge (CC), the latter. Methods: We conducted a retrospective quantitative study with a total of 471 participants between the ages of 13 and 35 who either had participated in the ALS IBC or the CC or had never participated in any online challenge. We used binomial logistic regression models to classify those who participated in ALS IBC or CC versus those who didn’t with the beliefs from the Integrated Behavioral Model (IBM) as predictors. Results: Our findings showed that both CC and ALS IBC participants had significantly greater positive emotional responses, value for the outcomes of the challenge, and expectation of the public to participate in the challenge in comparison to individuals who never participated in any challenge. In addition, only CC participants perceived positive public opinion about the challenge and perceived the challenge to be easy with no harmful consequences, in comparison to individuals who never participated in any challenge. Conclusions: The constructs that contribute to the spread of online challenge vary based on the level of self-harm involved in it and its purpose. We recommend that intervention efforts be tailored to address the beliefs associated with different types of online challenges. 
    more » « less
  4. null (Ed.)
    Many people believe in equality of opportunity but overlook and minimize the structural factors that shape social inequalities in the United States and around the world, such as systematic exclusion (e.g., educational, occupational) based on group membership (e.g., gender, race, socioeconomic status). As a result, social inequalities persist and place marginalized social groups at elevated risk for negative emotional, learning, and health outcomes. Where do the beliefs and behaviors that underlie social inequalities originate? Recent evidence from developmental science indicates that an awareness of social inequalities begins in childhood and that children seek to explain the underlying causes of the disparities that they observe and experience. Moreover, children and adolescents show early capacities for understanding and rectifying inequalities when regulating access to resources in peer contexts. Drawing on a social reasoning developmental framework, we synthesize what is currently known about children’s and adolescents’ awareness, beliefs, and behavior concerning social inequalities and highlight promising avenues by which developmental science can help reduce harmful assumptions and foster a more just society. 
    more » « less
  5. Abstract

    When human adults make decisions (e.g., wearing a seat belt), we often consider the negative consequences that would ensue if our actions were to fail, even if we have never experienced such a failure. Do the same considerations guide our understanding of other people's decisions? In this paper, we investigated whether adults, who have many years of experience making such decisions, and 6‐ and 7‐year‐old children, who have less experience and are demonstrably worse at judging the consequences of their own actions, conceive others' actions as motivated both by reward (how good reaching one's intended goal would be), and by what we call “danger” (how badly one's action could end). In two pre‐registered experiments, we tested whether adults and 6‐ and 7‐year‐old children tailor their predictions and explanations of an agent's action choices to the specific degree of danger and reward entailed by each action. Across four different tasks, we found that children and adults expected others to negatively appraise dangerous situations and minimize the danger of their actions. Children's and adults' judgments varied systematically in accord with both the degree of danger the agent faced and the value the agent placed on the goal state it aimed to achieve. However, children did not calibrate their inferences abouthow muchan agent valued the goal state of a successful action in accord with the degree of danger the action entailed, and adults calibrated these inferences more weakly than inferences concerning the agent's future action choices. These results suggest that from childhood, people use a degree of danger and reward to make quantitative, fine‐grained explanations and predictions about other people's behavior, consistent with computational models on theory of mind that contain continuous representations of other agents' action plans.

    more » « less