skip to main content


Title: A modeling framework for adaptive lifelong learning with transfer and savings through gating in the prefrontal cortex

The prefrontal cortex encodes and stores numerous, often disparate, schemas and flexibly switches between them. Recent research on artificial neural networks trained by reinforcement learning has made it possible to model fundamental processes underlying schema encoding and storage. Yet how the brain is able to create new schemas while preserving and utilizing old schemas remains unclear. Here we propose a simple neural network framework that incorporates hierarchical gating to model the prefrontal cortex’s ability to flexibly encode and use multiple disparate schemas. We show how gating naturally leads to transfer learning and robust memory savings. We then show how neuropsychological impairments observed in patients with prefrontal damage are mimicked by lesions of our network. Our architecture, which we call DynaMoE, provides a fundamental framework for how the prefrontal cortex may handle the abundance of schemas necessary to navigate the real world.

 
more » « less
NSF-PAR ID:
10200700
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Proceedings of the National Academy of Sciences
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
117
Issue:
47
ISSN:
0027-8424
Page Range / eLocation ID:
p. 29872-29882
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Disruption of circadian rhythms, such as shift work and jet lag, are associated with negative physiological and behavioral outcomes, including changes in affective state, learning and memory, and cognitive function. The prefrontal cortex (PFC) is heavily involved in all of these processes. Many PFC-associated behaviors are time-of-day dependent, and disruption of daily rhythms negatively impacts these behavioral outputs. Yet how disruption of daily rhythms impacts the fundamental function of PFC neurons, and the mechanism(s) by which this occurs, remains unknown. Using a mouse model, we demonstrate that the activity and action potential dynamics of prelimbic PFC neurons are regulated by time-of-day in a sex specific manner. Further, we show that postsynaptic K+channels play a central role in physiological rhythms, suggesting an intrinsic gating mechanism mediating physiological activity. Finally, we demonstrate that environmental circadian desynchronization alters the intrinsic functioning of these neurons independent of time-of-day. These key discoveries demonstrate that daily rhythms contribute to the mechanisms underlying the essential physiology of PFC circuits and provide potential mechanisms by which circadian disruption may impact the fundamental properties of neurons.

     
    more » « less
  2. Key points

    Visual attention involves discrete multispectral oscillatory responses in visual and ‘higher‐order’ prefrontal cortices.

    Prefrontal cortex laterality effects during visual selective attention are poorly characterized.

    High‐definition transcranial direct current stimulation dynamically modulated right‐lateralized fronto‐visual theta oscillations compared to those observed in left fronto‐visual pathways.

    Increased connectivity in right fronto‐visual networks after stimulation of the left dorsolateral prefrontal cortex resulted in faster task performance in the context of distractors.

    Our findings show clear laterality effects in theta oscillatory activity along prefrontal–visual cortical pathways during visual selective attention.

    Abstract

    Studies of visual attention have implicated oscillatory activity in the recognition, protection and temporal organization of attended representations in visual cortices. These studies have also shown that higher‐order regions such as the prefrontal cortex are critical to attentional processing, but far less is understood regarding prefrontal laterality differences in attention processing. To examine this, we selectively applied high‐definition transcranial direct current stimulation (HD‐tDCS) to the left or right dorsolateral prefrontal cortex (DLPFC). We predicted that HD‐tDCS of the leftversusright prefrontal cortex would differentially modulate performance on a visual selective attention task, and alter the underlying oscillatory network dynamics. Our randomized crossover design included 27 healthy adults that underwent three separate sessions of HD‐tDCS (sham, left DLPFC and right DLPFC) for 20 min. Following stimulation, participants completed an attention protocol during magnetoencephalography. The resulting oscillatory dynamics were imaged using beamforming, and peak task‐related neural activity was subjected to dynamic functional connectivity analyses to evaluate the impact of stimulation site (i.e. left and right DLPFC) on neural interactions. Our results indicated that HD‐tDCS over the left DLPFC differentially modulated right fronto‐visual functional connectivity within the theta band compared to HD‐tDCS of the right DLPFC and further, specifically modulated the oscillatory response for detecting targets among an array of distractors. Importantly, these findings provide network‐specific insight into the complex oscillatory mechanisms serving visual selective attention.

     
    more » « less
  3. Abstract

    The real world is uncertain, and while ever changing, it constantly presents itself in terms of new sets of behavioral options. To attain the flexibility required to tackle these challenges successfully, most mammalian brains are equipped with certain computational abilities that rely on the prefrontal cortex (PFC). By examining learning in terms of internal models associating stimuli, actions, and outcomes, we argue here that adaptive behavior relies on specific interactions between multiple systems including: (1) selective models learning stimulus–action associations through rewards; (2) predictive models learning stimulus- and/or action–outcome associations through statistical inferences anticipating behavioral outcomes; and (3) contextual models learning external cues associated with latent states of the environment. Critically, the PFC combines these internal models by forming task sets to drive behavior and, moreover, constantly evaluates the reliability of actor task sets in predicting external contingencies to switch between task sets or create new ones. We review different models of adaptive behavior to demonstrate how their components map onto this unifying framework and specific PFC regions. Finally, we discuss how our framework may help to better understand the neural computations and the cognitive architecture of PFC regions guiding adaptive behavior.

     
    more » « less
  4. Abstract

    The meta-reinforcement learning (meta-RL) framework, which involves RL over multiple timescales, has been successful in training deep RL models that generalize to new environments. It has been hypothesized that the prefrontal cortex may mediate meta-RL in the brain, but the evidence is scarce. Here we show that the orbitofrontal cortex (OFC) mediates meta-RL. We trained mice and deep RL models on a probabilistic reversal learning task across sessions during which they improved their trial-by-trial RL policy through meta-learning. Ca2+/calmodulin-dependent protein kinase II-dependent synaptic plasticity in OFC was necessary for this meta-learning but not for the within-session trial-by-trial RL in experts. After meta-learning, OFC activity robustly encoded value signals, and OFC inactivation impaired the RL behaviors. Longitudinal tracking of OFC activity revealed that meta-learning gradually shapes population value coding to guide the ongoing behavioral policy. Our results indicate that two distinct RL algorithms with distinct neural mechanisms and timescales coexist in OFC to support adaptive decision-making.

     
    more » « less
  5. Abstract

    Although brain–machine interfaces (BMIs) are directly controlled by the modulation of a select local population of neurons, distributed networks consisting of cortical and subcortical areas have been implicated in learning and maintaining control. Previous work in rodents has demonstrated the involvement of the striatum in BMI learning. However, the prefrontal cortex has been largely ignored when studying motor BMI control despite its role in action planning, action selection, and learning abstract tasks. Here, we compare local field potentials simultaneously recorded from primary motor cortex (M1), dorsolateral prefrontal cortex (DLPFC), and the caudate nucleus of the striatum (Cd) while nonhuman primates perform a two-dimensional, self-initiated, center-out task under BMI control and manual control. Our results demonstrate the presence of distinct neural representations for BMI and manual control in M1, DLPFC, and Cd. We find that neural activity from DLPFC and M1 best distinguishes control types at the go cue and target acquisition, respectively, while M1 best predicts target-direction at both task events. We also find effective connectivity from DLPFC → M1 throughout both control types and Cd → M1 during BMI control. These results suggest distributed network activity between M1, DLPFC, and Cd during BMI control that is similar yet distinct from manual control.

     
    more » « less