skip to main content


Title: Self-Supervised Online Reward Shaping in Sparse-Reward Environments
Award ID(s):
1925082 1749204 1724157 1638107
NSF-PAR ID:
10301964
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Proceedings of the IEEERSJ International Conference on Intelligent Robots and Systems
ISSN:
2153-0858
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The reward-hypersensitivity model posits that trait reward hypersensitivity should elicit hyper/hypo-approach motivation following exposure to recent life events that activate (goal striving and goal attainment) or deactivate (goal failure) the reward system, respectively. To test these hypotheses, we had 87 young adults with high trait reward (HRew) sensitivity or moderate trait reward (MRew) sensitivity report frequency of life events via the Life Event Interview. Brain activation was assessed during the functional MRI monetary-incentive-delay task. Greater exposure to goal-striving events was associated with higher nucleus accumbens (NAc) reward anticipation among HRew participants and lower orbitofrontal cortex (OFC) reward anticipation among MRew participants. Greater exposure to goal-failure events was associated with higher NAc and OFC reward anticipation only among HRew participants. This study demonstrated different neural reward anticipation (but not outcome) following reward-relevant events for HRew individuals compared with MRew individuals. Trait reward sensitivity and reward-relevant life events may jointly modulate reward-related brain function, which has implications for understanding psychopathology.

     
    more » « less
  2. Do dopaminergic reward structures represent the expected utility of information similarly to a reward? Optimal experimental design models from Bayesian decision theory and statistics have proposed a theoretical framework for quantifying the expected value of information that might result from a query. In particular, this formulation quantifies the value of information before the answer to that query is known, in situations where payoffs are unknown and the goal is purely epistemic: That is, to increase knowledge about the state of the world. Whether and how such a theoretical quantity is represented in the brain is unknown. Here we use an event-related functional MRI (fMRI) task design to disentangle information expectation, information revelation and categorization outcome anticipation, and response-contingent reward processing in a visual probabilistic categorization task. We identify a neural signature corresponding to the expectation of information, involving the left lateral ventral striatum. Moreover, we show a temporal dissociation in the activation of different reward-related regions, including the nucleus accumbens, medial prefrontal cortex, and orbitofrontal cortex, during information expectation versus reward-related processing.

     
    more » « less