Risk prediction error signaling: A two-component response?
Organisms use rewards to navigate and adapt to (uncertain) environments. Error-based learning about rewards issupported by the dopaminergic system, which is thought to signal reward prediction errors to make adjustmentsto past predictions. More recently, the phasic dopamine response was suggested to have two components: thefirstrapid component is thought to signal the detection of a potentially rewarding stimulus; the second, slightly latercomponent characterizes the stimulus by its reward prediction error. Error-based learning signals have also beenfound for risk. However, whether the neural generators of these signals employ a two-component coding schemelike the dopaminergic system is unknown. Here, using human high density EEG, we ask whether risk learning, ormore generally speaking surprise-based learning under uncertainty, is similarly comprised of two temporallydissociable components. Using a simple card game, we show that the risk prediction error is reflected in theamplitude of the P3b component. This P3b modulation is preceded by an earlier component, that is modulated bythe stimulus salience. Source analyses are compatible with the idea that both the early salience signal and the laterrisk prediction error signal are generated in insular, frontal, and temporal cortex. The identified sources are partsof the risk processing network that receives input from noradrenergic cells in the locus coeruleus. Finally, the P3bamplitude modulation is mirrored by an analogous modulation of pupil size, which is consistent with the idea thatboth the P3b and pupil size indirectly reflect locus coeruleus activity.
1-s2.0-S1053811920302536-main.pdf
Publisher's version
openaccess
CC BY
2.17 MB
Adobe PDF
3e66916ad31a0dd0e0cda75355d23e0b