Affiliation: Tamagawa University
- Inter-module credit assignment in modular reinforcement learningKazuyuki Samejima
Human Information Science Laboratories, ATR International, 2 2 2 Hikaridai, Seika, Soraku, Kyoto 619 0288, Japan
Neural Netw 16:985-94. 2003..We implement modular reward for a multiple model-based reinforcement learning (MMRL) architecture and show its effectiveness in simulations of a pursuit task with hidden states and a continuous-time non-linear control task...
- A neural correlate of reward-based behavioral learning in caudate nucleus: a functional magnetic resonance imaging study of a stochastic decision taskMasahiko Haruno
Computational Neuroscience Laboratories, Advanced Telecommunications Research Institute, Kyoto 619 0288, Japan
J Neurosci 24:1660-5. 2004..These findings suggest that the caudate nucleus is one of the main loci for reward-based behavioral learning...
- Brain mechanism of reward prediction under predictable and unpredictable environmental dynamicsSaori C Tanaka
Department of Bioinformatics and Genomics, Nara Institute of Science and Technology, Japan
Neural Netw 19:1233-41. 2006....
- Multiple representations of belief states and action values in corticobasal ganglia loopsKazuyuki Samejima
Tamagawa University Brain Science Institute, 6 1 1 Tamagawa gakuen, Machida, Tokyo, Japan 195 8610
Ann N Y Acad Sci 1104:213-28. 2007..Furthermore, the medial prefrontal cortex (MPFC) coordinates these multiple value representations and actions at different levels of hierarchy by monitoring the error in predictions...
- Representation of action-specific reward values in the striatumKazuyuki Samejima
Department of Computational Neurobiology, ATR Computational Neuroscience Laboratories, 619 0288 Kyoto, Japan
Science 310:1337-40. 2005..Fewer neurons were tuned to relative values or action choice. These results suggest representation of action values in the striatum, which can guide action selection in the basal ganglia circuit...
- Stimulus-dependent adjustment of reward prediction error in the midbrainHiromasa Takemura
Department of Life Sciences, The University of Tokyo, Tokyo, Japan
PLoS ONE 6:e28337. 2011..These results suggest that the human reward system can incorporate the level of the stimulus discriminability flexibly into reward computations by modulating previously acquired reward values for a typical stimulus...
- Neural correlates of cognitive dissonance and choice-induced preference changeKeise Izuma
Brain Science Institute, Tamagawa University, Machida, Tokyo 194 8610, Japan
Proc Natl Acad Sci U S A 107:22014-9. 2010..Our findings provide important insights into the neural basis of how actions can alter an individual's preferences...
- Multiple model-based reinforcement learningKenji Doya
Human Information Science Laboratories, ATR International, Seika, Soraku, Kyoto 619 0288, Japan
Neural Comput 14:1347-69. 2002..The performance of MMRL was demonstrated for discrete case in a nonstationary hunting task in a grid world and for continuous case in a nonlinear, nonstationary control task of swinging up a pendulum with variable physical parameters...