Human and Machine Learning in Non-Markovian Decision Making

Clarke, Aaron Michael; Friedrich, Johannes; Tartaglia, Elisa M.; Marchesotti, Silvia; Senn, Walter; Herzog, Michael H.

doi:10.6082/ed0e6-ktw89

Published April 21, 2015 | Version v1

Journal article Open

Human and Machine Learning in Non-Markovian Decision Making

1. École Polytechnique Fédérale de Lausanne
2. University of Berne
3. University of Chicago

Humans can learn under a wide variety of feedback conditions. Reinforcement learning (RL), where a series of rewarded decisions must be made, is a particularly important type of learning. Computational and behavioral studies of RL have focused mainly on Markovian decision processes, where the next state depends on only the current state and action. Little is known about non-Markovian decision making, where the next state depends on more than the current state and action. Learning is non-Markovian, for example, when there is no unique mapping between actions and feedback. We have produced a model based on spiking neurons that can handle these non-Markovian conditions by performing policy gradient descent [1]. Here, we examine the model's performance and compare it with human learning and a Bayes optimal reference, which provides an upper-bound on performance. We find that in all cases, our population of spiking neurons model well-describes human performance.

Data availability

All relevant data are plotted in the manuscript. In addition, the raw data are also available at the Open Science Framework (https://osf.io/login/?next=/9sacv/): https://osf.io/9sacv/?view_only=187a993a964342cfab1e8f7f65678fa9.

Files

journal.pone.0123105.pdf

Files (876.5 kB)

Name	Size	Download all
journal.pone.0123105.pdf Article md5:8fe469f014bff2cb081e2b54ac08994d	532.2 kB	Preview Download
journal.pone.0123105.zip md5:d5cc446e01f243233375c0e6e9182b08	344.4 kB	Preview Download

Additional details

DOI: 10.1371/journal.pone.0123105
Other: oai:uchicago.tind.io:10371

Swiss National Science Foundation
Learning from delayed and sparse feedback
Human Brain Project
SystemsX.ch
Swiss National Science Foundation
Perspective Researcher fellowship
ProDoc
Top-down and bottom-up processes in perceptual learning
Swiss National Science Foundation
Perspective Researcher fellowship

Division(s): Physical Sciences Division
Department(s): Neurobiology, Statistics

	All versions	This version
Views	0	0
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Human and Machine Learning in Non-Markovian Decision Making

Data availability

Files

journal.pone.0123105.pdf

Files (876.5 kB)

Additional details

Identifiers

Funding

UChicago Information

Human and Machine Learning in Non-Markovian Decision Making

Creators

Description

Data availability

Files

journal.pone.0123105.pdf

Files (876.5 kB)

Additional details

Identifiers

Funding

UChicago Information