TY - JOUR
T1 - Balancing exploration and exploitation with information and randomization
AU - Wilson, Robert C.
AU - Bonawitz, Elizabeth
AU - Costa, Vincent D.
AU - Ebitz, R. Becket
N1 - Publisher Copyright:
© 2020 Elsevier Ltd
PY - 2021/4
Y1 - 2021/4
N2 - Explore-exploit decisions require us to trade off the benefits of exploring unknown options to learn more about them, with exploiting known options, for immediate reward. Such decisions are ubiquitous in nature, but from a computational perspective, they are notoriously hard. There is therefore much interest in how humans and animals make these decisions and recently there has been an explosion of research in this area. Here we provide a biased and incomplete snapshot of this field focusing on the major finding that many organisms use two distinct strategies to solve the explore-exploit dilemma: a bias for information (‘directed exploration’) and the randomization of choice (‘random exploration’). We review evidence for the existence of these strategies, their computational properties, their neural implementations, as well as how directed and random exploration vary over the lifespan. We conclude by highlighting open questions in this field that are ripe to both explore and exploit.
AB - Explore-exploit decisions require us to trade off the benefits of exploring unknown options to learn more about them, with exploiting known options, for immediate reward. Such decisions are ubiquitous in nature, but from a computational perspective, they are notoriously hard. There is therefore much interest in how humans and animals make these decisions and recently there has been an explosion of research in this area. Here we provide a biased and incomplete snapshot of this field focusing on the major finding that many organisms use two distinct strategies to solve the explore-exploit dilemma: a bias for information (‘directed exploration’) and the randomization of choice (‘random exploration’). We review evidence for the existence of these strategies, their computational properties, their neural implementations, as well as how directed and random exploration vary over the lifespan. We conclude by highlighting open questions in this field that are ripe to both explore and exploit.
UR - http://www.scopus.com/inward/record.url?scp=85095438281&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85095438281&partnerID=8YFLogxK
U2 - 10.1016/j.cobeha.2020.10.001
DO - 10.1016/j.cobeha.2020.10.001
M3 - Review article
AN - SCOPUS:85095438281
SN - 2352-1546
VL - 38
SP - 49
EP - 56
JO - Current Opinion in Behavioral Sciences
JF - Current Opinion in Behavioral Sciences
ER -