TY - CONF
T1 - Variational information planning for sequential decision making
AU - Pacheco, Jason
AU - Fisher, John W.
N1 - Funding Information:
thank Sue Zheng for useful technical discussions. This work was partially supported by the ONR (N00014-17-1-2072), the Department of Energy (CVT Consortium), and DNDO under the ARI program.
Funding Information:
The first author would like to thank Sue Zheng for useful technical discussions. This work was partially supported by the ONR (N00014-17-1-2072), the Department of Energy (CVT Consortium), and DNDO under the ARI program.
Publisher Copyright:
© 2019 by the author(s).
PY - 2020
Y1 - 2020
N2 - We consider the setting of sequential decision making where, at each stage, potential actions are evaluated based on expected reduction in posterior uncertainty, given by mutual information (MI). As MI typically lacks a closed form, we propose an approach which maintains variational approximations of, both, the posterior and MI utility. Our planning objective extends an established variational bound on MI to the setting of sequential planning. The result, variational information planning (VIP), is an efficient method for sequential decision making. We further establish convexity of the variational planning objective and, under conditional exponential family approximations, we show that the optimal MI bound arises from a relaxation of the well-known exponential family moment matching property. We demonstrate VIP for sensor selection, experiment design, and active learning, where it meets or exceeds methods requiring more computation, or those specialized to the task.
AB - We consider the setting of sequential decision making where, at each stage, potential actions are evaluated based on expected reduction in posterior uncertainty, given by mutual information (MI). As MI typically lacks a closed form, we propose an approach which maintains variational approximations of, both, the posterior and MI utility. Our planning objective extends an established variational bound on MI to the setting of sequential planning. The result, variational information planning (VIP), is an efficient method for sequential decision making. We further establish convexity of the variational planning objective and, under conditional exponential family approximations, we show that the optimal MI bound arises from a relaxation of the well-known exponential family moment matching property. We demonstrate VIP for sensor selection, experiment design, and active learning, where it meets or exceeds methods requiring more computation, or those specialized to the task.
UR - http://www.scopus.com/inward/record.url?scp=85085040219&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85085040219&partnerID=8YFLogxK
M3 - Paper
AN - SCOPUS:85085040219
T2 - 22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019
Y2 - 16 April 2019 through 18 April 2019
ER -