TY - GEN
T1 - Bayesian 3D tracking from monocular video
AU - Brau, Ernesto
AU - Guan, Jinyan
AU - Simek, Kyle
AU - Pero, Luca Del
AU - Dawson, Colin Reimer
AU - Barnard, Kobus
PY - 2013
Y1 - 2013
N2 - We develop a Bayesian modeling approach for tracking people in 3D from monocular video with unknown cameras. Modeling in 3D provides natural explanations for occlusions and smoothness discontinuities that result from projection, and allows priors on velocity and smoothness to be grounded in physical quantities: meters and seconds vs. pixels and frames. We pose the problem in the context of data association, in which observations are assigned to tracks. A correct application of Bayesian inference to multi-target tracking must address the fact that the model's dimension changes as tracks are added or removed, and thus, posterior densities of different hypotheses are not comparable. We address this by marginalizing out the trajectory parameters so the resulting posterior over data associations has constant dimension. This is made tractable by using (a) Gaussian process priors for smooth trajectories and (b) approximately Gaussian likelihood functions. Our approach provides a principled method for incorporating multiple sources of evidence, we present results using both optical flow and object detector outputs. Results are comparable to recent work on 3D tracking and, unlike others, our method requires no pre-calibrated cameras.
AB - We develop a Bayesian modeling approach for tracking people in 3D from monocular video with unknown cameras. Modeling in 3D provides natural explanations for occlusions and smoothness discontinuities that result from projection, and allows priors on velocity and smoothness to be grounded in physical quantities: meters and seconds vs. pixels and frames. We pose the problem in the context of data association, in which observations are assigned to tracks. A correct application of Bayesian inference to multi-target tracking must address the fact that the model's dimension changes as tracks are added or removed, and thus, posterior densities of different hypotheses are not comparable. We address this by marginalizing out the trajectory parameters so the resulting posterior over data associations has constant dimension. This is made tractable by using (a) Gaussian process priors for smooth trajectories and (b) approximately Gaussian likelihood functions. Our approach provides a principled method for incorporating multiple sources of evidence, we present results using both optical flow and object detector outputs. Results are comparable to recent work on 3D tracking and, unlike others, our method requires no pre-calibrated cameras.
KW - 3D scene modeling
KW - Bayesian inference
KW - MCMCDA
KW - multi-object tracking
UR - http://www.scopus.com/inward/record.url?scp=84898821643&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84898821643&partnerID=8YFLogxK
U2 - 10.1109/ICCV.2013.418
DO - 10.1109/ICCV.2013.418
M3 - Conference contribution
AN - SCOPUS:84898821643
SN - 9781479928392
T3 - Proceedings of the IEEE International Conference on Computer Vision
SP - 3368
EP - 3375
BT - Proceedings - 2013 IEEE International Conference on Computer Vision, ICCV 2013
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2013 14th IEEE International Conference on Computer Vision, ICCV 2013
Y2 - 1 December 2013 through 8 December 2013
ER -