TY - GEN
T1 - Six degree-of-freedom hovering using lidar altimetry via reinforcement meta-learning
AU - Gaudet, Brian
AU - Linares, Richard
AU - Furfaro, Robert
N1 - Publisher Copyright:
© 2020, American Institute of Aeronautics and Astronautics Inc, AIAA. All rights reserved.
PY - 2020
Y1 - 2020
N2 - We optimize a six degrees of freedom hovering policy using reinforcement meta-learning. The policy maps flash LIDAR measurements directly to on/off spacecraft body-frame thrust commands, allowing hovering at a fixed position and attitude in the asteroid body-fixed reference frame. Importantly, the policy does not require position and velocity estimates, and can operate in environments with unknown dynamics, and without an asteroid shape model or navigation aids. Indeed, during optimization the agent is confronted with a new randomly generated asteroid for each episode, insuring that it does not learn an asteroid’s shape, texture, or environmental dynamics. This allows the deployed policy to generalize well to novel asteroid characteristics, which we demonstrate in our experiments. The hovering controller has the potential to simplify mission planning by allowing asteroid body-fixed hovering immediately upon the spacecraft’s arrival to an asteroid. This in turn simplifies shape model generation and allows resource mapping via remote sensing immediately upon arrival at the target asteroid.
AB - We optimize a six degrees of freedom hovering policy using reinforcement meta-learning. The policy maps flash LIDAR measurements directly to on/off spacecraft body-frame thrust commands, allowing hovering at a fixed position and attitude in the asteroid body-fixed reference frame. Importantly, the policy does not require position and velocity estimates, and can operate in environments with unknown dynamics, and without an asteroid shape model or navigation aids. Indeed, during optimization the agent is confronted with a new randomly generated asteroid for each episode, insuring that it does not learn an asteroid’s shape, texture, or environmental dynamics. This allows the deployed policy to generalize well to novel asteroid characteristics, which we demonstrate in our experiments. The hovering controller has the potential to simplify mission planning by allowing asteroid body-fixed hovering immediately upon the spacecraft’s arrival to an asteroid. This in turn simplifies shape model generation and allows resource mapping via remote sensing immediately upon arrival at the target asteroid.
UR - http://www.scopus.com/inward/record.url?scp=85091916232&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85091916232&partnerID=8YFLogxK
U2 - 10.2514/6.2020-0953
DO - 10.2514/6.2020-0953
M3 - Conference contribution
AN - SCOPUS:85091916232
SN - 9781624105951
T3 - AIAA Scitech 2020 Forum
BT - AIAA Scitech 2020 Forum
PB - American Institute of Aeronautics and Astronautics Inc, AIAA
T2 - AIAA Scitech Forum, 2020
Y2 - 6 January 2020 through 10 January 2020
ER -