TY - GEN
T1 - Integrated guidance and control for pinpoint mars landing using reinforcement learning
AU - Gaudet, Brian
AU - Linares, Richard
AU - Furfaro, Roberto
N1 - Publisher Copyright:
© 2018 Univelt Inc. All rights reserved.
PY - 2018
Y1 - 2018
N2 - Future Mars missions will require advanced guidance, navigation, and control algorithms for the powered descent phase in order to target specific surface locations and achieve pinpoint accuracy (landing error ellipse < 5m radius). This requires both a navigation system capable of estimating the lander’s state in real-time and a guidance and control system that can map the estimated lander state to body-frame actuator commands. In this paper we present a novel integrated guidance and control algorithm designed by applying the principles of reinforcement learning theory. The key innovation is the use of reinforcement learning to learn a policy mapping the lander’s estimated state directly to actuator commands, with the policy resulting in accurate and fuel efficient trajectories. Specifically, we use proximal policy optimization, a policy gradient method, to learn the policy. We present simulation results demonstrating the guidance and control system’s performance in a 6-DOF simulation environment, and demonstrate robustness to noise and system parameter uncertainty.
AB - Future Mars missions will require advanced guidance, navigation, and control algorithms for the powered descent phase in order to target specific surface locations and achieve pinpoint accuracy (landing error ellipse < 5m radius). This requires both a navigation system capable of estimating the lander’s state in real-time and a guidance and control system that can map the estimated lander state to body-frame actuator commands. In this paper we present a novel integrated guidance and control algorithm designed by applying the principles of reinforcement learning theory. The key innovation is the use of reinforcement learning to learn a policy mapping the lander’s estimated state directly to actuator commands, with the policy resulting in accurate and fuel efficient trajectories. Specifically, we use proximal policy optimization, a policy gradient method, to learn the policy. We present simulation results demonstrating the guidance and control system’s performance in a 6-DOF simulation environment, and demonstrate robustness to noise and system parameter uncertainty.
UR - https://www.scopus.com/pages/publications/85069482375
UR - https://www.scopus.com/pages/publications/85069482375#tab=citedBy
M3 - Conference contribution
AN - SCOPUS:85069482375
SN - 9780877036579
T3 - Advances in the Astronautical Sciences
SP - 3135
EP - 3154
BT - AAS/AIAA Astrodynamics Specialist Conference, 2018
A2 - Singla, Puneet
A2 - Weisman, Ryan M.
A2 - Marchand, Belinda G.
A2 - Jones, Brandon A.
PB - Univelt Inc.
T2 - AAS/AIAA Astrodynamics Specialist Conference, 2018
Y2 - 19 August 2018 through 23 August 2018
ER -