TY - GEN
T1 - Robust Waypoint Guidance of a Hexacopter on Mars using Meta-Reinforcement Learning
AU - Federici, Lorenzo
AU - Furfaro, Roberto
AU - Zavoli, Alessandro
AU - Matteis, Guido De
N1 - Publisher Copyright:
© 2023, American Institute of Aeronautics and Astronautics Inc, AIAA. All rights reserved.
PY - 2023
Y1 - 2023
N2 - This paper presents a meta-reinforcement learning approach to the robust and autonomous waypoint guidance of a six-rotor unmanned aerial vehicle in Mars’ atmosphere. The metalearning is implemented by using a recurrent neural network as a control policy to map data about the hexacopter state provided by onboard sensors to the six rotor angular speeds. The network is trained by proximal policy optimization, a state-of-the-art policy gradient reinforcement learning algorithm. During the training, the network is also provided with information about the previous control output and reward, to improve the policy adaptability to different environment instances. Several mission scenarios, involving uncertainties on Mars’ atmosphere’s properties, the presence of random wind gusts, and Gaussian noise on the collected sensor data, are investigated to assess the robustness of the proposed approach in realistic operative conditions. The flexibility and performance of meta-reinforcement learning are also compared against standard reinforcement learning with a fully-connected neural network, to better highlight the potential of the proposed methodology in real-world autonomous guidance applications.
AB - This paper presents a meta-reinforcement learning approach to the robust and autonomous waypoint guidance of a six-rotor unmanned aerial vehicle in Mars’ atmosphere. The metalearning is implemented by using a recurrent neural network as a control policy to map data about the hexacopter state provided by onboard sensors to the six rotor angular speeds. The network is trained by proximal policy optimization, a state-of-the-art policy gradient reinforcement learning algorithm. During the training, the network is also provided with information about the previous control output and reward, to improve the policy adaptability to different environment instances. Several mission scenarios, involving uncertainties on Mars’ atmosphere’s properties, the presence of random wind gusts, and Gaussian noise on the collected sensor data, are investigated to assess the robustness of the proposed approach in realistic operative conditions. The flexibility and performance of meta-reinforcement learning are also compared against standard reinforcement learning with a fully-connected neural network, to better highlight the potential of the proposed methodology in real-world autonomous guidance applications.
UR - https://www.scopus.com/pages/publications/85200370695
UR - https://www.scopus.com/inward/citedby.url?scp=85200370695&partnerID=8YFLogxK
U2 - 10.2514/6.2023-2663
DO - 10.2514/6.2023-2663
M3 - Conference contribution
AN - SCOPUS:85200370695
SN - 9781624106996
T3 - AIAA SciTech Forum and Exposition, 2023
BT - AIAA SciTech Forum and Exposition, 2023
PB - American Institute of Aeronautics and Astronautics Inc, AIAA
T2 - AIAA SciTech Forum and Exposition, 2023
Y2 - 23 January 2023 through 27 January 2023
ER -