TY - GEN
T1 - Line of Sight Curvature for Missile Guidance using Reinforcement Meta-Learning
AU - Gaudet, Brian
AU - Furfaro, Roberto
N1 - Publisher Copyright:
© 2023, American Institute of Aeronautics and Astronautics Inc, AIAA. All rights reserved.
PY - 2023
Y1 - 2023
N2 - We use reinforcement meta learning to optimize a line of sight curvature policy that increases the effectiveness of a guidance system against maneuvering targets. The policy is implemented as a recurrent neural network that maps navigation system outputs to a Euler 321 attitude representation. The attitude representation is then used to construct a direction cosine matrix that biases the observed line of sight vector. The line of sight rotation rate derived from the biased line of sight is then mapped to a commanded acceleration by the guidance system. By varying the bias as a function of navigation system outputs, the policy enhances accuracy against highly maneuvering targets. Importantly, our method does not require an estimate of target acceleration. In our experiments, we demonstrate that when our method is combined with proportional navigation, the system significantly outperforms augmented proportional navigation with perfect knowledge of target acceleration, achieving improved accuracy with less control effort against a wide range of target maneuvers.
AB - We use reinforcement meta learning to optimize a line of sight curvature policy that increases the effectiveness of a guidance system against maneuvering targets. The policy is implemented as a recurrent neural network that maps navigation system outputs to a Euler 321 attitude representation. The attitude representation is then used to construct a direction cosine matrix that biases the observed line of sight vector. The line of sight rotation rate derived from the biased line of sight is then mapped to a commanded acceleration by the guidance system. By varying the bias as a function of navigation system outputs, the policy enhances accuracy against highly maneuvering targets. Importantly, our method does not require an estimate of target acceleration. In our experiments, we demonstrate that when our method is combined with proportional navigation, the system significantly outperforms augmented proportional navigation with perfect knowledge of target acceleration, achieving improved accuracy with less control effort against a wide range of target maneuvers.
UR - http://www.scopus.com/inward/record.url?scp=85200117122&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85200117122&partnerID=8YFLogxK
U2 - 10.2514/6.2023-1627
DO - 10.2514/6.2023-1627
M3 - Conference contribution
AN - SCOPUS:85200117122
SN - 9781624106996
T3 - AIAA SciTech Forum and Exposition, 2023
BT - AIAA SciTech Forum and Exposition, 2023
PB - American Institute of Aeronautics and Astronautics Inc, AIAA
T2 - AIAA SciTech Forum and Exposition, 2023
Y2 - 23 January 2023 through 27 January 2023
ER -