TY - GEN
T1 - Reinforcement meta-learning for angle-only intercept guidance of maneuvering targets
AU - Gaudet, Brian
AU - Furfaro, Roberto
AU - Linares, Richard
N1 - Publisher Copyright:
© 2020, American Institute of Aeronautics and Astronautics Inc, AIAA. All rights reserved.
PY - 2020
Y1 - 2020
N2 - We present a novel guidance law that uses observations consisting solely of seeker line of sight angle measurements and their rate of change. The policy is optimized using reinforcement meta-learning and demonstrated in a simulated terminal phase of a mid-course exo-atmospheric interception. Importantly, the guidance law does not require range estimation, making it particularly suitable for passive seekers. The optimized policy maps stabilized seeker line of sight angles and their rate of change directly to commanded thrust for the mis-sile’s divert thrusters. The use of reinforcement meta-learning allows the optimized policy to adapt to target acceleration, and we demonstrate that the policy has superior performance as compared to augmented zero-effort miss guidance with perfect target acceleration knowledge. The optimized policy is computationally efficient and requires minimal memory, and should be compatible with today’s flight processors.
AB - We present a novel guidance law that uses observations consisting solely of seeker line of sight angle measurements and their rate of change. The policy is optimized using reinforcement meta-learning and demonstrated in a simulated terminal phase of a mid-course exo-atmospheric interception. Importantly, the guidance law does not require range estimation, making it particularly suitable for passive seekers. The optimized policy maps stabilized seeker line of sight angles and their rate of change directly to commanded thrust for the mis-sile’s divert thrusters. The use of reinforcement meta-learning allows the optimized policy to adapt to target acceleration, and we demonstrate that the policy has superior performance as compared to augmented zero-effort miss guidance with perfect target acceleration knowledge. The optimized policy is computationally efficient and requires minimal memory, and should be compatible with today’s flight processors.
UR - http://www.scopus.com/inward/record.url?scp=85091936679&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85091936679&partnerID=8YFLogxK
U2 - 10.2514/6.2020-0609
DO - 10.2514/6.2020-0609
M3 - Conference contribution
AN - SCOPUS:85091936679
SN - 9781624105951
T3 - AIAA Scitech 2020 Forum
SP - 1
EP - 16
BT - AIAA Scitech 2020 Forum
PB - American Institute of Aeronautics and Astronautics Inc, AIAA
T2 - AIAA Scitech Forum, 2020
Y2 - 6 January 2020 through 10 January 2020
ER -