Missile homing-phase guidance law design using reinforcement learning

Brian Gaudet, Roberto Furfaro

Research output: Chapter in Book/Report/Conference proceedingConference contribution

36 Scopus citations

Abstract

A new approach to missile guidance law design is proposed, where reinforcement learning (RL) is used to learn a homing-phase guidance law that is optimal with respect to the missile's airframe dynamics as well as sensor and actuator noise and delays. It is demonstrated that this new approach results in a guidance law giving superior performance to either PN guidance or enhanced PN guidance laws developed using Lyapunov theory. Although optimal control theory can be used to derive an optimal control law under certain idealized modeling assumptions, we discuss how the RL approach gives more flexibility and higher expected performance for real-world systems.

Original languageEnglish (US)
Title of host publicationAIAA Guidance, Navigation, and Control Conference 2012
PublisherAmerican Institute of Aeronautics and Astronautics Inc.
ISBN (Print)9781600869389
DOIs
StatePublished - 2012
EventAIAA Guidance, Navigation, and Control Conference 2012 - Minneapolis, MN, United States
Duration: Aug 13 2012Aug 16 2012

Publication series

NameAIAA Guidance, Navigation, and Control Conference 2012

Other

OtherAIAA Guidance, Navigation, and Control Conference 2012
Country/TerritoryUnited States
CityMinneapolis, MN
Period8/13/128/16/12

ASJC Scopus subject areas

  • Aerospace Engineering
  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Missile homing-phase guidance law design using reinforcement learning'. Together they form a unique fingerprint.

Cite this