Waypoint-Based generalized ZEM/ZEV feedback guidance for planetary landing via a reinforcement learning approach

Roberto Furfaro, Richard Linares

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

18 Scopus citations

Abstract

Precision landing on large planetary bodies is a critical technology for future human and robotic exploration of the solar system. Indeed, over the past decade, landing systems for robotic Mars missions have been developed with the specific goal of deploying robotic agents (e.g., rovers, landers) on the Martian surface. In this paper, we propose a novel algorithm that generates powered, closed-loop trajectories that enforce flight constraints (e.g., no crashing on sloped surfaces) while ensuring precision landing. More specifically, we propose a waypoint-based ZEM/ZEV algorithm that employs a dynamic programming approach via Value Iteration to determine the best location of the waypoints for a set of constrained landings on large planetary bodies (e.g., the Moon and Mars). The Reinforcement Learning (RL) framework is employed to integrate ZEM/ZEV with a waypoint selection policy that is a function of the current state of the spacecraft (i.e., position and velocity) during the powered descent phase. First, a set of open-loop, constrained, fuel-efficient trajectories is numerically computed using pseudo-spectral methods. A set of states from these open-loop optimal trajectories is stored as candidate waypoints. The ZEM/ZEV algorithm then uses these waypoints as intermediate targets to steer the spacecraft toward the final target point on the planetary surface. The problem is cast as a Markov Decision Process (MDP), and the resulting dynamic programming problem is solved via generalized policy evaluation to select the next best intermediate target point as a function of the previous one. The behavior of the integrated guidance algorithm is evaluated in Mars powered landing scenarios that involve demanding requirements on both landing location and flight path. Both constraint satisfaction and fuel efficiency are analyzed to show the effectiveness of the proposed approach.
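The two ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The first function is the classical ZEM/ZEV feedback law for a uniform gravity field, which steers toward a target position and velocity over a time-to-go `t_go`. The second runs value iteration on a small deterministic MDP in which the state is the current waypoint and the action is the next waypoint. The function names, the dictionary layout of `costs`, and the idea of using a single scalar cost per leg are assumptions made for illustration; the abstract does not specify the reward structure used in the paper.

```python
from typing import Dict, Tuple

Vec = Tuple[float, float, float]


def zem_zev_accel(r: Vec, v: Vec, r_f: Vec, v_f: Vec, g: Vec, t_go: float) -> Vec:
    """Classical ZEM/ZEV commanded acceleration, assuming uniform gravity g.

    ZEM = r_f - (r + v*t_go + 0.5*g*t_go^2)   (zero-effort miss)
    ZEV = v_f - (v + g*t_go)                  (zero-effort velocity error)
    a   = 6/t_go^2 * ZEM - 2/t_go * ZEV
    """
    zem = tuple(rf - (ri + vi * t_go + 0.5 * gi * t_go ** 2)
                for ri, vi, rf, gi in zip(r, v, r_f, g))
    zev = tuple(vf - (vi + gi * t_go) for vi, vf, gi in zip(v, v_f, g))
    return tuple(6.0 / t_go ** 2 * m - 2.0 / t_go * e for m, e in zip(zem, zev))


def value_iteration(costs: Dict[int, Dict[int, float]], terminal: int,
                    tol: float = 1e-9, max_iter: int = 1000):
    """Value iteration for a deterministic, undiscounted waypoint-selection MDP.

    costs[s][s_next] is a hypothetical cost of flying the leg s -> s_next
    (for example fuel plus a constraint penalty). The terminal waypoint is
    the landing target and has zero cost-to-go.
    """
    V = {s: 0.0 for s in costs}
    V[terminal] = 0.0
    for _ in range(max_iter):
        V_new = {terminal: 0.0}
        for s, actions in costs.items():
            # Bellman backup: best leg cost plus cost-to-go of the next waypoint.
            V_new[s] = min(c + V[nxt] for nxt, c in actions.items())
        delta = max(abs(V_new[s] - V[s]) for s in V)
        V = V_new
        if delta < tol:
            break
    # Greedy policy: for each waypoint, the next waypoint with the lowest total cost.
    policy = {s: min(actions, key=lambda nxt: actions[nxt] + V[nxt])
              for s, actions in costs.items()}
    return V, policy


if __name__ == "__main__":
    # Hypothetical leg costs between candidate waypoints 0..3 (3 is the target).
    example_costs = {0: {1: 4.0, 2: 2.0}, 1: {3: 1.0}, 2: {1: 1.0, 3: 5.0}}
    V, policy = value_iteration(example_costs, terminal=3)
    print(V, policy)
```

In a closed-loop simulation under these assumptions, `policy` would supply the next intermediate target `(r_f, v_f)` each time a waypoint is reached, and `zem_zev_accel` would be evaluated at every guidance step with the remaining `t_go` for that leg.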

Original language: English (US)
Title of host publication: Dynamics and Control of Space Systems, DyCoSS 2017
Editors: Yury N. Razoumny, Jean-Michel Contant, Anna D. Guerman, Filippo Graziani
Publisher: Univelt Inc.
Pages: 401-416
Number of pages: 16
ISBN (Print): 9780877036432
State: Published - 2017
Event: 3rd International Academy of Astronautics Conference on Dynamics and Control of Space Systems, DyCoSS 2017 - Moscow, Russian Federation
Duration: May 30 2017 to Jun 1 2017

Publication series

Name: Advances in the Astronautical Sciences
Volume: 161
ISSN (Print): 0065-3438

Other

Other: 3rd International Academy of Astronautics Conference on Dynamics and Control of Space Systems, DyCoSS 2017
Country/Territory: Russian Federation
City: Moscow
Period: 5/30/17 to 6/1/17

ASJC Scopus subject areas

  • Aerospace Engineering
  • Space and Planetary Science
