Research on UAV Anti-Pursing Maneuvering Decision Based on Improved Twin Delayed Deep Deterministic Policy Gradient Method

Home > Archive>Volume 22, Issue 4, 2021 >15-21

Research on UAV Anti-Pursing Maneuvering Decision Based on Improved Twin Delayed Deep Deterministic Policy Gradient Method
DOI:
                        
CSTR:
                        
Author:
                        
Affiliation:
Clc Number:V279
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

In view of the problem of autonomous maneuvering counterpursuing in close air combat, a Markov decisionmaking process model for UAV counterpursuing is established, and for the abovementioned reasons, an autonomous maneuvering decisionmaking method for unmanned aerial vehicles (UAVs) based on deep reinforcement learning is proposed. The new method is based on the empirical replay area reconstruction, and improves the Twin Delayed Deep Deterministic policy gradient (TD3) algorithm, and generates the optimal strategy network by fitting the strategy function and the state action value function. The simulation experiments show that under condition of random initial position/attitude, being confronted with the drones adopted by the pure pursuit methods, the winning rate of intelligent drones trained by this method exceeds 93%. Compared with traditional TD3 and Deep Deterministic policy gradient (DDPG), this method is faster at convergence and higher in stability.

Reference

Cited by

Get Citation

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:
Revised:
Adopted:
Online: September 13,2021
Published:

Welcome to visit《 Journal of Air Force Engineering University 》Official website!

Home

Journal Introduction in Brief

Notices to Submission

Submission Process

Publishing Ethics

Contact Us

中文

Get Citation

Share

Article Metrics

History

Article QR Code