Welcome to visit《 Journal of Air Force Engineering University 》Official website!

Consultation hotline:029-84786242 RSS EMAIL-ALERT
Maneuvering Decision of UCAV in Close Air Combat Based on LSTM-PPO Algorithm
DOI:
CSTR:
Author:
Affiliation:

Clc Number:

V271.4

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    With the increasing military application of unmanned combat aircraft (UCAV), unmanned combat will become the main combat mode in the future air battlefield. In closerange air combat, the environment is complex and the combat situation changes rapidly. The method based on game theory cannot meet the realtime requirements due to the large amount of data iteration, and the datadriven method has the problems of long training time and low execution efficiency. To solve this problem, a UCAV maneuver decision method based on deep reinforcement learning algorithm is proposed in this paper. Firstly, the flight drive module is constructed on the basis of UCAV threedegreeoffreedom model to form the state transition updating mechanism. Then, on the basis of PPO algorithm, ornsteinuhlenbeck (OU) random noise was added to improve UCAV's ability to explore unknown state space, and LSTM was combined to enhance UCAV's ability to learn sequence sample data, so as to improve the training efficiency and effect of the algorithm. Finally, the effectiveness and superiority of the proposed method are verified by designing three groups of closerange air combat simulation experiments and comparing the performance with PPO algorithm.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:
  • Revised:
  • Adopted:
  • Online: July 18,2022
  • Published: June 25,2022
Article QR Code