发表状态 | 已发表Published |
题名 | UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient |
作者 | |
发表日期 | 2022 |
发表期刊 | Wireless Communications and Mobile Computing
![]() |
ISSN/eISSN | 1530-8669 |
卷号 | 2022 |
摘要 | Deep deterministic policy gradient (DDPG) algorithm is a reinforcement learning method, which has been widely used in UAV path planning. However, the critic network of DDPG is frequently updated in the training process. It leads to an inevitable overestimation problem and increases the training computational complexity. Therefore, this paper presents a multicritic-delayed DDPG method for solving the UAV path planning. It uses multicritic networks and delayed learning methods to reduce the overestimation problem of DDPG and adds noise to improve the robustness in the real environment. Moreover, a UAV mission platform is built to train and evaluate the effectiveness and robustness of the proposed method. Simulation results show that the proposed algorithm has a higher convergence speed, a better convergence effect, and stability. It indicates that UAV can learn more knowledge from the complex environment. |
DOI | 10.1155/2022/9017079 |
URL | 查看来源 |
收录类别 | SCIE |
语种 | 英语English |
WOS研究方向 | Computer Science ; Engineering ; Telecommunications |
WOS类目 | Computer Science, Information Systems ; Engineering, Electrical & Electronic ; Telecommunications |
WOS记录号 | WOS:000806502100016 |
Scopus入藏号 | 2-s2.0-85127888123 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | https://repository.uic.edu.cn/handle/39GCC9TT/8926 |
专题 | 理工科技学院 |
通讯作者 | Gu, Fangqing |
作者单位 | 1.School of Mathematics and Statistics,Guangdong University of Technology,Guangzhou,China 2.Beijing Normal University-Hong Kong Baptist University United International College,Zhuhai,China |
推荐引用方式 GB/T 7714 | Wu, Runjia,Gu, Fangqing,Liu, Hailinet al. UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient[J]. Wireless Communications and Mobile Computing, 2022, 2022. |
APA | Wu, Runjia, Gu, Fangqing, Liu, Hailin, & Shi, Hongjian. (2022). UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient. Wireless Communications and Mobile Computing, 2022. |
MLA | Wu, Runjia,et al."UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient". Wireless Communications and Mobile Computing 2022(2022). |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论