科研成果详情

发表状态已发表Published
题名UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient
作者
发表日期2022
发表期刊Wireless Communications and Mobile Computing
ISSN/eISSN1530-8669
卷号2022
摘要

Deep deterministic policy gradient (DDPG) algorithm is a reinforcement learning method, which has been widely used in UAV path planning. However, the critic network of DDPG is frequently updated in the training process. It leads to an inevitable overestimation problem and increases the training computational complexity. Therefore, this paper presents a multicritic-delayed DDPG method for solving the UAV path planning. It uses multicritic networks and delayed learning methods to reduce the overestimation problem of DDPG and adds noise to improve the robustness in the real environment. Moreover, a UAV mission platform is built to train and evaluate the effectiveness and robustness of the proposed method. Simulation results show that the proposed algorithm has a higher convergence speed, a better convergence effect, and stability. It indicates that UAV can learn more knowledge from the complex environment.

DOI10.1155/2022/9017079
URL查看来源
收录类别SCIE
语种英语English
WOS研究方向Computer Science ; Engineering ; Telecommunications
WOS类目Computer Science, Information Systems ; Engineering, Electrical & Electronic ; Telecommunications
WOS记录号WOS:000806502100016
Scopus入藏号2-s2.0-85127888123
引用统计
被引频次:4[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/8926
专题理工科技学院
通讯作者Gu, Fangqing
作者单位
1.School of Mathematics and Statistics,Guangdong University of Technology,Guangzhou,China
2.Beijing Normal University-Hong Kong Baptist University United International College,Zhuhai,China
推荐引用方式
GB/T 7714
Wu, Runjia,Gu, Fangqing,Liu, Hailinet al. UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient[J]. Wireless Communications and Mobile Computing, 2022, 2022.
APA Wu, Runjia, Gu, Fangqing, Liu, Hailin, & Shi, Hongjian. (2022). UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient. Wireless Communications and Mobile Computing, 2022.
MLA Wu, Runjia,et al."UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient". Wireless Communications and Mobile Computing 2022(2022).
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Wu, Runjia]的文章
[Gu, Fangqing]的文章
[Liu, Hailin]的文章
百度学术
百度学术中相似的文章
[Wu, Runjia]的文章
[Gu, Fangqing]的文章
[Liu, Hailin]的文章
必应学术
必应学术中相似的文章
[Wu, Runjia]的文章
[Gu, Fangqing]的文章
[Liu, Hailin]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。