UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient

doi:10.1155/2022/9017079

科研成果详情

发表状态	已发表Published
题名	UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient
作者	Wu, Runjia 1; Gu, Fangqing 1; Liu, Hailin 1; Shi, Hongjian2
发表日期	2022
发表期刊	Wireless Communications and Mobile Computing
ISSN/eISSN	1530-8669
卷号	2022
摘要	Deep deterministic policy gradient (DDPG) algorithm is a reinforcement learning method, which has been widely used in UAV path planning. However, the critic network of DDPG is frequently updated in the training process. It leads to an inevitable overestimation problem and increases the training computational complexity. Therefore, this paper presents a multicritic-delayed DDPG method for solving the UAV path planning. It uses multicritic networks and delayed learning methods to reduce the overestimation problem of DDPG and adds noise to improve the robustness in the real environment. Moreover, a UAV mission platform is built to train and evaluate the effectiveness and robustness of the proposed method. Simulation results show that the proposed algorithm has a higher convergence speed, a better convergence effect, and stability. It indicates that UAV can learn more knowledge from the complex environment.
DOI	10.1155/2022/9017079
URL	查看来源
收录类别	SCIE
语种	英语English
WOS研究方向	Computer Science ; Engineering ; Telecommunications
WOS类目	Computer Science, Information Systems ; Engineering, Electrical & Electronic ; Telecommunications
WOS记录号	WOS:000806502100016
Scopus入藏号	2-s2.0-85127888123
引用统计	被引频次：4[WOS] [WOS记录] [WOS相关记录]
文献类型	期刊论文
条目标识符	https://repository.uic.edu.cn/handle/39GCC9TT/8926
专题	理工科技学院
通讯作者	Gu, Fangqing
作者单位	1.School of Mathematics and Statistics,Guangdong University of Technology,Guangzhou,China 2.Beijing Normal University-Hong Kong Baptist University United International College,Zhuhai,China
推荐引用方式 GB/T 7714	Wu, Runjia,Gu, Fangqing,Liu, Hailinet al. UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient[J]. Wireless Communications and Mobile Computing, 2022, 2022.
APA	Wu, Runjia, Gu, Fangqing, Liu, Hailin, & Shi, Hongjian. (2022). UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient. Wireless Communications and Mobile Computing, 2022.
MLA	Wu, Runjia,et al."UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient". Wireless Communications and Mobile Computing 2022(2022).