Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC

doi:10.1109/TMM.2020.2992968

Details of Research Outputs

Status	已发表Published
Title	Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC
Creator	Zhou, Mingliang 1; Wei, Xuekai 2; Kwong, Sam 2; Jia, Weijia3 ; Fang, Bin 4
Date Issued	2021
Source Publication	IEEE Transactions on Multimedia
ISSN	1520-9210
Volume	23 Pages:1106-1121
Abstract	Rate control (RC) plays a critical role in the transmission of high-quality video data under certain bandwidth restrictions in High Efficiency Video Coding (HEVC). Most current HEVC RC algorithms based on spatiooral information for rate-distortion (R-D) model parameters cannot effectively handle the cases with dynamic video sequences that contain fast moving objects, significant object occlusion or scene changes. In this paper, we propose an RC method based on deep reinforcement learning (DRL) for dynamic video sequences in HEVC to improve the coding efficiency. First, the rate control problem is formulated as a Markov decision process (MDP) problem. Second, with the MDP model, we develop a DRL-based algorithm to find the optimal quantization parameters (QPs) by training a deep neural network. The resulting intelligent agent selects the optimal RC strategy to reduce distortion, buffer and quality fluctuations by observing the current state of the encoder. The asynchronous advantage actor-critic (A3C) method is used to solve the MDP problem. Finally, the proposed DRL-based RC method is implemented in the newest video coding standard. Experimental results show that the proposed method offers substantially enhanced RC accuracy and consistently outperforms HEVC reference software and other state-of-the-art algorithms. © 1999-2012 IEEE.
Keyword	dynamically changing video rate control rate-distortion optimization Reinforcement learning
DOI	10.1109/TMM.2020.2992968
URL	View source
Indexed By	SCIE
Language	英语English
WOS Research Area	Computer Science ; Telecommunications
WOS Subject	Computer Science, Information Systems ; Computer Science, Software Engineering ; Telecommunications
WOS ID	WOS:000623420300022
SciVal Topic Prominence	T.5873
Citation statistics	Cited Times:46[WOS] [WOS Record] [Related Records in WOS]
Document Type	Journal article
Identifier	http://repository.uic.edu.cn/handle/39GCC9TT/1834
Collection	Graduate School
Corresponding Author	Zhou, Mingliang; Kwong, Sam
Affiliation	1.School of Computer Science, Chongqing University, Chongqing, 400044, China 2.Department of Computer Science, City University of Hong Kong, Kowloon, 999077, Hong Kong 3.BNU-UIC Joint Ai Research Institute, Beijing Normal University and Uic (Zhuhai), Guangdong, 519087, China 4.School of Computer Science, Chongqing University, Chongqing, 400044, China
Recommended Citation GB/T 7714	Zhou, Mingliang,Wei, Xuekai,Kwong, Samet al. Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC[J]. IEEE Transactions on Multimedia, 2021, 23: 1106-1121.
APA	Zhou, Mingliang, Wei, Xuekai, Kwong, Sam, Jia, Weijia, & Fang, Bin. (2021). Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC. IEEE Transactions on Multimedia, 23, 1106-1121.
MLA	Zhou, Mingliang,et al."Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC". IEEE Transactions on Multimedia 23(2021): 1106-1121.

Files in This Item:
There are no files associated with this item.

Related Services

Usage statistics

Google Scholar

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Institution:
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh