Status | 已发表Published |
Title | Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC |
Creator | |
Date Issued | 2021 |
Source Publication | IEEE Transactions on Multimedia
![]() |
ISSN | 1520-9210 |
Volume | 23Pages:1106-1121 |
Abstract | Rate control (RC) plays a critical role in the transmission of high-quality video data under certain bandwidth restrictions in High Efficiency Video Coding (HEVC). Most current HEVC RC algorithms based on spatiooral information for rate-distortion (R-D) model parameters cannot effectively handle the cases with dynamic video sequences that contain fast moving objects, significant object occlusion or scene changes. In this paper, we propose an RC method based on deep reinforcement learning (DRL) for dynamic video sequences in HEVC to improve the coding efficiency. First, the rate control problem is formulated as a Markov decision process (MDP) problem. Second, with the MDP model, we develop a DRL-based algorithm to find the optimal quantization parameters (QPs) by training a deep neural network. The resulting intelligent agent selects the optimal RC strategy to reduce distortion, buffer and quality fluctuations by observing the current state of the encoder. The asynchronous advantage actor-critic (A3C) method is used to solve the MDP problem. Finally, the proposed DRL-based RC method is implemented in the newest video coding standard. Experimental results show that the proposed method offers substantially enhanced RC accuracy and consistently outperforms HEVC reference software and other state-of-the-art algorithms. © 1999-2012 IEEE. |
Keyword | dynamically changing video rate control rate-distortion optimization Reinforcement learning |
DOI | 10.1109/TMM.2020.2992968 |
URL | View source |
Indexed By | SCIE |
Language | 英语English |
WOS Research Area | Computer Science ; Telecommunications |
WOS Subject | Computer Science, Information Systems ; Computer Science, Software Engineering ; Telecommunications |
WOS ID | WOS:000623420300022 |
SciVal Topic Prominence | T.5873 |
Citation statistics | |
Document Type | Journal article |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/1834 |
Collection | Graduate School |
Corresponding Author | Zhou, Mingliang; Kwong, Sam |
Affiliation | 1.School of Computer Science, Chongqing University, Chongqing, 400044, China 2.Department of Computer Science, City University of Hong Kong, Kowloon, 999077, Hong Kong 3.BNU-UIC Joint Ai Research Institute, Beijing Normal University and Uic (Zhuhai), Guangdong, 519087, China 4.School of Computer Science, Chongqing University, Chongqing, 400044, China |
Recommended Citation GB/T 7714 | Zhou, Mingliang,Wei, Xuekai,Kwong, Samet al. Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC[J]. IEEE Transactions on Multimedia, 2021, 23: 1106-1121. |
APA | Zhou, Mingliang, Wei, Xuekai, Kwong, Sam, Jia, Weijia, & Fang, Bin. (2021). Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC. IEEE Transactions on Multimedia, 23, 1106-1121. |
MLA | Zhou, Mingliang,et al."Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC". IEEE Transactions on Multimedia 23(2021): 1106-1121. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment