Status | 已发表Published |
Title | Spatiotemporal Feature Hierarchy-Based Blind Prediction of Natural Video Quality via Transfer Learning |
Creator | |
Date Issued | 2023-03-01 |
Source Publication | IEEE Transactions on Broadcasting
![]() |
ISSN | 0018-9316 |
Volume | 69Issue:1Pages:130-143 |
Abstract | In this paper, we propose a pyramidal spatiotemporal feature hierarchy (PSFH)-based no-reference (NR) video quality assessment (VQA) method using transfer learning. First, we generate simulated videos by a generative adversarial network (GAN)-based image restoration model. The residual maps between the distorted frames and simulated frames, which can capture rich information, are utilized as one input of the quality regression network. Second, we use 3D convolution operations to construct a PSFH network with five stages. The spatiotemporal features incorporating the shared features transferred from the pretrained image restoration model are fused stage by stage. Third, with the guidance of the transferred knowledge, each stage generates multiple feature mapping layers that encode different semantics and degradation information using 3D convolution layers and gated recurrent units (GRUs). Finally, five approximate perceptual quality scores and a precise prediction score are obtained by fully connected (FC) networks. The whole model is trained under a finely designed loss function that combines pseudo-Huber loss and Pearson linear correlation coefficient (PLCC) loss to improve the robustness and prediction accuracy. According to the extensive experiments, outstanding results can be obtained compared with other state-of-the-art methods. Both the source code and models are available online.1 |
Keyword | 3D convolution generative adversarial network pyramidal spatiotemporal feature transfer learning Video quality assessment |
DOI | 10.1109/TBC.2022.3192997 |
URL | View source |
Indexed By | SCIE |
Language | 英语English |
WOS Research Area | Engineering ; Telecommunications |
WOS Subject | Engineering, Electrical & Electronic ; Telecommunications |
WOS ID | WOS:000836588300001 |
Scopus ID | 2-s2.0-85135750036 |
Citation statistics | |
Document Type | Journal article |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/10304 |
Collection | Faculty of Science and Technology |
Corresponding Author | Zhou, Mingliang |
Affiliation | 1.Chongqing University,School of Computer Science,Chongqing,400044,China 2.City University of Hong Kong,Computer Science Department,Hong Kong,Hong Kong 3.Nanjing University of Science and Technology,School of Computer Science and Engineering,Nanjing,210094,China 4.Beijing Normal University,BNU-UIC Institute of Artificial Intelligence and Future Networks,Zhuhai,519088,China 5.BNU-HKBU United International College,Guangdong Key Laboratory of AI Multi-Modal Data Processing,Zhuhai,519087,China |
Recommended Citation GB/T 7714 | Xian, Weizhi,Zhou, Mingliang,Fang, Binet al. Spatiotemporal Feature Hierarchy-Based Blind Prediction of Natural Video Quality via Transfer Learning[J]. IEEE Transactions on Broadcasting, 2023, 69(1): 130-143. |
APA | Xian, Weizhi., Zhou, Mingliang., Fang, Bin., Liao, Xingran., Ji, Cheng., .. & Jia, Weijia. (2023). Spatiotemporal Feature Hierarchy-Based Blind Prediction of Natural Video Quality via Transfer Learning. IEEE Transactions on Broadcasting, 69(1), 130-143. |
MLA | Xian, Weizhi,et al."Spatiotemporal Feature Hierarchy-Based Blind Prediction of Natural Video Quality via Transfer Learning". IEEE Transactions on Broadcasting 69.1(2023): 130-143. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment