Details of Research Outputs

Status已发表Published
TitleSpatiotemporal Feature Hierarchy-Based Blind Prediction of Natural Video Quality via Transfer Learning
Creator
Date Issued2023-03-01
Source PublicationIEEE Transactions on Broadcasting
ISSN0018-9316
Volume69Issue:1Pages:130-143
Abstract

In this paper, we propose a pyramidal spatiotemporal feature hierarchy (PSFH)-based no-reference (NR) video quality assessment (VQA) method using transfer learning. First, we generate simulated videos by a generative adversarial network (GAN)-based image restoration model. The residual maps between the distorted frames and simulated frames, which can capture rich information, are utilized as one input of the quality regression network. Second, we use 3D convolution operations to construct a PSFH network with five stages. The spatiotemporal features incorporating the shared features transferred from the pretrained image restoration model are fused stage by stage. Third, with the guidance of the transferred knowledge, each stage generates multiple feature mapping layers that encode different semantics and degradation information using 3D convolution layers and gated recurrent units (GRUs). Finally, five approximate perceptual quality scores and a precise prediction score are obtained by fully connected (FC) networks. The whole model is trained under a finely designed loss function that combines pseudo-Huber loss and Pearson linear correlation coefficient (PLCC) loss to improve the robustness and prediction accuracy. According to the extensive experiments, outstanding results can be obtained compared with other state-of-the-art methods. Both the source code and models are available online.1

Keyword3D convolution generative adversarial network pyramidal spatiotemporal feature transfer learning Video quality assessment
DOI10.1109/TBC.2022.3192997
URLView source
Indexed BySCIE
Language英语English
WOS Research AreaEngineering ; Telecommunications
WOS SubjectEngineering, Electrical & Electronic ; Telecommunications
WOS IDWOS:000836588300001
Scopus ID2-s2.0-85135750036
Citation statistics
Cited Times:11[WOS]   [WOS Record]     [Related Records in WOS]
Document TypeJournal article
Identifierhttp://repository.uic.edu.cn/handle/39GCC9TT/10304
CollectionFaculty of Science and Technology
Corresponding AuthorZhou, Mingliang
Affiliation
1.Chongqing University,School of Computer Science,Chongqing,400044,China
2.City University of Hong Kong,Computer Science Department,Hong Kong,Hong Kong
3.Nanjing University of Science and Technology,School of Computer Science and Engineering,Nanjing,210094,China
4.Beijing Normal University,BNU-UIC Institute of Artificial Intelligence and Future Networks,Zhuhai,519088,China
5.BNU-HKBU United International College,Guangdong Key Laboratory of AI Multi-Modal Data Processing,Zhuhai,519087,China
Recommended Citation
GB/T 7714
Xian, Weizhi,Zhou, Mingliang,Fang, Binet al. Spatiotemporal Feature Hierarchy-Based Blind Prediction of Natural Video Quality via Transfer Learning[J]. IEEE Transactions on Broadcasting, 2023, 69(1): 130-143.
APA Xian, Weizhi., Zhou, Mingliang., Fang, Bin., Liao, Xingran., Ji, Cheng., .. & Jia, Weijia. (2023). Spatiotemporal Feature Hierarchy-Based Blind Prediction of Natural Video Quality via Transfer Learning. IEEE Transactions on Broadcasting, 69(1), 130-143.
MLA Xian, Weizhi,et al."Spatiotemporal Feature Hierarchy-Based Blind Prediction of Natural Video Quality via Transfer Learning". IEEE Transactions on Broadcasting 69.1(2023): 130-143.
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Xian, Weizhi]'s Articles
[Zhou, Mingliang]'s Articles
[Fang, Bin]'s Articles
Baidu academic
Similar articles in Baidu academic
[Xian, Weizhi]'s Articles
[Zhou, Mingliang]'s Articles
[Fang, Bin]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Xian, Weizhi]'s Articles
[Zhou, Mingliang]'s Articles
[Fang, Bin]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.