Details of Research Outputs

Status已发表Published
TitlePrior knowledge guided text to image generation
Creator
Date Issued2024
Source PublicationPattern Recognition Letters
ISSN0167-8655
Volume177Pages:89-95
Abstract

Generating a realistic and semantically consistent image from a given text is a challenging task. Due to the limited information of natural language, it is difficult to generate vivid images with fine details. To address this problem, we propose a Prior Knowledge Guided GAN for text to image generation. Specifically, the proposed method consists of several Knowledge Guided Up-Blocks. We decompose the image into a superposition of several visual regions, each of which requires corresponding prior knowledge to enrich its visual details. Correspondingly, we construct each Up-Block by incorporating relevant prior knowledge as input, aiming to enhance the quality of each visual region. Prior knowledge progressively provides more visual detail through affine transformations. Finally, high-quality images are synthesized by fusing all image regions. Experimental results on the CUB and COCO datasets demonstrate the superior performance of the proposed method.

KeywordGenerative Adversarial Networks Knowledge Guided GAN Text-to-image synthesis
DOI10.1016/j.patrec.2023.12.003
URLView source
Indexed BySCIE
Language英语English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence
WOS IDWOS:001135675000001
Scopus ID2-s2.0-85179754774
Citation statistics
Cited Times:7[WOS]   [WOS Record]     [Related Records in WOS]
Document TypeJournal article
Identifierhttp://repository.uic.edu.cn/handle/39GCC9TT/11067
CollectionBeijing Normal-Hong Kong Baptist University
Corresponding AuthorXu, Ning
Affiliation
1.The School of Electrical and Information Engineering,Tianjin University,Tianjin,China
2.The Institute of Artificial Intelligence,Hefei Comprehensive National Science Center,Hefei,China
3.The 30th Research Institute of China Electronics Technology Corporation,Chengdu,China
4.Kuaishou Technology,Beijing,China
5.New Media Center,People's Daily,Beijing,China
6.Beijing Normal University - Hong Kong Baptist University United International College (UIC),Guangzhou,China
7.Baidu Inc.,Beijing,China
Recommended Citation
GB/T 7714
Liu ,An-An,Sun, Zefang,Xu, Ninget al. Prior knowledge guided text to image generation[J]. Pattern Recognition Letters, 2024, 177: 89-95.
APA Liu ,An-An., Sun, Zefang., Xu, Ning., Kang, Rongbao., Cao, Jinbo., .. & Li, Xuanya. (2024). Prior knowledge guided text to image generation. Pattern Recognition Letters, 177, 89-95.
MLA Liu ,An-An,et al."Prior knowledge guided text to image generation". Pattern Recognition Letters 177(2024): 89-95.
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Liu ,An-An]'s Articles
[Sun, Zefang]'s Articles
[Xu, Ning]'s Articles
Baidu academic
Similar articles in Baidu academic
[Liu ,An-An]'s Articles
[Sun, Zefang]'s Articles
[Xu, Ning]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Liu ,An-An]'s Articles
[Sun, Zefang]'s Articles
[Xu, Ning]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.