Status | 已发表Published |
Title | Prior knowledge guided text to image generation |
Creator | |
Date Issued | 2024 |
Source Publication | Pattern Recognition Letters
![]() |
ISSN | 0167-8655 |
Volume | 177Pages:89-95 |
Abstract | Generating a realistic and semantically consistent image from a given text is a challenging task. Due to the limited information of natural language, it is difficult to generate vivid images with fine details. To address this problem, we propose a Prior Knowledge Guided GAN for text to image generation. Specifically, the proposed method consists of several Knowledge Guided Up-Blocks. We decompose the image into a superposition of several visual regions, each of which requires corresponding prior knowledge to enrich its visual details. Correspondingly, we construct each Up-Block by incorporating relevant prior knowledge as input, aiming to enhance the quality of each visual region. Prior knowledge progressively provides more visual detail through affine transformations. Finally, high-quality images are synthesized by fusing all image regions. Experimental results on the CUB and COCO datasets demonstrate the superior performance of the proposed method. |
Keyword | Generative Adversarial Networks Knowledge Guided GAN Text-to-image synthesis |
DOI | 10.1016/j.patrec.2023.12.003 |
URL | View source |
Indexed By | SCIE |
Language | 英语English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence |
WOS ID | WOS:001135675000001 |
Scopus ID | 2-s2.0-85179754774 |
Citation statistics | |
Document Type | Journal article |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/11067 |
Collection | Beijing Normal-Hong Kong Baptist University |
Corresponding Author | Xu, Ning |
Affiliation | 1.The School of Electrical and Information Engineering,Tianjin University,Tianjin,China 2.The Institute of Artificial Intelligence,Hefei Comprehensive National Science Center,Hefei,China 3.The 30th Research Institute of China Electronics Technology Corporation,Chengdu,China 4.Kuaishou Technology,Beijing,China 5.New Media Center,People's Daily,Beijing,China 6.Beijing Normal University - Hong Kong Baptist University United International College (UIC),Guangzhou,China 7.Baidu Inc.,Beijing,China |
Recommended Citation GB/T 7714 | Liu ,An-An,Sun, Zefang,Xu, Ninget al. Prior knowledge guided text to image generation[J]. Pattern Recognition Letters, 2024, 177: 89-95. |
APA | Liu ,An-An., Sun, Zefang., Xu, Ning., Kang, Rongbao., Cao, Jinbo., .. & Li, Xuanya. (2024). Prior knowledge guided text to image generation. Pattern Recognition Letters, 177, 89-95. |
MLA | Liu ,An-An,et al."Prior knowledge guided text to image generation". Pattern Recognition Letters 177(2024): 89-95. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment