Publication status | Published |
Title | Prior knowledge guided text to image generation |
Authors | |
Publication date | 2024 |
Journal | Pattern Recognition Letters |
ISSN/eISSN | 0167-8655 |
Volume | 177, pages 89-95 |
Abstract | Generating a realistic and semantically consistent image from a given text is a challenging task. Due to the limited information of natural language, it is difficult to generate vivid images with fine details. To address this problem, we propose a Prior Knowledge Guided GAN for text to image generation. Specifically, the proposed method consists of several Knowledge Guided Up-Blocks. We decompose the image into a superposition of several visual regions, each of which requires corresponding prior knowledge to enrich its visual details. Correspondingly, we construct each Up-Block by incorporating relevant prior knowledge as input, aiming to enhance the quality of each visual region. Prior knowledge progressively provides more visual detail through affine transformations. Finally, high-quality images are synthesized by fusing all image regions. Experimental results on the CUB and COCO datasets demonstrate the superior performance of the proposed method. |
Keywords | Generative Adversarial Networks; Knowledge Guided GAN; Text-to-image synthesis |
DOI | 10.1016/j.patrec.2023.12.003 |
Indexed by | SCIE |
Language | English |
WOS research area | Computer Science |
WOS category | Computer Science, Artificial Intelligence |
WOS accession number | WOS:001135675000001 |
Scopus accession number | 2-s2.0-85179754774 |
Document type | Journal article |
Item identifier | https://repository.uic.edu.cn/handle/39GCC9TT/11067 |
Collection | Beijing Normal-Hong Kong Baptist University |
Corresponding author | Xu, Ning |
Author affiliations | 1. The School of Electrical and Information Engineering, Tianjin University, Tianjin, China; 2. The Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, China; 3. The 30th Research Institute of China Electronics Technology Corporation, Chengdu, China; 4. Kuaishou Technology, Beijing, China; 5. New Media Center, People's Daily, Beijing, China; 6. Beijing Normal University - Hong Kong Baptist University United International College (UIC), Guangzhou, China; 7. Baidu Inc., Beijing, China |
Recommended citation (GB/T 7714) | Liu, An-An, Sun, Zefang, Xu, Ning, et al. Prior knowledge guided text to image generation[J]. Pattern Recognition Letters, 2024, 177: 89-95. |
APA | Liu, An-An., Sun, Zefang., Xu, Ning., Kang, Rongbao., Cao, Jinbo., ... & Li, Xuanya. (2024). Prior knowledge guided text to image generation. Pattern Recognition Letters, 177, 89-95. |
MLA | Liu, An-An, et al. "Prior knowledge guided text to image generation." Pattern Recognition Letters 177 (2024): 89-95. |
Files in this item | No files are associated with this item. |
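
The abstract says that prior knowledge enriches each visual region "through affine transformations" and that the regions are then fused into the final image. This record does not give the Up-Block design, so the following is only a minimal sketch of a generic conditional affine modulation (scale and shift predicted from a knowledge vector) followed by a mask-weighted fusion. The function names, the dense layers that predict the scale and shift, and the per-position masks are all assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Sequence

Vector = List[float]
# One inner list per channel; each holds the flattened spatial positions.
FeatureMap = List[List[float]]


def linear(weights: Sequence[Sequence[float]], bias: Sequence[float],
           x: Sequence[float]) -> Vector:
    """Small dense layer: returns weights @ x + bias."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def affine_modulate(features: FeatureMap, knowledge: Sequence[float],
                    w_gamma, b_gamma, w_beta, b_beta) -> FeatureMap:
    """Hypothetical conditioning step: scale and shift every channel
    with parameters predicted from the knowledge vector."""
    gamma = linear(w_gamma, b_gamma, knowledge)  # per-channel scale
    beta = linear(w_beta, b_beta, knowledge)     # per-channel shift
    return [[g * v + b for v in channel]
            for channel, g, b in zip(features, gamma, beta)]


def fuse_regions(regions: Sequence[FeatureMap],
                 masks: Sequence[Sequence[float]]) -> FeatureMap:
    """Assumed fusion rule: superpose region feature maps, weighting
    each spatial position by that region's mask value."""
    channels, positions = len(regions[0]), len(regions[0][0])
    fused = [[0.0] * positions for _ in range(channels)]
    for region, mask in zip(regions, masks):
        for c in range(channels):
            for p in range(positions):
                fused[c][p] += mask[p] * region[c][p]
    return fused


# Toy example: 2 channels, 2 spatial positions, 2-dimensional knowledge vector.
features = [[1.0, 2.0], [3.0, 4.0]]
knowledge = [1.0, 0.5]
modulated = affine_modulate(
    features, knowledge,
    w_gamma=[[1.0, 0.0], [0.0, 2.0]], b_gamma=[0.0, 0.0],
    w_beta=[[0.0, 0.0], [0.0, 0.0]], b_beta=[0.5, -0.5],
)
# Position 0 comes from the modulated region, position 1 from the original.
fused = fuse_regions([modulated, features], masks=[[1.0, 0.0], [0.0, 1.0]])
print(modulated)
print(fused)
```

In a real generator the scale and shift would be learned, the feature maps would be tensors, and the masks would typically be soft; the sketch only shows how knowledge-conditioned scaling and region-wise superposition compose.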