科研成果详情

发表状态已发表Published
题名Prior knowledge guided text to image generation
作者
发表日期2024
发表期刊Pattern Recognition Letters
ISSN/eISSN0167-8655
卷号177页码:89-95
摘要

Generating a realistic and semantically consistent image from a given text is a challenging task. Due to the limited information of natural language, it is difficult to generate vivid images with fine details. To address this problem, we propose a Prior Knowledge Guided GAN for text to image generation. Specifically, the proposed method consists of several Knowledge Guided Up-Blocks. We decompose the image into a superposition of several visual regions, each of which requires corresponding prior knowledge to enrich its visual details. Correspondingly, we construct each Up-Block by incorporating relevant prior knowledge as input, aiming to enhance the quality of each visual region. Prior knowledge progressively provides more visual detail through affine transformations. Finally, high-quality images are synthesized by fusing all image regions. Experimental results on the CUB and COCO datasets demonstrate the superior performance of the proposed method.

关键词Generative Adversarial Networks Knowledge Guided GAN Text-to-image synthesis
DOI10.1016/j.patrec.2023.12.003
URL查看来源
收录类别SCIE
语种英语English
WOS研究方向Computer Science
WOS类目Computer Science, Artificial Intelligence
WOS记录号WOS:001135675000001
Scopus入藏号2-s2.0-85179754774
引用统计
被引频次:7[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/11067
专题北师香港浸会大学
通讯作者Xu, Ning
作者单位
1.The School of Electrical and Information Engineering,Tianjin University,Tianjin,China
2.The Institute of Artificial Intelligence,Hefei Comprehensive National Science Center,Hefei,China
3.The 30th Research Institute of China Electronics Technology Corporation,Chengdu,China
4.Kuaishou Technology,Beijing,China
5.New Media Center,People's Daily,Beijing,China
6.Beijing Normal University - Hong Kong Baptist University United International College (UIC),Guangzhou,China
7.Baidu Inc.,Beijing,China
推荐引用方式
GB/T 7714
Liu ,An-An,Sun, Zefang,Xu, Ninget al. Prior knowledge guided text to image generation[J]. Pattern Recognition Letters, 2024, 177: 89-95.
APA Liu ,An-An., Sun, Zefang., Xu, Ning., Kang, Rongbao., Cao, Jinbo., .. & Li, Xuanya. (2024). Prior knowledge guided text to image generation. Pattern Recognition Letters, 177, 89-95.
MLA Liu ,An-An,et al."Prior knowledge guided text to image generation". Pattern Recognition Letters 177(2024): 89-95.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Liu ,An-An]的文章
[Sun, Zefang]的文章
[Xu, Ning]的文章
百度学术
百度学术中相似的文章
[Liu ,An-An]的文章
[Sun, Zefang]的文章
[Xu, Ning]的文章
必应学术
必应学术中相似的文章
[Liu ,An-An]的文章
[Sun, Zefang]的文章
[Xu, Ning]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。