MS-GAN: Text to image synthesis with attention-modulated generators and similarity-aware discriminators
2020
Proceedings title: 30th British Machine Vision Conference, BMVC 2019
Publication status: Published
Abstract

When the input text description is complex, existing approaches for text-to-image synthesis often produce images that either contain artifacts or match the text poorly. In this paper, we propose a novel model named MS-GAN, composed of multi-stage attention-Modulated generators and Similarity-aware discriminators, to address these problems. Our proposed generator consists of multiple convolutional blocks that are modulated by both globally and locally attended features calculated between the output image and the text. With this attention modulation, our generator can better preserve the semantic information of the text during the text-to-image transformation. Moreover, we propose a similarity-aware discriminator to explicitly constrain the semantic consistency between the text and the synthesized image. Experimental results on the Caltech-UCSD Birds and MS-COCO datasets demonstrate that, compared to state-of-the-art models, our model generates images that look more realistic and better match the given text description.
© 2019. The copyright of this document resides with its authors.
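
The abstract names two ideas without giving formulas: features modulated by locally and globally attended text, and a discriminator that scores text-image similarity. The following is a minimal toy sketch of those two ideas in plain Python, not the authors' implementation. Every function name, the dot-product attention, the scale-and-shift form of `modulate`, and the cosine-based `similarity_score` are assumptions made for illustration only.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cosine_similarity(a, b):
    norm = math.sqrt(dot(a, a)) * math.sqrt(dot(b, b))
    return dot(a, b) / norm if norm > 0 else 0.0

def attend_words(region, words):
    """Local attention (assumed form): weight each word vector by the
    softmax of its dot product with one image-region feature."""
    weights = softmax([dot(region, w) for w in words])
    return [sum(wt * w[i] for wt, w in zip(weights, words))
            for i in range(len(words[0]))]

def modulate(region, local_ctx, global_ctx):
    """Hypothetical modulation: scale by the word-level context and
    shift by the sentence-level (global) context."""
    return [r * (1.0 + l) + g
            for r, l, g in zip(region, local_ctx, global_ctx)]

def similarity_score(regions, sentence):
    """Hypothetical similarity term: cosine similarity between the
    average-pooled image features and a sentence embedding."""
    dim = len(sentence)
    pooled = [sum(r[i] for r in regions) / len(regions) for i in range(dim)]
    return cosine_similarity(pooled, sentence)

# Toy data: two word vectors, one sentence vector, two image regions.
words = [[1.0, 0.0], [0.0, 1.0]]
sentence = [0.5, 0.5]
regions = [[2.0, 0.0], [0.0, 2.0]]

modulated = [modulate(r, attend_words(r, words), sentence) for r in regions]
matched = similarity_score(modulated, sentence)
mismatched = similarity_score(modulated, [1.0, -1.0])
```

In this toy setup the score for the matching sentence vector is higher than the score for the mismatched one, which is the kind of signal a similarity-aware discriminator could use to penalize images that drift from the text.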

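Conference location: Cardiff, United Kingdom
Indexed by: EI
Funding projects: [2017YFA070 0800]; National Natural Science Foundation of China [61876171]; Beijing Municipal Science and Technology Commission [Z181100003918012]
Publisher: BMVA Press
EI accession number: 20202708903151
EI subject terms: Discriminators; Semantics
EI classification codes: Modulators, Demodulators, Limiters, Discriminators, Mixers: 713.3; Computer Applications: 723.5
Original document type: Conference article (CA)
Document type: Conference paper
Item identifier: https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/118931
Collections: School of Information Science and Technology_Master's Students
School of Information Science and Technology_Distinguished Professors Group_Chen Xilin Group
Corresponding author: Ma, Bingpeng
Author affiliations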
1.School of Information Science and Technology, ShanghaiTech University, Shanghai, China.
2.Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China.
3.University of Chinese Academy of Sciences, Beijing, China.
4.CAS Center for Excellence in Brain Science and Intelligence Technology, Shanghai, China.
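First author affiliation: School of Information Science and Technology
First author's primary affiliation: School of Information Science and Technology
Recommended citation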
GB/T 7714
Mao, Fengling,Ma, Bingpeng,Chang, Hong,et al. MS-GAN: Text to image synthesis with attention-modulated generators and similarity-aware discriminators[C]:BMVA Press,2020.