ShanghaiTech University Knowledge Management System
MS-GAN: Text to image synthesis with attention-modulated generators and similarity-aware discriminators | |
2020 | |
Proceedings title | 30th British Machine Vision Conference, BMVC 2019 |
Publication status | Published |
Abstract | Existing approaches for text-to-image synthesis often produce images that either contain artifacts or do not well match the text, when the input text description is complex. In this paper, we propose a novel model named MS-GAN, composed of multi-stage attention-Modulated generators and Similarity-aware discriminators, to address these problems. Our proposed generator consists of multiple convolutional blocks that are modulated by both globally and locally attended features calculated between the output image and the text. With such an attention-modulation, our generator can better preserve the semantic information of the text during the text-to-image transformation. Moreover, we propose a similarity-aware discriminator to explicitly constrain the semantic consistency between the text and the synthesized image. Experimental results on Caltech-UCSD Birds and MS-COCO datasets demonstrate that our model can generate images that look more realistic and better match the given text description, compared to the state-of-the-art models. |
Conference location | Cardiff, United Kingdom |
Indexed by | EI |
Funding | [2017YFA070 0800] ; National Natural Science Foundation of China[61876171] ; Beijing Municipal Science and Technology Commission[Z181100003918012] |
Publisher | BMVA Press |
EI accession number | 20202708903151 |
EI subject terms | Discriminators ; Semantics |
EI classification codes | Modulators, Demodulators, Limiters, Discriminators, Mixers:713.3 ; Computer Applications:723.5 |
Original document type | Conference article (CA) |
Document type | Conference paper |
Item identifier | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/118931 |
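Collections | School of Information Science and Technology_Master's Students ; School of Information Science and Technology_Distinguished Adjunct Professor Group_Chen Xilin Group |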
Corresponding author | Ma, Bingpeng |
Author affiliations | 1.School of Information Science and Technology, ShanghaiTech University, Shanghai, China. 2.Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China. 3.University of Chinese Academy of Sciences, Beijing, China. 4.CAS Center for Excellence in Brain Science and Intelligence Technology, Shanghai, China. |
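First author affiliation | School of Information Science and Technology |
First affiliation of first author | School of Information Science and Technology |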
Recommended citation (GB/T 7714) | Mao, Fengling,Ma, Bingpeng,Chang, Hong,et al. MS-GAN: Text to image synthesis with attention-modulated generators and similarity-aware discriminators[C]:BMVA Press,2020. |