| |||||||
ShanghaiTech University Knowledge Management System
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation | |
2023-12-26 | |
会议录名称 | ARXIV |
ISSN | 1063-6919 |
发表状态 | 已发表 |
DOI | arXiv:2312.16272 |
摘要 | Recent advancements in subject-driven image generation have led to zero-shot generation, yet precise selection and focus on crucial subject representations remain challenging. Addressing this, we introduce the SSR-Encoder, a novel architecture designed for selectively capturing any subject from single or multiple reference images. It responds to various query modalities including text and masks, without necessitating test-time fine-tuning. The SSR-Encoder combines a Token-to-Patch Aligner that aligns query inputs with image patches and a Detail-Preserving Subject Encoder for extracting and preserving fine features of the subjects, thereby generating subject embeddings. These embeddings, used in conjunction with original text embeddings, condition the generation process. Characterized by its model generalizability and efficiency, the SSR-Encoder adapts to a range of custom models and control modules. Enhanced by the Embedding Consistency Regularization Loss for improved training, our extensive experiments demonstrate its effectiveness in versatile and high-quality image generation, indicating its broad applicability. |
会议地点 | Seattle, WA, USA |
会议日期 | 16-22 June 2024 |
URL | 查看原文 |
WOS类目 | Computer Science, Software Engineering |
WOS记录号 | PPRN:86852415 |
来源库 | IEEE |
文献类型 | 会议论文 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/349900 |
专题 | 信息科学与技术学院_硕士生 |
作者单位 | 1.Shanghai Jiao Tong Univ, Shanghai, Peoples R China 2.Xiaohongshu Inc, Shanghai, Peoples R China 3.Beijing Univ Posts&Telecommunicat, Beijing, Peoples R China 4.Carnegie Mellon Univ, Pittsburgh, PA, USA 5.Shanghai Tech Univ, Shanghai, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, Yuxuan,Liu, Jiaming,Song, Yiren,et al. SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation[C],2023. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。