Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation
Bowen Xue and Qixin Yan and Wenjing Wang and Hao Liu and Chen Li
在生成式AI领域,生成与用户指定身份匹配的高保真人视频既重要又具有挑战性。现有方法通常依赖过多的训练参数,且缺乏与其他AIGC工具的兼容性。本文提出Stand-In,一种用于视频生成中身份保持的轻量级即插即用框架。具体而言,我们在预训练视频生成模型中引入了一个条件图像分支。通过带有条件位置映射的限制性self-attention实现身份控制,并且仅需2000对样本即可快速学习。尽管仅包含和训练约1
Generating high-fidelity human videos that match user-specified identities is important yet challenging in the field of generative AI. Existing methods often rely on an excessive number of training parameters and lack compatibility with other AIGC tools. In this paper, we propose Stand-In, a lightweight and plug-and-play framework for identity preservation in video generation. Specifically, we introduce a conditional image branch into the pre-trained video generation model. Identity control is a...