InsEdit: Towards Instruction-based Visual Editing via Data-Efficient Video Diffusion Models Adaptation
arXiv:2604.08646v1 Announce Type: new Abstract: Instruction-based video editing is a natural way to control video content with text, but adapting a video generation model into an editor usually appears data-hungry. At the same time, high-quality video editing data remains scarce. In this paper, we show that a video generation backbone can become a strong video editor without large scale video editing data. We present InsEdit, an instruction-based editing model built on HunyuanVideo-1.5. InsEdit combines a visual editing architecture […]