DreamOmni2: Multimodal Instruction-based Editing and Generation

This repository contains DreamOmni2, a family of multimodal autoregressive models that handle a range of vision and language tasks and excel in particular at multimodal instruction-based editing and generation. Both tasks accept text and image instructions and cover abstract as well as concrete concepts, which greatly broadens their practical applications. For traditional subject-driven generation based on concr...