Data Generation
Paper • 2407.05282 • Published • 12Note UltraEdit, a large-scale (~4M editing samples), automatically generated dataset for instruction-based image editing. UltraEdit offers large-scale samples with rich editing tasks and fewer biases. Task: instruction-based image editing datasets
Training Task Experts through Retrieval Based Distillation
Paper • 2407.05463 • Published • 7Note Need to create high-quality, task-specific datasets but don’t have any existing datasets? Introducing ReBase! Our method retrieves diverse data samples from multiple datasets and transforms them to fit your needs.
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations
Paper • 2407.03471 • Published • 28Note AURORA Dataset (Action-Reasoning-Object-Attribute), a collection of high-quality training data, human-annotated and curated from videos and simulation engines. We focus on a key aspect of quality training data: triplets (source image, prompt, target image) contain a single meaningful visual change described by the prompt, i.e., truly minimal changes between source and target images.