metadata
license: mit
datasets:
- McGill-NLP/AURORA
language:
- en
library_name: diffusers
pipeline_tag: image-to-image
tags:
- editing
- vision-language
For more details: https://github.com/McGill-NLP/AURORA or read the paper: https://arxiv.org/abs/2407.03471
Citation
@inproceedings{krojer2024aurora,
author={Benno Krojer and Dheeraj Vattikonda and Luis Lara and Varun Jampani and Eva Portelance and Christopher Pal and Siva Reddy},
title={{Learning Action and Reasoning-Centric Image Editing from Videos and Simulations}},
booktitle={NeurIPS},
year={2024},
note={Spotlight Paper},
url={https://arxiv.org/abs/2407.03471}
}