--- license: mit tags: - world-model - robotic-manipulation - video-generation - video-prediction - gpt base_model: - thuml/ivideogpt-oxe-64-act-free --- # iVideoGPT (Fine-tuned to BAIR Robot Pushing, 64x64 resolution, action-free) Fine-tuned model introduced in the paper [iVideoGPT: Interactive VideoGPTs are Scalable World Models](https://arxiv.org/abs/2405.15223). See https://github.com/thuml/iVideoGPT for examples for using this model. ## Citation ``` @inproceedings{wu2024ivideogpt, title={iVideoGPT: Interactive VideoGPTs are Scalable World Models}, author={Jialong Wu and Shaofeng Yin and Ningya Feng and Xu He and Dong Li and Jianye Hao and Mingsheng Long}, booktitle={Advances in Neural Information Processing Systems}, year={2024} } ```