|
--- |
|
license: mit |
|
tags: |
|
- world-model |
|
- robotic-manipulation |
|
- video-generation |
|
- video-prediction |
|
- gpt |
|
base_model: |
|
- thuml/ivideogpt-oxe-64-act-free |
|
--- |
|
|
|
# iVideoGPT (Fine-tuned to BAIR Robot Pushing, 64x64 resolution, action-free) |
|
|
|
Fine-tuned model introduced in the paper [iVideoGPT: Interactive VideoGPTs are Scalable World Models](https://arxiv.org/abs/2405.15223). |
|
|
|
See https://github.com/thuml/iVideoGPT for examples for using this model. |
|
|
|
## Citation |
|
|
|
``` |
|
@inproceedings{wu2024ivideogpt, |
|
title={iVideoGPT: Interactive VideoGPTs are Scalable World Models}, |
|
author={Jialong Wu and Shaofeng Yin and Ningya Feng and Xu He and Dong Li and Jianye Hao and Mingsheng Long}, |
|
booktitle={Advances in Neural Information Processing Systems}, |
|
year={2024} |
|
} |
|
``` |