license: mit
tags:
  - world-model
  - robotic-manipulation
  - video-generation
  - video-prediction
  - gpt
base_model:
  - thuml/ivideogpt-oxe-64-act-free

iVideoGPT (fine-tuned on BAIR Robot Pushing, 64x64 resolution, action-free)

Fine-tuned model introduced in the paper iVideoGPT: Interactive VideoGPTs are Scalable World Models.

See https://github.com/thuml/iVideoGPT for examples of how to use this model.

Citation

@inproceedings{wu2024ivideogpt,
    title={iVideoGPT: Interactive VideoGPTs are Scalable World Models},
    author={Jialong Wu and Shaofeng Yin and Ningya Feng and Xu He and Dong Li and Jianye Hao and Mingsheng Long},
    booktitle={Advances in Neural Information Processing Systems},
    year={2024}
}