	---
	license: mit
	tags:
	- world-model
	- robotic-manipulation
	- video-generation
	- video-prediction
	- gpt
	base_model:
	- thuml/ivideogpt-oxe-64-act-free
	---

	# iVideoGPT (Fine-tuned to BAIR Robot Pushing, 64x64 resolution, action-free)

	Fine-tuned model introduced in the paper [iVideoGPT: Interactive VideoGPTs are Scalable World Models](https://arxiv.org/abs/2405.15223).

	See https://github.com/thuml/iVideoGPT for examples of how to use this model.
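
The GitHub repository above contains the actual loading and inference code. As a minimal sketch of just the preliminary step, fetching the files of this model repository, the following assumes the `huggingface_hub` package is installed. The helper name `download_checkpoint` and the default target directory are illustrative choices, not part of iVideoGPT.

```python
from pathlib import Path

# Repository ID of this model card on the Hugging Face Hub.
REPO_ID = "thuml/ivideogpt-bair-64-act-free"


def download_checkpoint(local_dir: str = "ivideogpt-bair-64-act-free") -> Path:
    """Download all files of the model repository and return the local folder.

    Hypothetical helper for illustration; it only fetches files and does not
    load the model. See the iVideoGPT GitHub repository for loading code.
    """
    # Imported lazily so this module can be imported without the dependency.
    from huggingface_hub import snapshot_download

    path = snapshot_download(repo_id=REPO_ID, local_dir=local_dir)
    return Path(path)


if __name__ == "__main__":
    checkpoint_dir = download_checkpoint()
    # List what was downloaded; the exact file names depend on the repository.
    print(sorted(p.name for p in checkpoint_dir.iterdir()))
```

The downloaded folder can then be passed to the scripts in the GitHub repository, following whatever arguments those scripts document.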

	## Citation

	```
	@inproceedings{wu2024ivideogpt,
	title={iVideoGPT: Interactive VideoGPTs are Scalable World Models},
	author={Jialong Wu and Shaofeng Yin and Ningya Feng and Xu He and Dong Li and Jianye Hao and Mingsheng Long},
	booktitle={Advances in Neural Information Processing Systems},
	year={2024}
	}
	```