base_model: | |
- thuml/bytesized32-world-model-sft | |
datasets: | |
- thuml/bytesized32-world-model-cot | |
license: mit | |
tags: | |
- text-game | |
- world-model | |
- rlvr | |
pipeline_tag: text-generation | |
library_name: transformers | |
[Project Page](https://thuml.github.io/RLVR-World/) | |
[Github Repository](https://github.com/thuml/RLVR-World) | |
See https://github.com/thuml/RLVR-World for examples for using this model. | |
## Citation | |
``` | |
@article{wu2025rlvr, | |
title={RLVR-World: Training World Models with Reinforcement Learning}, | |
author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long}, | |
journal={arXiv preprint arXiv:2505.13934}, | |
year={2025}, | |
} | |
``` |