---
base_model:
- thuml/bytesized32-world-model-sft
datasets:
- thuml/bytesized32-world-model-cot
license: mit
tags:
- text-game
- world-model
- rlvr
pipeline_tag: text-generation
library_name: transformers
---

[Project Page](https://thuml.github.io/RLVR-World/)

[GitHub Repository](https://github.com/thuml/RLVR-World)

See https://github.com/thuml/RLVR-World for examples of how to use this model.

## Citation

```
@article{wu2025rlvr,
  title={RLVR-World: Training World Models with Reinforcement Learning},
  author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long},
  journal={arXiv preprint arXiv:2505.13934},
  year={2025},
}
```