Safetensors
qwen2
text-game
world-model
rlvr
manchery commited on
Commit
501a03c
·
verified ·
1 Parent(s): a6dca4a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -1
README.md CHANGED
@@ -3,6 +3,11 @@ license: mit
3
  tags:
4
  - text-game
5
  - world-model
 
 
 
 
 
6
  ---
7
  See https://github.com/thuml/RLVR-World for examples for using this model.
8
 
@@ -14,4 +19,4 @@ See https://github.com/thuml/RLVR-World for examples for using this model.
14
  author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long},
15
  journal={arXiv preprint arXiv:2505.13934},
16
  year={2025},
17
- }
 
3
  tags:
4
  - text-game
5
  - world-model
6
+ - rlvr
7
+ datasets:
8
+ - thuml/bytesized32-world-model-cot
9
+ base_model:
10
+ - thuml/bytesized32-world-model-sft
11
  ---
12
  See https://github.com/thuml/RLVR-World for examples for using this model.
13
 
 
19
  author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long},
20
  journal={arXiv preprint arXiv:2505.13934},
21
  year={2025},
22
+ }