thuml/bytesized32-world-model-rlvr-binary-reward
THUML @ Tsinghua University
Safetensors
thuml/bytesized32-world-model-cot
qwen2
text-game
world-model
rlvr
arxiv: 2505.13934
License: mit
Files and versions
Branch: main
2 contributors
History: 4 commits
Latest commit: manchery, "Update README.md" (501a03c, verified), 7 days ago
File                      Size       Flags      Last commit       Updated
.gitattributes            1.57 kB    Safe       Upload 6 files    7 days ago
README.md                 486 Bytes             Update README.md  7 days ago
config.json               790 Bytes             Upload 6 files    7 days ago
generation_config.json    190 Bytes  Safe       Upload 6 files    7 days ago
model.safetensors         3.55 GB    LFS        Upload 6 files    7 days ago
special_tokens_map.json   508 Bytes  Safe       Upload 6 files    7 days ago
tokenizer.json            11.4 MB    Safe, LFS  Upload 6 files    7 days ago
tokenizer_config.json     6.96 kB    Safe       Upload 6 files    7 days ago