Reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs
rasdani
AI & ML interests: None yet
Recent Activity
liked a dataset (2 days ago): EssentialAI/essential-web-v1.0
liked a model (2 days ago): BAAI/RoboBrain2.0-7B
updated a dataset (3 days ago): rasdani/git-diff-Qwen-4B-rollouts