reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs
rasdani PRO
rasdani
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 9 hours ago
rasdani/SkyRL-v0-293-data-oracle-8k-context
updated
a dataset
about 10 hours ago
rasdani/SkyRL-v0-293-data-oracle
updated
a dataset
3 days ago
rasdani/SWE-bench_Verified_oracle_32k_100