Aurelien Lucchi
alucchi
·
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 6 hours ago
alucchi/grpo_mmeta-llama_dgsm8k_n200_e100_oadam5e-06_b8_8_a0.04
published
a dataset
about 6 hours ago
alucchi/grpo_mmeta-llama_dgsm8k_n200_e100_oadam5e-06_b8_8_a0.04
updated
a dataset
about 6 hours ago
alucchi/tmp3
Organizations
None yet
alucchi's activity
[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
21
#15 opened about 1 month ago
by
lewtun
