Aurelien Lucchi

alucchi

AI & ML interests

None yet

Recent Activity

updated a dataset about 6 hours ago

alucchi/grpo_mmeta-llama_dgsm8k_n200_e100_oadam5e-06_b8_8_a0.04

published a dataset about 6 hours ago

alucchi/grpo_mmeta-llama_dgsm8k_n200_e100_oadam5e-06_b8_8_a0.04

updated a dataset about 6 hours ago

alucchi/tmp3

Organizations

None yet

alucchi's activity

New activity in open-r1/README about 1 month ago

[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO

#15 opened about 1 month ago by