mlfoundations-dev/hp_ablations_gemma_adambeta2_0.995_dcftv1.2 Text Generation • Updated 19 days ago • 14
mlfoundations-dev/hp_ablations_gemma_adambeta1_0.9_dcftv1.2 Text Generation • Updated 19 days ago • 20
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.999_dcftv1.2 Text Generation • Updated 19 days ago • 21
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.99_dcftv1.2 Text Generation • Updated 19 days ago • 20
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.95_dcftv1.2 Text Generation • Updated 19 days ago • 18
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.98_dcftv1.2 Text Generation • Updated 19 days ago • 16
mlfoundations-dev/hp_ablations_gemma_scheduler_constant_dcftv1.2 Text Generation • Updated 19 days ago • 14
mlfoundations-dev/hp_ablations_gemma_adambeta2_0.9995_dcftv1.2 Text Generation • Updated 19 days ago • 18
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_dcftv1.2 Text Generation • Updated 19 days ago • 20
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-6_dcftv1.2 Text Generation • Updated 19 days ago • 27
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-7_dcftv1.2 Text Generation • Updated 19 days ago • 18
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-6_dcftv1.2 Text Generation • Updated 19 days ago • 16
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr5e-7_dcftv1.2 Text Generation • Updated 19 days ago • 17
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-7_dcftv1.2 Text Generation • Updated 19 days ago • 40
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr5e-7_dcftv1.2 Text Generation • Updated 19 days ago • 22
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_dcftv1.2 Text Generation • Updated 19 days ago • 23
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.15_dcftv1.2 Text Generation • Updated 19 days ago • 19
mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.10_dcftv1.2 Text Generation • Updated 19 days ago • 22
mlfoundations-dev/hp_ablations_gemma_scheduler_inverse_sqrt_dcftv1.2 Text Generation • Updated 19 days ago • 16
mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.05_dcftv1.2 Text Generation • Updated 19 days ago • 16