-
-
-
-
-
-
Inference Providers
Active filters:
full
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.15
Text Generation
•
Updated
•
6
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_qwen_bsz128_lr8e-6
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_qwen_bsz64_lr5e-6
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_qwen_bsz128_lr5e-6
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_qwen_bsz64_lr8e-6
Text Generation
•
Updated
•
6
mlfoundations-dev/hp_ablations_grid_qwen_bsz512_lr8e-6
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_qwen_bsz256_lr5e-6
Text Generation
•
Updated
•
8
mlfoundations-dev/hp_ablations_grid_qwen_bsz512_lr5e-6
Text Generation
•
Updated
•
7
mlfoundations-dev/hp_ablations_grid_qwen_bsz256_lr8e-6
Text Generation
•
Updated
•
7
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
•
Updated
•
9
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr2e-6_scheduler-cosine-warmup0.15
Text Generation
•
Updated
•
4
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.15
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
•
Updated
•
4
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
•
Updated
•
8
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
•
Updated
•
4
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
•
Updated
•
4
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
•
Updated
•
9
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.15
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.15
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
•
Updated
•
6
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7
Text Generation
•
Updated
•
6
mlfoundations-dev/llama3-1_8b_webinstruct_original_700k
Text Generation
•
Updated
•
7
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr5e-6_scheduler-cosine-warmup0.15
Text Generation
•
Updated
•
6
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr5e-6_scheduler-cosine-warmup0.15
Text Generation
•
Updated
•
5
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7
Text Generation
•
Updated
•
6