lm-eval
--model gguf
--model_args base_url=http://localhost:8000
--tasks hellaswag
--batch_size 1
--limit 10
Tasks | Version | Filter | n-shot | Metric | Value | Stderr | ||
---|---|---|---|---|---|---|---|---|
hellaswag | 1 | none | 0 | acc | ↑ | 0.3 | ± | 0.1528 |
none | 0 | acc_norm | ↑ | 0.4 | ± | 0.1633 |
- Downloads last month
- 5