lm-eval \
  --model gguf \
  --model_args base_url=http://localhost:8000 \
  --tasks hellaswag \
  --batch_size 1 \
  --limit 10

|  Tasks  |Version|Filter|n-shot| Metric |   |Value|   |Stderr|
|---------|------:|------|-----:|--------|---|----:|---|-----:|
|hellaswag|      1|none  |     0|acc     |↑  |  0.3|±  |0.1528|
|         |       |none  |     0|acc_norm|↑  |  0.4|±  |0.1633|
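
The Stderr values above are consistent with the standard error of the mean over the 10 examples selected by `--limit 10`. The following is a minimal sketch, assuming each example is scored 0 or 1 and the sample variance uses the n - 1 denominator; `binary_mean_stderr` is a hypothetical helper written for illustration, not part of lm-eval.

```python
import math


def binary_mean_stderr(accuracy: float, n: int) -> float:
    """Standard error of the mean of n 0/1 scores whose mean is `accuracy`.

    Assumption: the sample variance uses the n - 1 (Bessel) denominator.
    """
    if n < 2:
        raise ValueError("need at least two examples")
    # For 0/1 scores, sum((x - mean)^2) equals n * p * (1 - p).
    sample_variance = n * accuracy * (1.0 - accuracy) / (n - 1)
    return math.sqrt(sample_variance / n)


n = 10  # number of examples, from --limit 10
print(round(binary_mean_stderr(0.3, n), 4))  # acc
print(round(binary_mean_stderr(0.4, n), 4))  # acc_norm
```

With only 10 examples the standard error is large relative to the scores, so these numbers are best read as a smoke test of the setup rather than a benchmark result.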

GGUF

Model size: 1.24B params
Architecture: llama
Quantization: 4-bit
