Model Description

This is EleutherAI's pythia-160m, re-uploaded unchanged as an exercise.

Evaluation Results

As required by the project, we used EleutherAI's lm-evaluation-harness to evaluate pythia-160m on the HellaSwag benchmark in a zero-shot setting.
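The exact command used for this run is not recorded here. As a rough sketch, an evaluation like this one is typically launched through the harness's `lm_eval` command-line entry point. The helper name `build_lm_eval_command` is hypothetical, and the repository id `EleutherAI/pythia-160m` is an assumption (the re-uploaded copy could be substituted).

```python
import shlex


def build_lm_eval_command(pretrained: str, task: str = "hellaswag",
                          num_fewshot: int = 0) -> list[str]:
    """Assemble an lm-evaluation-harness CLI invocation as an argument list.

    Hypothetical helper for illustration; it only builds the command and
    does not run the evaluation.
    """
    return [
        "lm_eval",
        "--model", "hf",                              # Hugging Face transformers backend
        "--model_args", f"pretrained={pretrained}",   # which checkpoint to load
        "--tasks", task,
        "--num_fewshot", str(num_fewshot),            # 0 = zero-shot
    ]


# Assumed repository id; swap in the re-uploaded model's id if preferred.
command = build_lm_eval_command("EleutherAI/pythia-160m")
print(shlex.join(command))
```

Passing the resulting list to `subprocess.run(command, check=True)` would start the evaluation, provided the harness is installed and the model can be downloaded.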

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
hellaswag	1	none	0	acc	↑	0.2872	±	0.0045
		none	0	acc_norm	↑	0.3082	±	0.0046
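
To put the reported standard errors in context, the sketch below turns each value into an approximate 95% confidence interval and compares it with random guessing. It assumes a normal approximation (z = 1.96) and a chance level of 0.25, since each HellaSwag item offers four candidate endings; neither assumption comes from the harness output itself.

```python
def confidence_interval(value: float, stderr: float, z: float = 1.96) -> tuple[float, float]:
    """Return an approximate confidence interval: value +/- z * stderr."""
    margin = z * stderr
    return value - margin, value + margin


# Assumed chance accuracy for a four-way multiple-choice task.
CHANCE = 0.25

# (value, stderr) pairs copied from the results table above.
results = {
    "acc": (0.2872, 0.0045),
    "acc_norm": (0.3082, 0.0046),
}

intervals = {name: confidence_interval(v, se) for name, (v, se) in results.items()}

for name, (low, high) in intervals.items():
    above = low > CHANCE
    print(f"{name}: [{low:.4f}, {high:.4f}] above chance: {above}")
```

Under these assumptions both intervals sit above 0.25, so the model scores modestly but measurably better than random guessing on this benchmark.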
