Model Description

This is the pythia-160m from EleutherAI re-uploaded as an exercise.

Evaluation Results

According to project requirement, we used lm-evalutation-harness from EleutherAI to evaluate pythia-160m on the 'Hellaswag' benchmark.

Hellaswag

Tasks Version Filter n-shot Metric Value Stderr
hellaswag 1 none 0 acc ↑ 0.2872 ± 0.0045
none 0 acc_norm ↑ 0.3082 ± 0.0046
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DamiFass/pythia-160m-Project-week1

Finetuned
(88)
this model