license: other
language:
  - en

Model Details

This is an unofficial implementation of "AlpaGasus: Training a better Alpaca with Fewer Data" with LLaMA2 and QLoRA. Training code is available at our repo.

Benchmark Metrics

| Metric | Value |
|---|---|
| MMLU | 55.27 |
| ARC | 61.09 |
| HellaSwag | 82.46 |
| TruthfulQA | 38.53 |
| Avg. | 59.34 |
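
As a quick sanity check, the reported average appears to be the unweighted mean of the four benchmark scores. The sketch below recomputes it; the dictionary name `scores` is illustrative only and is not part of the training or evaluation code.

```python
from statistics import mean

# Benchmark scores copied from the table in this card.
scores = {
    "MMLU": 55.27,
    "ARC": 61.09,
    "HellaSwag": 82.46,
    "TruthfulQA": 38.53,
}

# Unweighted mean over the four benchmarks (assumed to be how "Avg." was derived).
average = mean(scores.values())
print(f"Average: {average:.4f}")
```

The result agrees with the reported 59.34 to within rounding.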

Training Dataset

"StudentLLM/Alpagasus-2-13b-QLoRA-merged" was trained on gpt4life's gpt-3.5-turbo-filtered dataset, 'alpaca_t45.json'. The configuration of the dataset is as follows:

Prompt Template

### Instruction:

<prompt> (without the <>)

### Response:
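
A minimal sketch of how the template above might be filled in before calling the model. The names `PROMPT_TEMPLATE` and `build_prompt` are hypothetical helpers, not part of the released training code, and the exact blank-line spacing is an assumption based on how the template is displayed in this card.

```python
# Alpaca-style template as shown in this card.
# Assumption: one blank line after each header and a trailing newline after "### Response:".
PROMPT_TEMPLATE = "### Instruction:\n\n{prompt}\n\n### Response:\n"


def build_prompt(instruction: str) -> str:
    """Wrap a user instruction in the template (hypothetical helper)."""
    # Strip surrounding whitespace so the spacing around the instruction stays consistent.
    return PROMPT_TEMPLATE.format(prompt=instruction.strip())


if __name__ == "__main__":
    print(build_prompt("Give three tips for staying healthy."))
```

The returned string can then be tokenized and passed to the model's generation call; the text the model produces after `### Response:` is the answer.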