rshacter
/

rs_uplimit1_model

rshacter commited on Oct 21, 2024

Commit

c00127a

verified ·

1 Parent(s): 2a6422c

Create README.md

This is the first submission for the Uplimit Fine-tuning LLMs course: Evaluating a model

Files changed (1) hide show

README.md ADDED Viewed

+Model Card: Uplimit Project 1 part 1
+Model Description:
+This is a model to test run publishing models. It has no real model assessment value.
+This is a Large Language Model (LLM) trained on a dataset of DIBT/10k_prompts_ranked.
+It was evaluated using using Eleuther Evaluation Harness
+Hellaswag
+Passed argument batch_size = auto:4.0. Detecting largest batch size
+Determined largest batch size: 64
+Passed argument batch_size = auto:4.0. Detecting largest batch size
+Determined largest batch size: 64
+hf (pretrained=EleutherAI/pythia-160m,revision=step100000,dtype=float), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: auto:4 (64,64,64,64,64)
+|  Tasks  |Version|Filter|n-shot| Metric |   |Value |   |Stderr|
+|---------|------:|------|-----:|--------|---|-----:|---|-----:|
+|hellaswag|      1|none  |     0|acc     |↑  |0.2872|±  |0.0045|
+|         |       |none  |     0|acc_norm|↑  |0.3082|±  |0.0046|
+How to Use
+To use this model, simply download the checkpoint and load it into your preferred deep learning framework.