Update README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,5 @@
|
|
1 |
---
|
2 |
-
base_model:
|
3 |
inference: true
|
4 |
model_type: llama
|
5 |
datasets:
|
@@ -10,7 +10,7 @@ tags:
|
|
10 |
|
11 |
# Llama-2-7b-pruned70-retrained
|
12 |
|
13 |
-
This repo contains model files for a [Llama 2 7B](https://huggingface.co/meta-llama/Llama-2-7b-hf) model that has had
|
14 |
|
15 |
**Authors**: Neural Magic, Cerebras
|
16 |
|
|
|
1 |
---
|
2 |
+
base_model: neuralmagic/Llama-2-7b-prune50-retrained
|
3 |
inference: true
|
4 |
model_type: llama
|
5 |
datasets:
|
|
|
10 |
|
11 |
# Llama-2-7b-pruned70-retrained
|
12 |
|
13 |
+
This repo contains model files for a [Llama 2 7B](https://huggingface.co/meta-llama/Llama-2-7b-hf) model that has had 50% of the parameters pruned in one-shot with [SparseGPT](https://arxiv.org/abs/2301.00774), then retrained by [Cerebras](https://huggingface.co/cerebras) with 50B [UPDATE] from SlimPajama while maintaining sparsity. It was then one-shot pruned to 70% sparsity and trained for another 100B tokens.
|
14 |
|
15 |
**Authors**: Neural Magic, Cerebras
|
16 |
|