princeton-nlp committed ed635e9 (parent: a437579): Update README.md
**License**: Must comply with the license of Llama2, since this model is derived from Llama2.
Sheared-LLaMA-2.7B-Pruned is the model pruned from [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) **without continued pre-training**.
We used roughly 0.4B tokens to perform the pruning experiment. This model could be useful for studying:

- effective data mixtures for continued pre-training
- comparisons to other pruning techniques
- extensive evaluations to understand how pruning affects the knowledge and reasoning capabilities of LLMs
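As a starting point for such experiments, the checkpoint can be loaded like any other causal LM with `transformers`. This is a minimal sketch, assuming the model is hosted on the Hub under `princeton-nlp/Sheared-LLaMA-2.7B-Pruned` and that you have accepted the Llama2 license terms:

```python
# Minimal usage sketch (assumed Hub id; requires transformers and Llama2 access).
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "princeton-nlp/Sheared-LLaMA-2.7B-Pruned"


def generate(prompt: str, max_new_tokens: int = 20) -> str:
    """Load the pruned checkpoint and decode a short continuation."""
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate("The capital of France is"))
```

Note that, because this checkpoint has had no continued pre-training, generation quality is expected to lag behind the fully trained Sheared-LLaMA models.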