---
license: cc-by-nc-sa-4.0
pipeline_tag: text-generation
language:
- en
tags:
- finetuned
---

# Model Card for ZoyLLM-7B-SlimOrca

The ZoyLLM-7B-SlimOrca Large Language Model (LLM) is a generative text model, LoRA-finetuned on top of Mistral-7B-v0.1 as the base model.
The base model, Mistral-7B-v0.1, outperforms Llama 2 13B on all benchmarks its authors tested.
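
The model can be loaded with the standard `transformers` API. The sketch below is illustrative; the repo id is a placeholder, not a confirmed Hub path:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# NOTE: "zoyai/ZoyLLM-7B-SlimOrca" is a placeholder repo id; substitute
# the actual Hugging Face Hub path where this model is published.
model_id = "zoyai/ZoyLLM-7B-SlimOrca"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
```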

## Model Architecture

ZoyLLM-7B-SlimOrca is a transformer model with the following architecture choices:
- Grouped-Query Attention
- Sliding-Window Attention
- Byte-fallback BPE tokenizer

## Datasets
- Self-introduction (20 samples)
- SlimOrca (100k randomly sampled examples)
- EverythingLM v3

## Template
We finetuned the model using a template similar to the Dolphin (ChatML) chat template:
```
<|im_start|>system
{system}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant
```
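
For illustration, a single-turn prompt can be assembled by hand; the `build_prompt` helper below is a hypothetical example of ours, not part of the released code:

```python
def build_prompt(system: str, user: str) -> str:
    """Fill the single-turn ChatML-style template shown above."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        "<|im_start|>assistant\n"  # generation continues from here
    )

print(build_prompt("You are ZoyLLM, a helpful assistant.", "Introduce yourself."))
```

The resulting string can be tokenized and passed to `model.generate`, with the model completing the text after the final `<|im_start|>assistant` line.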

## Troubleshooting

- If you see the following error:
```
KeyError: 'mistral'
```
- Or:
```
NotImplementedError: Cannot copy out of meta tensor; no data!
```

Ensure you are using a stable release of Transformers, version 4.34.0 or newer.
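
As a quick check (a minimal sketch), you can verify the installed version from Python and upgrade with `pip install -U transformers` if it is older:

```python
import transformers

# Mistral support was added in Transformers 4.34.0; older versions raise
# KeyError: 'mistral' when resolving this model's configuration.
print(transformers.__version__)  # expect 4.34.0 or newer
```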

## The Zoy AI Team

Pham Tung Lam, Nguyen Duc Nhan.