Update README.md (#5)
by MaziyarPanahi - opened

README.md CHANGED
@@ -27,9 +27,9 @@ datasets:
 <img src="./llama-3-merges.webp" alt="Llama-3 DPO Logo" width="500" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
 
 
-# Llama-3-8B-Instruct-DPO-v0.3
+# Llama-3-8B-Instruct-DPO-v0.3 (32k)
 
-This model is a fine-tune (DPO) of `meta-llama/Meta-Llama-3-8B-Instruct` model.
+This model is a fine-tune (DPO) of the `meta-llama/Meta-Llama-3-8B-Instruct` model. I have used `rope_theta` to extend the context length up to 32K safely.
 
 # How to use
 
@@ -86,7 +86,7 @@ terminators = [
 outputs = pipeline(
     prompt,
-    max_new_tokens=
+    max_new_tokens=8192,
     eos_token_id=terminators,
     do_sample=True,
     temperature=0.6,
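The `rope_theta` extension the new description mentions works by raising the base of the rotary position embedding (RoPE), which slows the rotation of the low-frequency dimensions and stretches the usable context window. A minimal sketch of that effect (the extended theta value of 4,000,000 here is a hypothetical illustration, not the value this PR used; Llama-3's default is 500,000 with a head dimension of 128):

```python
def rope_frequencies(head_dim: int, theta: float):
    # RoPE inverse frequencies: freq_i = theta ** (-2i / head_dim)
    # for i in 0 .. head_dim/2 - 1.
    return [theta ** (-2 * i / head_dim) for i in range(head_dim // 2)]

# Llama-3 default vs. a hypothetical extended base.
base = rope_frequencies(128, 500_000.0)
extended = rope_frequencies(128, 4_000_000.0)

# A larger theta lowers every non-trivial frequency, so positions far
# apart rotate less and long contexts stay within trained rotation ranges.
assert all(e <= b for b, e in zip(base, extended))
```

The highest-frequency dimension (`i = 0`) is unchanged at 1.0 in both cases; only the slower dimensions spread out, which is why this trick degrades short-context behavior relatively little.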