Update README.md

In order to leverage instruction fine-tuning, your prompt should be surrounded by [INST] and [/INST] tokens.
E.g.
```
text = """<s>[INST] איזה רוטב אהוב עליך? [/INST]
טוב, אני די מחבב כמה טיפות מיץ לימון סחוט טרי. זה מוסיף בדיוק את הכמות הנכונה של טעם חמצמץ לכל מה שאני מבשל במטבח!</s>[INST] האם יש לך מתכונים למיונז? [/INST]"""
```
This format is available as a [chat template](https://huggingface.co/docs/transformers/main/chat_templating) via the `apply_chat_template()` method:
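Under the hood, the template simply folds the message list into the [INST]-delimited string shown above. The following is an illustrative pure-Python sketch of that mapping, not the tokenizer's actual Jinja template (exact whitespace may differ slightly):

```python
# Illustrative re-implementation of the [INST] chat format shown above.
# The authoritative source is the tokenizer's own chat template.
def build_prompt(messages):
    """Fold a user/assistant message list into a single [INST]-delimited string."""
    prompt = "<s>"
    for m in messages:
        if m["role"] == "user":
            prompt += f"[INST] {m['content']} [/INST]"
        else:  # assistant turn: the answer followed by the end-of-sequence token
            prompt += f" {m['content']}</s>"
    return prompt

messages = [
    {"role": "user", "content": "What's your favourite condiment?"},
    {"role": "assistant", "content": "I like lemon juice."},
    {"role": "user", "content": "Do you have mayonnaise recipes?"},
]
print(build_prompt(messages))
# <s>[INST] What's your favourite condiment? [/INST] I like lemon juice.</s>[INST] Do you have mayonnaise recipes? [/INST]
```

Note that the prompt always ends with an open `[INST] … [/INST]` user turn, so the model's continuation is the next assistant reply.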
## Example Code
Running this code requires less than 5GB of GPU VRAM.
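The sub-5GB figure is consistent with back-of-the-envelope math for 4-bit GPTQ weights on a ~7B-parameter model; the parameter count and overhead factor below are assumptions for illustration, not numbers published in this README:

```python
# Rough VRAM estimate for 4-bit GPTQ weights (assumed ~7B-parameter base model).
n_params = 7.2e9      # assumed parameter count
bits_per_weight = 4   # GPTQ 4-bit quantization
overhead = 1.2        # assumed ~20% extra for activations, KV cache, CUDA context

weight_gb = n_params * bits_per_weight / 8 / 1e9   # bits -> bytes -> GB
total_gb = weight_gb * overhead
print(f"~{total_gb:.1f} GB")
# ~4.3 GB
```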
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda"
model = AutoModelForCausalLM.from_pretrained("dicta-il/dictalm2.0-instruct-GPTQ", device_map=device)
tokenizer = AutoTokenizer.from_pretrained("dicta-il/dictalm2.0-instruct-GPTQ")
messages = [
    {"role": "user", "content": "איזה רוטב אהוב עליך?"},  # "What's your favorite condiment?"
    {"role": "assistant", "content": "טוב, אני די מחבב כמה טיפות מיץ לימון סחוט טרי. זה מוסיף בדיוק את הכמות הנכונה של טעם חמצמץ לכל מה שאני מבשל במטבח!"},  # "Well, I'm quite partial to a few drops of freshly squeezed lemon juice..."
    {"role": "user", "content": "האם יש לך מתכונים למיונז?"}  # "Do you have any mayonnaise recipes?"
]

encoded = tokenizer.apply_chat_template(messages, return_tensors="pt").to(device)

generated_ids = model.generate(encoded, max_new_tokens=50, do_sample=True)
decoded = tokenizer.batch_decode(generated_ids)
print(decoded[0])
# <s> [INST] איזה רוטב אהוב עליך? [/INST]
# טוב, אני די מחבב כמה טיפות מיץ לימון סחוט טרי. זה מוסיף בדיוק את הכמות הנכונה של טעם חמצמץ לכל מה שאני מבשל במטבח!</s> [INST] האם יש לך מתכונים למיונז? [/INST]
# בטח, הנה מתכון קל מאוד למיונז ביתי:
#
```
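Since `batch_decode` returns the full sequence, prompt included, the newly generated answer can be separated with plain string handling. This sketch assumes the `[/INST]` and `</s>` markers survive decoding, as in the sample output above:

```python
def last_answer(decoded_text):
    """Return the text after the final [/INST] marker, minus special tokens."""
    answer = decoded_text.rsplit("[/INST]", 1)[-1]
    return answer.replace("</s>", "").strip()

# Hypothetical decoded output used only to demonstrate the extraction.
sample = "<s> [INST] q1 [/INST] a1</s> [INST] q2 [/INST] a2</s>"
print(last_answer(sample))
# a2
```

Alternatively, slicing `generated_ids[0][encoded.shape[1]:]` before decoding skips the prompt tokens entirely.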