CerebrumTech
/

cere-llama-3-8b-tr

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

oguzhandoganoglu commited on Aug 29

Commit

b075f1f

•

1 Parent(s): 443524a

Update README.md

Files changed (1) hide show

README.md +7 -7

README.md CHANGED Viewed

@@ -4,17 +4,17 @@ language:
 - tr
 ---
-<img src="https://cdn-uploads.huggingface.co/production/uploads/6639e48c27ef2d37a71eb4aa/Ds_KOVYwhRQ1FQY8S4WqO.png"
 alt="CEREBRUM LLM" width="420"/>
-# CERE V2 -LLMA-3.1-8b-TR
-This model is an fine-tuned version of a Llama3.1 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
 ## Model Details
-- **Base Model**: LLMA 3.1 8B based LLM
 - **Tokenizer Extension**: Specifically extended for Turkish
 - **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
 - **Training Method**: Initially with DORA, followed by fine-tuning with LORA
@@ -37,11 +37,11 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
 device = "cuda" # the device to load the model onto
 model = AutoModelForCausalLM.from_pretrained(
-    "Cerebrum/cere-llama-3.1-8B-tr",
     torch_dtype="auto",
     device_map="auto"
 )
-tokenizer = AutoTokenizer.from_pretrained("Cerebrum/cere-llama-3.1-8B-tr")
 prompt = "Python'da ekrana 'Merhaba Dünya' nasıl yazılır?"
 messages = [
@@ -68,4 +68,4 @@ generated_ids = [
 ]
 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
-```

 - tr
 ---
+<img src="https://huggingface.co/CerebrumTech/cere-llama-3-8b-tr/resolve/main/cere2.png"
 alt="CEREBRUM LLM" width="420"/>
+# CERE-LLMA-3-8b-TR
+This model is an fine-tuned version of a Llama3 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
 ## Model Details
+- **Base Model**: LLMA 3 7B based LLM
 - **Tokenizer Extension**: Specifically extended for Turkish
 - **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
 - **Training Method**: Initially with DORA, followed by fine-tuning with LORA
 device = "cuda" # the device to load the model onto
 model = AutoModelForCausalLM.from_pretrained(
+    "Cerebrum/cere-llama-3-8b-tr",
     torch_dtype="auto",
     device_map="auto"
 )
+tokenizer = AutoTokenizer.from_pretrained("Cerebrum/cere-llama-3-8b-tr")
 prompt = "Python'da ekrana 'Merhaba Dünya' nasıl yazılır?"
 messages = [
 ]
 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
+```