CerebrumTech
/

cere-llama-3-8b-tr

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

oguzhandoganoglu commited on Aug 29

Commit

9594dad

•

1 Parent(s): 93ecc1e

Update README.md

Files changed (1) hide show

README.md +8 -5

README.md CHANGED Viewed

@@ -6,13 +6,16 @@ language:
 ---
 <img src="https://huggingface.co/CerebrumTech/cere-llama-3-8b-tr/resolve/main/cere2.png"
 alt="CEREBRUM LLM" width="420"/>
-# CERE-LLMA-3-8b-TR
-This model is an fine-tuned version of a Llama3 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
 ## Model Details
-- **Base Model**: LLMA 3 7B based LLM
 - **Tokenizer Extension**: Specifically extended for Turkish
 - **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
 - **Training Method**: Initially with DORA, followed by fine-tuning with LORA
@@ -35,11 +38,11 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
 device = "cuda" # the device to load the model onto
 model = AutoModelForCausalLM.from_pretrained(
-    "Cerebrum/cere-llama-3-8b-tr",
     torch_dtype="auto",
     device_map="auto"
 )
-tokenizer = AutoTokenizer.from_pretrained("Cerebrum/cere-llama-3-8b-tr")
 prompt = "Python'da ekrana 'Merhaba Dünya' nasıl yazılır?"
 messages = [

 ---
 <img src="https://huggingface.co/CerebrumTech/cere-llama-3-8b-tr/resolve/main/cere2.png"
 alt="CEREBRUM LLM" width="420"/>
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6639e48c27ef2d37a71eb4aa/Ds_KOVYwhRQ1FQY8S4WqO.png)
+# CERE V2 -LLMA-3.1-8b-TR
+This model is an fine-tuned version of a Llama3.1 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
 ## Model Details
+- **Base Model**: LLMA 3.1 8B based LLM
 - **Tokenizer Extension**: Specifically extended for Turkish
 - **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
 - **Training Method**: Initially with DORA, followed by fine-tuning with LORA
 device = "cuda" # the device to load the model onto
 model = AutoModelForCausalLM.from_pretrained(
+    "Cerebrum/cere-llama-3.1-8B-tr",
     torch_dtype="auto",
     device_map="auto"
 )
+tokenizer = AutoTokenizer.from_pretrained("Cerebrum/cere-llama-3.1-8B-tr")
 prompt = "Python'da ekrana 'Merhaba Dünya' nasıl yazılır?"
 messages = [