oguzhandoganoglu commited on
Commit
9594dad
1 Parent(s): 93ecc1e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -5
README.md CHANGED
@@ -6,13 +6,16 @@ language:
6
  ---
7
  <img src="https://huggingface.co/CerebrumTech/cere-llama-3-8b-tr/resolve/main/cere2.png"
8
  alt="CEREBRUM LLM" width="420"/>
9
- # CERE-LLMA-3-8b-TR
10
 
11
- This model is an fine-tuned version of a Llama3 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
 
 
 
 
12
 
13
  ## Model Details
14
 
15
- - **Base Model**: LLMA 3 7B based LLM
16
  - **Tokenizer Extension**: Specifically extended for Turkish
17
  - **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
18
  - **Training Method**: Initially with DORA, followed by fine-tuning with LORA
@@ -35,11 +38,11 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
35
  device = "cuda" # the device to load the model onto
36
 
37
  model = AutoModelForCausalLM.from_pretrained(
38
- "Cerebrum/cere-llama-3-8b-tr",
39
  torch_dtype="auto",
40
  device_map="auto"
41
  )
42
- tokenizer = AutoTokenizer.from_pretrained("Cerebrum/cere-llama-3-8b-tr")
43
 
44
  prompt = "Python'da ekrana 'Merhaba Dünya' nasıl yazılır?"
45
  messages = [
 
6
  ---
7
  <img src="https://huggingface.co/CerebrumTech/cere-llama-3-8b-tr/resolve/main/cere2.png"
8
  alt="CEREBRUM LLM" width="420"/>
 
9
 
10
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6639e48c27ef2d37a71eb4aa/Ds_KOVYwhRQ1FQY8S4WqO.png)
11
+
12
+ # CERE V2 -LLMA-3.1-8b-TR
13
+
14
+ This model is an fine-tuned version of a Llama3.1 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
15
 
16
  ## Model Details
17
 
18
+ - **Base Model**: LLMA 3.1 8B based LLM
19
  - **Tokenizer Extension**: Specifically extended for Turkish
20
  - **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
21
  - **Training Method**: Initially with DORA, followed by fine-tuning with LORA
 
38
  device = "cuda" # the device to load the model onto
39
 
40
  model = AutoModelForCausalLM.from_pretrained(
41
+ "Cerebrum/cere-llama-3.1-8B-tr",
42
  torch_dtype="auto",
43
  device_map="auto"
44
  )
45
+ tokenizer = AutoTokenizer.from_pretrained("Cerebrum/cere-llama-3.1-8B-tr")
46
 
47
  prompt = "Python'da ekrana 'Merhaba Dünya' nasıl yazılır?"
48
  messages = [