oguzhandoganoglu
commited on
Commit
•
9594dad
1
Parent(s):
93ecc1e
Update README.md
Browse files
README.md
CHANGED
@@ -6,13 +6,16 @@ language:
|
|
6 |
---
|
7 |
<img src="https://huggingface.co/CerebrumTech/cere-llama-3-8b-tr/resolve/main/cere2.png"
|
8 |
alt="CEREBRUM LLM" width="420"/>
|
9 |
-
# CERE-LLMA-3-8b-TR
|
10 |
|
11 |
-
|
|
|
|
|
|
|
|
|
12 |
|
13 |
## Model Details
|
14 |
|
15 |
-
- **Base Model**: LLMA 3
|
16 |
- **Tokenizer Extension**: Specifically extended for Turkish
|
17 |
- **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
|
18 |
- **Training Method**: Initially with DORA, followed by fine-tuning with LORA
|
@@ -35,11 +38,11 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
|
|
35 |
device = "cuda" # the device to load the model onto
|
36 |
|
37 |
model = AutoModelForCausalLM.from_pretrained(
|
38 |
-
"Cerebrum/cere-llama-3-
|
39 |
torch_dtype="auto",
|
40 |
device_map="auto"
|
41 |
)
|
42 |
-
tokenizer = AutoTokenizer.from_pretrained("Cerebrum/cere-llama-3-
|
43 |
|
44 |
prompt = "Python'da ekrana 'Merhaba Dünya' nasıl yazılır?"
|
45 |
messages = [
|
|
|
6 |
---
|
7 |
<img src="https://huggingface.co/CerebrumTech/cere-llama-3-8b-tr/resolve/main/cere2.png"
|
8 |
alt="CEREBRUM LLM" width="420"/>
|
|
|
9 |
|
10 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6639e48c27ef2d37a71eb4aa/Ds_KOVYwhRQ1FQY8S4WqO.png)
|
11 |
+
|
12 |
+
# CERE V2 -LLMA-3.1-8b-TR
|
13 |
+
|
14 |
+
This model is an fine-tuned version of a Llama3.1 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
|
15 |
|
16 |
## Model Details
|
17 |
|
18 |
+
- **Base Model**: LLMA 3.1 8B based LLM
|
19 |
- **Tokenizer Extension**: Specifically extended for Turkish
|
20 |
- **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
|
21 |
- **Training Method**: Initially with DORA, followed by fine-tuning with LORA
|
|
|
38 |
device = "cuda" # the device to load the model onto
|
39 |
|
40 |
model = AutoModelForCausalLM.from_pretrained(
|
41 |
+
"Cerebrum/cere-llama-3.1-8B-tr",
|
42 |
torch_dtype="auto",
|
43 |
device_map="auto"
|
44 |
)
|
45 |
+
tokenizer = AutoTokenizer.from_pretrained("Cerebrum/cere-llama-3.1-8B-tr")
|
46 |
|
47 |
prompt = "Python'da ekrana 'Merhaba Dünya' nasıl yazılır?"
|
48 |
messages = [
|