Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,8 @@ inference:
|
|
15 |
temperature: 0.6
|
16 |
---
|
17 |
|
18 |
-
# AI-Sweden-Models/Llama-3-8B (checkpoint-
|
|
|
19 |
|
20 |
### Intended usage:
|
21 |
This is a base model, it can be finetuned to a particular use case.
|
@@ -55,8 +56,12 @@ Ikväll blir det grillat och det ser jag fram emot!"
|
|
55 |
`AI-Sweden-Models/Llama-3-8B` is a continuation of the pretraining process from `meta-llama/Meta-Llama-3-8B`.
|
56 |
It was trained on a subset from [The Nordic Pile](https://arxiv.org/abs/2303.17183) containing Swedish, Norwegian and Danish.
|
57 |
|
58 |
-
The dataset consists of 227 105 079 296 tokens.
|
59 |
|
60 |
## Benchmarks
|
61 |
|
62 |
-
Coming soon.
|
|
|
|
|
|
|
|
|
|
15 |
temperature: 0.6
|
16 |
---
|
17 |
|
18 |
+
# AI-Sweden-Models/Llama-3-8B (checkpoint-2700)
|
19 |
+
![](https://huggingface.co/AI-Sweden-Models/Llama-3-8B/resolve/main/l3swe.png?download=true)
|
20 |
|
21 |
### Intended usage:
|
22 |
This is a base model, it can be finetuned to a particular use case.
|
|
|
56 |
`AI-Sweden-Models/Llama-3-8B` is a continuation of the pretraining process from `meta-llama/Meta-Llama-3-8B`.
|
57 |
It was trained on a subset from [The Nordic Pile](https://arxiv.org/abs/2303.17183) containing Swedish, Norwegian and Danish.
|
58 |
|
59 |
+
The training dataset consists of 227 105 079 296 tokens.
|
60 |
|
61 |
## Benchmarks
|
62 |
|
63 |
+
Coming soon.
|
64 |
+
|
65 |
+
## Checkpoints
|
66 |
+
* 20/5/2024 (2700)
|
67 |
+
* 13/5/2024 (1500)
|