Update README.md
README.md
CHANGED
@@ -1,12 +1,21 @@
 ---
 library_name: transformers
-tags:
+tags:
+- llm.c
+license: mit
+datasets:
+- HuggingFaceFW/fineweb-edu
+- teknium/OpenHermes-2.5
+language:
+- en
+pipeline_tag: text-generation
 ---
 
-# Model Card for
+# Model Card for llm.c gpt2_350M trained on 10b fineweb-edu interleaved with OpenHermes 2.5
 
 <!-- Provide a quick summary of what the model is/does. -->
 
+![Loss](loss_curve.png)
 
 
 ## Model Details