Update README.md
README.md
CHANGED
@@ -83,11 +83,11 @@ print(tokenizer.decode(output))
 |13b model|13b|40|5120|40|2048|
 
 
-## Training
+## Training
 
 - **Pre-training:**
-  - **Hardware:**
-  - **Software:** Megatron-
+  - **Hardware:** 128 A100 40GB GPUs ([mdx cluster](https://mdx.jp/en/))
+  - **Software:** Megatron-LM
 
 - **Instruction tuning:**
   - **Hardware:** 8 A100 40GB GPUs ([mdx cluster](https://mdx.jp/en/))