Update README.md
README.md
CHANGED
@@ -62,64 +62,21 @@ Kyro-n1 is a lightweight and fast reasoning model based on **Qwen/Qwen2.5-14B-In
|
62 |
- Number of Attention Heads (GQA): 40 for Q and 8 for KV
|
63 |
- Context Length: Full 131,072 tokens and generation 8192 tokens
|
64 |
|
65 |
-
|
66 |
-
|
67 |
-
|
68 |
-
|
69 |
-
|
70 |
-
|
71 |
-
|
72 |
-
|
73 |
-
|
74 |
-
|
75 |
-
|
76 |
-
|
77 |
-
|
78 |
-
|
79 |
-
|
80 |
-
</td>
|
81 |
-
</tr>
|
82 |
-
<tr>
|
83 |
-
<td rowspan="3" >Kyro (text only)
|
84 |
-
</td>
|
85 |
-
<td rowspan="3" >A new mix of publicly available online data.
|
86 |
-
</td>
|
87 |
-
<td>3B
|
88 |
-
</td>
|
89 |
-
<td>Multilingual Text
|
90 |
-
</td>
|
91 |
-
<td>Multilingual Text and code
|
92 |
-
</td>
|
93 |
-
<td>128k
|
94 |
-
</td>
|
95 |
-
<td>Coming Soon
|
96 |
-
</td>
|
97 |
-
</tr>
|
98 |
-
<tr>
|
99 |
-
<td>7B
|
100 |
-
</td>
|
101 |
-
<td>Multilingual Text
|
102 |
-
</td>
|
103 |
-
<td>Multilingual Text and code
|
104 |
-
</td>
|
105 |
-
<td>128k
|
106 |
-
</td>
|
107 |
-
<td>Coming Soon
|
108 |
-
</td>
|
109 |
-
</tr>
|
110 |
-
<tr>
|
111 |
-
<td>14B
|
112 |
-
</td>
|
113 |
-
<td>Multilingual Text
|
114 |
-
</td>
|
115 |
-
<td>Multilingual Text and code
|
116 |
-
</td>
|
117 |
-
<td>128k
|
118 |
-
</td>
|
119 |
-
<td>This Model
|
120 |
-
</td>
|
121 |
-
</tr>
|
122 |
-
</table>
|
123 |
|
124 |
## **Usage**
|
125 |
|
62 |
- Number of Attention Heads (GQA): 40 for Q and 8 for KV
|
63 |
- Context Length: Full 131,072 tokens and generation 8192 tokens
|
64 |
|
65 |
+
## **Model Downloads**
|
66 |
+
|
67 |
+
### Kyro-n1 Models
|
68 |
+
|
69 |
+
<div align="center">
|
70 |
+
|
71 |
+
| | **Training Data** | **Params** | **Input modalities** | **Output modalities** | **Context length** | **Download Link** |
|
72 |
+
|--------------|------------------------------------|---------|------------------|----------------------|----------------|----------------|
|
73 |
+
| **Kyro (text only)** | A new mix of publicly available online data. | **3B** | Multilingual Text | Multilingual Text and code | 128k | [🤗 HuggingFace](https://huggingface.co/open-neo/Kyro-n1-3B) |
|
74 |
+
| | | **7B** | Multilingual Text | Multilingual Text and code | 128k | [🤗 HuggingFace](https://huggingface.co/open-neo/Kyro-n1-7B) |
|
75 |
+
| | | **14B** | Multilingual Text | Multilingual Text and code | 128k | [🤗 HuggingFace](https://huggingface.co/open-neo/Kyro-n1-14B) |
|
76 |
+
|
77 |
+
</div>
|
78 |
+
|
79 |
+
### Kyro-1 Models
|
80 |
|
81 |
## **Usage**
|
82 |
|