PhelixZhen
commited on
Commit
•
bea3170
1
Parent(s):
0df74f5
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,20 @@
|
|
1 |
---
|
2 |
license: mit
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
+
language:
|
4 |
+
- en
|
5 |
---
|
6 |
+
|
7 |
+
## This is a pre-release model interface, training started on February 7, 2024, and the model will be released in the future. ##
|
8 |
+
|
9 |
+
The model adopts the Phi architecture, with 550 million parameters. It only supports English and does not support code writing.
|
10 |
+
|
11 |
+
|
12 |
+
The model's dataset is obtained by cleaning and deduplicating open-source datasets, with pre-training using approximately 30 billion instances.
|
13 |
+
|
14 |
+
|
15 |
+
If you are a native English speaker, you might find these sentences uncomfortable to read because the training of this model and the writing of this document were only completed by a very inexperienced Chinese high school student.
|
16 |
+
|
17 |
+
|
18 |
+
Anyway, this is a new attempt. It is trained on consumer-grade devices and without the guidance of professionals, so it's hard for us to expect it to perform exceptionally well.
|
19 |
+
|
20 |
+
But we hope this will be the beginning of a new great exploration.
|