Update README.md
README.md
CHANGED
@@ -5,7 +5,7 @@ license: apache-2.0
 # DataSmith
 
 ## Introduction
-DataSmith is a large model
+DataSmith is a large model designed to generate JSON-format data from textual content. The DataSmith-6B version, equipped with 6 billion parameters, is fine-tuned using a comprehensive selection of data sources, including news, encyclopedias, legal documents, medical records, advertising, academic papers, books, novels, and various public announcements. This model serves as the foundation for a series of task-specific adaptations.
 
 ## Models Available
 - DataSmith-6B
@@ -73,7 +73,7 @@ print(response)
 
 ```
 
-##
+## Datasets
 We use gpt-4 to generate training corpus by constructing prompt. If you need it, please contact us by email.
 
 ## Contributing