rabbitcat commited on
Commit
dd2f4b2
·
verified ·
1 Parent(s): de9b4be

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -5,7 +5,7 @@ license: apache-2.0
5
  # DataSmith
6
 
7
  ## Introduction
8
- DataSmith is a large model for generating json format data based on text content. DataSmith is a large model for generating json format data from text content. DataSmith-6B is a 6 billion parameter model, which is carried out by comprehensively planning relevant data (including selected news, encyclopedias, law, medicine, advertising, papers, books, novels, announcements and other general fields) For fine-tuning, start with a series of models adapted to the task.
9
 
10
  ## Models Available
11
  - DataSmith-6B
@@ -73,7 +73,7 @@ print(response)
73
 
74
  ```
75
 
76
- ## DataSets
77
  We use gpt-4 to generate training corpus by constructing prompt. If you need it, please contact us by email.
78
 
79
  ## Contributing
 
5
  # DataSmith
6
 
7
  ## Introduction
8
+ DataSmith is a large model designed to generate JSON-format data from textual content. The DataSmith-6B version, equipped with 6 billion parameters, is fine-tuned using a comprehensive selection of data sources, including news, encyclopedias, legal documents, medical records, advertising, academic papers, books, novels, and various public announcements. This model serves as the foundation for a series of task-specific adaptations.
9
 
10
  ## Models Available
11
  - DataSmith-6B
 
73
 
74
  ```
75
 
76
+ ## Datasets
77
  We use gpt-4 to generate training corpus by constructing prompt. If you need it, please contact us by email.
78
 
79
  ## Contributing