AndyChiang
commited on
Commit
•
2e9c141
1
Parent(s):
39b71ab
Update README.md
Browse files
README.md
CHANGED
@@ -49,17 +49,15 @@ print(cs)
|
|
49 |
|
50 |
This model is fine-tuned by [DGen](https://github.com/DRSY/DGen) dataset, which covers multiple domains including science, vocabulary, common sense and trivia. It is compiled from a wide variety of datasets including SciQ, MCQL, AI2 Science Questions, etc. The detail of DGen dataset is shown below.
|
51 |
|
52 |
-
|
|
53 |
-
| ------------------- | ----- | ----- | ----- |
|
54 |
-
|
|
55 |
-
| High school | 54794 | 7794 | 8318 |
|
56 |
-
| Total | 76850 | 11067 | 11516 |
|
57 |
|
58 |
You can also use the [dataset](https://github.com/AndyChiangSH/CDGP/blob/main/datasets/DGen.zip) we have already cleaned.
|
59 |
|
60 |
## Training
|
61 |
|
62 |
-
We use a special way to fine-tune model, which is called **"Answer-Relating Fine-Tune"**. More
|
63 |
|
64 |
### Training hyperparameters
|
65 |
|
|
|
49 |
|
50 |
This model is fine-tuned by [DGen](https://github.com/DRSY/DGen) dataset, which covers multiple domains including science, vocabulary, common sense and trivia. It is compiled from a wide variety of datasets including SciQ, MCQL, AI2 Science Questions, etc. The detail of DGen dataset is shown below.
|
51 |
|
52 |
+
| DGen dataset | Train | Valid | Test | Total |
|
53 |
+
| ------------------- | ----- | ----- | ---- | ----- |
|
54 |
+
| **Number of questions** | 2321 | 300 | 259 | 2880 |
|
|
|
|
|
55 |
|
56 |
You can also use the [dataset](https://github.com/AndyChiangSH/CDGP/blob/main/datasets/DGen.zip) we have already cleaned.
|
57 |
|
58 |
## Training
|
59 |
|
60 |
+
We use a special way to fine-tune model, which is called **"Answer-Relating Fine-Tune"**. More details are in our paper.
|
61 |
|
62 |
### Training hyperparameters
|
63 |
|