Omitted some unnecessary details in hyperpars
Browse files
README.md
CHANGED
@@ -68,11 +68,8 @@ In fine-tuning, the following arguments were used:
|
|
68 |
|
69 |
| arg | value |
|
70 |
|-------------------------------|-------|
|
71 |
-
| `group_by_length` | True |
|
72 |
| `per_device_train_batch_size` | 16 |
|
73 |
| `gradient_accumulation_steps` | 4 |
|
74 |
| `num_train_epochs` | 8 |
|
75 |
-
| `gradient_checkpointing` | True |
|
76 |
-
| `fp16` | True |
|
77 |
| `learning_rate` | 3e-4 |
|
78 |
| `warmup_steps` | 500 |
|
|
|
68 |
|
69 |
| arg | value |
|
70 |
|-------------------------------|-------|
|
|
|
71 |
| `per_device_train_batch_size` | 16 |
|
72 |
| `gradient_accumulation_steps` | 4 |
|
73 |
| `num_train_epochs` | 8 |
|
|
|
|
|
74 |
| `learning_rate` | 3e-4 |
|
75 |
| `warmup_steps` | 500 |
|