Prettified the md table
Browse files
README.md
CHANGED
@@ -66,13 +66,13 @@ transcription = processor.batch_decode(predicted_ids)
|
|
66 |
|
67 |
In fine-tuning, the following arguments were used:
|
68 |
|
69 |
-
|arg
|
70 |
-
|
71 |
-
|
72 |
-
| `per_device_train_batch_size
|
73 |
-
|
74 |
-
|
75 |
-
|
76 |
-
|
77 |
-
|
78 |
-
|
|
|
66 |
|
67 |
In fine-tuning, the following arguments were used:
|
68 |
|
69 |
+
| arg | value |
|
70 |
+
|-------------------------------|-------|
|
71 |
+
| `group_by_length` | True |
|
72 |
+
| `per_device_train_batch_size` | 16 |
|
73 |
+
| `gradient_accumulation_steps` | 4 |
|
74 |
+
| `num_train_epochs` | 8 |
|
75 |
+
| `gradient_checkpointing` | True |
|
76 |
+
| `fp16` | True |
|
77 |
+
| `learning_rate` | 3e-4 |
|
78 |
+
| `warmup_steps` | 500 |
|