Saving weights and logs of step 1000 - epoch 3
Browse files- README.md +2 -1
- flax_model.msgpack +1 -1
- training_state.bin +2 -2
README.md
CHANGED
@@ -48,7 +48,7 @@ The following hyperparameters were used during training:
|
|
48 |
- num_train_dataset_workers: 32
|
49 |
- num_hosts: 1
|
50 |
- total_num_training_examples: 640,000
|
51 |
-
- steps_per_epoch:
|
52 |
- num_beams: None
|
53 |
- weight_decay: 0.01
|
54 |
- adam_beta1: 0.9
|
@@ -63,6 +63,7 @@ The following hyperparameters were used during training:
|
|
63 |
| step | validation_loss | train_loss | validation_wer | validation_cer | validation_exact_wer | validation_exact_cer |
|
64 |
|:----:|:---------------:|:----------:|:--------------:|:--------------:|:--------------------:|:--------------------:|
|
65 |
| 0 | 6.7175 | 4.8002 | 101.9947 | 49.5329 | 103.4574 | 50.6274 |
|
|
|
66 |
|
67 |
|
68 |
### Framework versions
|
|
|
48 |
- num_train_dataset_workers: 32
|
49 |
- num_hosts: 1
|
50 |
- total_num_training_examples: 640,000
|
51 |
+
- steps_per_epoch: 287
|
52 |
- num_beams: None
|
53 |
- weight_decay: 0.01
|
54 |
- adam_beta1: 0.9
|
|
|
63 |
| step | validation_loss | train_loss | validation_wer | validation_cer | validation_exact_wer | validation_exact_cer |
|
64 |
|:----:|:---------------:|:----------:|:--------------:|:--------------:|:--------------------:|:--------------------:|
|
65 |
| 0 | 6.7175 | 4.8002 | 101.9947 | 49.5329 | 103.4574 | 50.6274 |
|
66 |
+
| 1000 | 1.8389 | 0.4886 | 22.2074 | 6.0723 | 26.1968 | 6.6321 |
|
67 |
|
68 |
|
69 |
### Framework versions
|
flax_model.msgpack
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 3055465603
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:231dc8ff5bc449089c6f5a4b4c57043dee0e96681fff77f2ca9cb8df93403695
|
3 |
size 3055465603
|
training_state.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f4bb1411e39ce7e22da76bc4319a13daa8930969ab7745499673f7e84b0b60c9
|
3 |
+
size 528
|