StefanJevtic63 commited on
Commit
5704819
1 Parent(s): f1c2647

End of training

Browse files
README.md ADDED
@@ -0,0 +1,72 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: peft
3
+ language:
4
+ - sr
5
+ license: apache-2.0
6
+ base_model: openai/whisper-large-v2
7
+ tags:
8
+ - generated_from_trainer
9
+ model-index:
10
+ - name: Whisper - Serbian Model
11
+ results: []
12
+ ---
13
+
14
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
+ should probably proofread and complete it, then remove this comment. -->
16
+
17
+ # Whisper - Serbian Model
18
+
19
+ This model is a fine-tuned version of [openai/whisper-large-v2](https://huggingface.co/openai/whisper-large-v2) on the None dataset.
20
+ It achieves the following results on the evaluation set:
21
+ - Loss: 0.1228
22
+
23
+ ## Model description
24
+
25
+ More information needed
26
+
27
+ ## Intended uses & limitations
28
+
29
+ More information needed
30
+
31
+ ## Training and evaluation data
32
+
33
+ More information needed
34
+
35
+ ## Training procedure
36
+
37
+ ### Training hyperparameters
38
+
39
+ The following hyperparameters were used during training:
40
+ - learning_rate: 0.0009
41
+ - train_batch_size: 16
42
+ - eval_batch_size: 16
43
+ - seed: 42
44
+ - gradient_accumulation_steps: 2
45
+ - total_train_batch_size: 32
46
+ - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
47
+ - lr_scheduler_type: linear
48
+ - lr_scheduler_warmup_steps: 400
49
+ - training_steps: 4000
50
+ - mixed_precision_training: Native AMP
51
+
52
+ ### Training results
53
+
54
+ | Training Loss | Epoch | Step | Validation Loss |
55
+ |:-------------:|:------:|:----:|:---------------:|
56
+ | 0.1206 | 0.0705 | 500 | 0.1705 |
57
+ | 0.1162 | 0.1409 | 1000 | 0.1626 |
58
+ | 0.1189 | 0.2114 | 1500 | 0.1548 |
59
+ | 0.117 | 0.2819 | 2000 | 0.1467 |
60
+ | 0.1117 | 0.3524 | 2500 | 0.1386 |
61
+ | 0.1148 | 0.4228 | 3000 | 0.1313 |
62
+ | 0.1128 | 0.4933 | 3500 | 0.1264 |
63
+ | 0.1277 | 0.5638 | 4000 | 0.1228 |
64
+
65
+
66
+ ### Framework versions
67
+
68
+ - PEFT 0.13.2
69
+ - Transformers 4.46.3
70
+ - Pytorch 2.5.1
71
+ - Datasets 3.0.0
72
+ - Tokenizers 0.20.3
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0ec1f5cd3162555942968e0ec6d49142fa5e680fc12b20b9ca1d332e4ab79c62
3
  size 63056714
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:06040f0a65c33d2a3f082f5bfa1f8a8328ec6bf7f83978e0aabf2a928168fffb
3
  size 63056714
runs/Dec20_05-56-12_DESKTOP-1TFDHRE/events.out.tfevents.1734670574.DESKTOP-1TFDHRE.5100.2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5630f51555613ced1c604b3aa06bfc56d40f0a81a3c808a7b37c067cc89126ee
3
- size 37917
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4b6d16e5346eff63dcf6b5b93d9fa9c14da02e5921ccaf1b75ef5f87a3866880
3
+ size 42762