sm commited on
Commit
f455dd3
·
verified ·
1 Parent(s): fafc23f

End of training

Browse files
Files changed (2) hide show
  1. README.md +11 -11
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -97,7 +97,7 @@ xformers_attention: false
97
 
98
  This model is a fine-tuned version of [Qwen/Qwen1.5-0.5B-Chat](https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat) on the None dataset.
99
  It achieves the following results on the evaluation set:
100
- - Loss: 0.1833
101
 
102
  ## Model description
103
 
@@ -132,16 +132,16 @@ The following hyperparameters were used during training:
132
  | Training Loss | Epoch | Step | Validation Loss |
133
  |:-------------:|:------:|:----:|:---------------:|
134
  | 0.6096 | 0.0001 | 1 | 0.9462 |
135
- | 0.7819 | 0.0003 | 3 | 0.9376 |
136
- | 1.8223 | 0.0006 | 6 | 0.8625 |
137
- | 0.332 | 0.0009 | 9 | 0.6417 |
138
- | 0.3023 | 0.0012 | 12 | 0.4921 |
139
- | 0.2452 | 0.0015 | 15 | 0.4248 |
140
- | 0.1491 | 0.0018 | 18 | 0.3506 |
141
- | 0.1294 | 0.0021 | 21 | 0.2947 |
142
- | 0.082 | 0.0024 | 24 | 0.2336 |
143
- | 0.0518 | 0.0027 | 27 | 0.2086 |
144
- | 0.0397 | 0.0030 | 30 | 0.1833 |
145
 
146
 
147
  ### Framework versions
 
97
 
98
  This model is a fine-tuned version of [Qwen/Qwen1.5-0.5B-Chat](https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat) on the None dataset.
99
  It achieves the following results on the evaluation set:
100
+ - Loss: 0.1828
101
 
102
  ## Model description
103
 
 
132
  | Training Loss | Epoch | Step | Validation Loss |
133
  |:-------------:|:------:|:----:|:---------------:|
134
  | 0.6096 | 0.0001 | 1 | 0.9462 |
135
+ | 0.7787 | 0.0003 | 3 | 0.9371 |
136
+ | 1.8194 | 0.0006 | 6 | 0.8468 |
137
+ | 0.3322 | 0.0009 | 9 | 0.6456 |
138
+ | 0.3036 | 0.0012 | 12 | 0.4959 |
139
+ | 0.2457 | 0.0015 | 15 | 0.4270 |
140
+ | 0.151 | 0.0018 | 18 | 0.3516 |
141
+ | 0.1293 | 0.0021 | 21 | 0.2962 |
142
+ | 0.0827 | 0.0024 | 24 | 0.2342 |
143
+ | 0.0525 | 0.0027 | 27 | 0.2070 |
144
+ | 0.0386 | 0.0030 | 30 | 0.1828 |
145
 
146
 
147
  ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f501d0ca81d4234feb05e447ed76d49b90b5a8e8c2e040fae364f927cd8f480b
3
  size 30398410
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e46e72435c393f048a2ac944ae6382ddeb747d6fa573a2a0627e5ec390e836fa
3
  size 30398410