Commit 0cf5f2c (verified), committed by tcarwash
1 Parent(s): 656b1d8

tcarwash/tinyllama-instruct

README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [tinyllama/tinyllama-1.1b-intermediate-step-1431k-3t](https://huggingface.co/tinyllama/tinyllama-1.1b-intermediate-step-1431k-3t) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3012
+- Loss: 1.3383
 
 ## Model description
 
@@ -46,16 +46,13 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
-- num_epochs: 4
+- training_steps: 300
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 1.4385 | 1.0 | 4263 | 1.2766 |
-| 1.4772 | 2.0 | 8526 | 1.2743 |
-| 1.0998 | 3.0 | 12789 | 1.2837 |
-| 1.3263 | 4.0 | 17052 | 1.3012 |
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 1.4051 | 0.0704 | 300 | 1.3383 |
 
 
 ### Framework versions
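The Epoch value of 0.0704 in the new results row is consistent with the step counts in the old table, if one assumes the steps-per-epoch figure (4263) did not change between the two runs, i.e. same dataset and effective batch size. That assumption is not stated in the diff. A quick check of the arithmetic, as a sketch and not something taken from the training script:

```python
# Values copied from the two "Training results" tables in the README diff.
steps_per_epoch = 4263        # step count logged at epoch 1.0 in the previous run
old_total_steps = 17052       # step count logged at epoch 4.0 in the previous run
new_training_steps = 300      # training_steps in the new run

# Assumption: steps per epoch is the same for both runs.
epoch_fraction = new_training_steps / steps_per_epoch

print(old_total_steps == 4 * steps_per_epoch)  # True
print(f"{epoch_fraction:.4f}")                 # 0.0704
```

Under that assumption the new run covers roughly 7% of one pass over the training data, compared with four full passes before.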
adapter_config.json CHANGED
@@ -1,7 +1,7 @@
 {
   "alpha_pattern": {},
   "auto_mapping": null,
-  "base_model_name_or_path": null,
+  "base_model_name_or_path": "tinyllama/tinyllama-1.1b-intermediate-step-1431k-3t",
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dcb0045517d11e5dc96444a144c9f7e188f081679dbbc1a52764bdf7ffcf551b
-size 36058104
+oid sha256:cf3322ebabf5fdd0f394ea20ade983db6aa6ae49d99316d807711cbca8333b5c
+size 36056608
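The adapter_model.safetensors entry above is a Git LFS pointer, not the weights themselves: three `key value` lines giving the spec version, the SHA-256 of the real file, and its size in bytes. A minimal sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of Git LFS or any Hugging Face library.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a small dict (illustrative helper)."""
    # Each non-empty line is "<key> <value>", separated by the first space.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }

# Pointer contents copied from the old and new sides of the diff.
old_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:dcb0045517d11e5dc96444a144c9f7e188f081679dbbc1a52764bdf7ffcf551b\n"
    "size 36058104\n"
)
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:cf3322ebabf5fdd0f394ea20ade983db6aa6ae49d99316d807711cbca8333b5c\n"
    "size 36056608\n"
)

print(old_pointer["size"] - new_pointer["size"])  # 1496
```

The new adapter file is 1496 bytes smaller than the old one. The diff alone does not say why.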
runs/May04_05-29-08_fc985ddde5a8/events.out.tfevents.1714800551.fc985ddde5a8.702.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:593a294c4880410da5f68b8ab81e95306d8544da5e1a70d70de7b03dd2f8d965
+size 11654
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:db0cad0f34b276b350f7482750bf5f1c4fe718452bce3cf3888dce72e6f5b0e0
+oid sha256:9c424d2d2c93624e7bd109cd8c85073defd3d9f6fc8ff9e60ba5a4db87e81d44
 size 4984
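The adapter_config.json change in this commit replaces a null `base_model_name_or_path` with the TinyLlama checkpoint id. That matters to any loader that reads the base model from the adapter config, because with a null value the caller has to supply it. The sketch below shows that difference with the standard library only; `resolve_base_model` is a hypothetical helper, and the two JSON strings are trimmed to the fields needed for the illustration.

```python
import json

def resolve_base_model(adapter_config_text, fallback=None):
    """Return the base model id recorded in an adapter config, or a fallback."""
    config = json.loads(adapter_config_text)
    base = config.get("base_model_name_or_path")
    if base is None:
        if fallback is None:
            raise ValueError("adapter config names no base model; pass one explicitly")
        return fallback
    return base

# Trimmed versions of the old and new adapter_config.json from the diff.
old_config = '{"base_model_name_or_path": null, "bias": "none"}'
new_config = (
    '{"base_model_name_or_path": '
    '"tinyllama/tinyllama-1.1b-intermediate-step-1431k-3t", "bias": "none"}'
)

print(resolve_base_model(new_config))
# tinyllama/tinyllama-1.1b-intermediate-step-1431k-3t
```

With the old config, the same call raises `ValueError` unless a `fallback` is given.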