Geohunterr committed on
Commit 1c227a7 · verified · 1 Parent(s): 56ca19e

Model save

Logs/events.out.tfevents.1718300677.78fe09153f4a.177.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:e39f11fe12b3d9dc1241eccafcfd0b589861d6e0aab1168f81b23776b49db392
- size 6094
+ oid sha256:49a8da737407a3d136ed58ffa2951e324e9dc963055c9e4bcdad7b401767346d
+ size 6930
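The hunk above changes only a Git LFS pointer: `oid` is the SHA-256 of the real file's contents and `size` is its length in bytes. As a rough sketch of how a downloaded file could be checked against such a pointer, the helpers below (`parse_lfs_pointer` and `matches_pointer`, both hypothetical names, not part of any LFS tooling) parse the pointer text and compare hash and size. The whole file is read into memory for brevity, which is fine for a small event log but would need chunked hashing for large weights.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: str, pointer_text: str) -> bool:
    """Return True if the file's hash and byte size agree with the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    # The oid field looks like 'sha256:<hex digest>'.
    algorithm, _, expected_digest = fields["oid"].partition(":")
    data = Path(path).read_bytes()
    actual_digest = hashlib.new(algorithm, data).hexdigest()
    return actual_digest == expected_digest and len(data) == int(fields["size"])
```

Calling `matches_pointer` with the local path of the downloaded event file and the three pointer lines from the `+` side of the hunk would report whether the download corresponds to this commit.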
README.md ADDED
@@ -0,0 +1,65 @@
+ ---
+ license: gemma
+ library_name: peft
+ tags:
+ - generated_from_trainer
+ base_model: google/paligemma-3b-pt-224
+ datasets:
+ - ecgt_vision
+ model-index:
+ - name: workspace
+ results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # workspace
+
+ This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on the ecgt_vision dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.0006
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 2e-05
+ - train_batch_size: 16
+ - eval_batch_size: 8
+ - seed: 42
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 64
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 2
+ - num_epochs: 2
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:------:|:----:|:---------------:|
+ | 1.4751 | 0.9961 | 129 | 0.0008 |
+ | 0.0003 | 1.9923 | 258 | 0.0006 |
+
+
+ ### Framework versions
+
+ - PEFT 0.11.1
+ - Transformers 4.41.0
+ - Pytorch 2.1.0+cu118
+ - Datasets 2.19.1
+ - Tokenizers 0.19.1
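In the README's hyperparameter list, `total_train_batch_size` is a derived value, and the fractional epochs in the results table follow from gradient accumulation. The sketch below works through that arithmetic under two assumptions that the commit does not state: a single training device (`num_devices = 1`) and 518 micro-batches per epoch. The figure 518 is only inferred, because it reproduces the logged epoch values 0.9961 and 1.9923; the true dataset size is not given.

```python
# Values copied from the README's hyperparameter list.
train_batch_size = 16
gradient_accumulation_steps = 4

# Assumption: one training device; the commit does not say how many were used.
num_devices = 1

total_train_batch_size = train_batch_size * gradient_accumulation_steps * num_devices
print(total_train_batch_size)  # 64

# Assumption: 518 micro-batches per epoch, inferred (not logged) because it
# reproduces the epoch fractions in the training results table.
micro_batches_per_epoch = 518

# Leftover micro-batches that do not fill an accumulation window add no step.
steps_per_epoch = micro_batches_per_epoch // gradient_accumulation_steps


def epoch_at(step: int) -> float:
    """Fraction of epochs completed after `step` optimizer steps."""
    return step * gradient_accumulation_steps / micro_batches_per_epoch


print(steps_per_epoch)          # 129
print(round(epoch_at(129), 4))  # 0.9961
print(round(epoch_at(258), 4))  # 1.9923
```

Under these assumptions, the evaluation rows at steps 129 and 258 sit just short of whole epochs because each epoch ends with two micro-batches that do not complete an accumulation window.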
Untitled.ipynb CHANGED
@@ -860,8 +860,8 @@
  "\n",
  " <div>\n",
  " \n",
- " <progress value='130' max='258' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
- " [130/258 28:50 < 28:50, 0.07 it/s, Epoch 1.00/2]\n",
+ " <progress value='259' max='258' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
+ " [258/258 59:40, Epoch 1.99/2]\n",
  " </div>\n",
  " <table border=\"1\" class=\"dataframe\">\n",
  " <thead>\n",
@@ -872,12 +872,17 @@
  " </tr>\n",
  " </thead>\n",
  " <tbody>\n",
+ " <tr>\n",
+ " <td>0</td>\n",
+ " <td>1.475100</td>\n",
+ " <td>0.000765</td>\n",
+ " </tr>\n",
  " </tbody>\n",
  "</table><p>\n",
  " <div>\n",
  " \n",
- " <progress value='117' max='130' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
- " [117/130 01:30 < 00:10, 1.27 it/s]\n",
+ " <progress value='78' max='130' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
+ " [ 78/130 01:00 < 00:40, 1.28 it/s]\n",
  " </div>\n",
  " "
  ],
@@ -887,6 +892,18 @@
  },
  "metadata": {},
  "output_type": "display_data"
+ },
+ {
+ "name": "stderr",
+ "output_type": "stream",
+ "text": [
+ "/usr/local/lib/python3.10/dist-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.\n",
+ " warnings.warn(\n",
+ "/usr/local/lib/python3.10/dist-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.\n",
+ " warnings.warn(\n",
+ "/usr/local/lib/python3.10/dist-packages/torch/utils/checkpoint.py:61: UserWarning: None of the inputs have requires_grad=True. Gradients will be None\n",
+ " warnings.warn(\n"
+ ]
  }
  ],
  "source": [
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b05fce35a49cac791224fdcaac81a84989be4422c06c3abfd0b81226071f51e5
+ oid sha256:b3de0ea06c5f4ebd61eb07d272f091c40e58730148eb3faed31db50b47bd8c8a
  size 45258384
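Note on the stderr output that this commit adds to Untitled.ipynb: one of the PyTorch warnings asks for `use_reentrant` to be passed explicitly to `torch.utils.checkpoint`. One possible way to do that is the `gradient_checkpointing_kwargs` training argument in Transformers. This is only a sketch: it assumes training runs through the `Trainer` with gradient checkpointing enabled, which the diff does not show, and the names `training_kwargs` and `"out"` are illustrative.

```python
# Hypothetical keyword arguments; the notebook's training cell is not part of this diff.
training_kwargs = {
    "gradient_checkpointing": True,
    # Forwarded to torch.utils.checkpoint; False is the variant the warning recommends.
    "gradient_checkpointing_kwargs": {"use_reentrant": False},
}

# With transformers installed, these could be merged into the existing settings, e.g.:
#   from transformers import TrainingArguments
#   args = TrainingArguments(output_dir="out", **training_kwargs)
```

Whether this also affects the separate "None of the inputs have requires_grad=True" warning depends on how the adapter and base model were set up, which cannot be determined from this commit.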