Model save

Browse files

Files changed (4) hide show

Logs/events.out.tfevents.1718300677.78fe09153f4a.177.0 +2 -2
README.md +65 -0
Untitled.ipynb +21 -4
adapter_model.safetensors +1 -1

Logs/events.out.tfevents.1718300677.78fe09153f4a.177.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e39f11fe12b3d9dc1241eccafcfd0b589861d6e0aab1168f81b23776b49db392
-size 6094

 version https://git-lfs.github.com/spec/v1
+oid sha256:49a8da737407a3d136ed58ffa2951e324e9dc963055c9e4bcdad7b401767346d
+size 6930

README.md ADDED Viewed

	@@ -0,0 +1,65 @@

+---
+license: gemma
+library_name: peft
+tags:
+- generated_from_trainer
+base_model: google/paligemma-3b-pt-224
+datasets:
+- ecgt_vision
+model-index:
+- name: workspace
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# workspace
+This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on the ecgt_vision dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0006
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 8
+- seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 64
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 2
+- num_epochs: 2
+### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 1.4751        | 0.9961 | 129  | 0.0008          |
+| 0.0003        | 1.9923 | 258  | 0.0006          |
+### Framework versions
+- PEFT 0.11.1
+- Transformers 4.41.0
+- Pytorch 2.1.0+cu118
+- Datasets 2.19.1
+- Tokenizers 0.19.1

Untitled.ipynb CHANGED Viewed

@@ -860,8 +860,8 @@
        "\n",
        "    <div>\n",
        "      \n",
-       "      <progress value='130' max='258' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
-       "      [130/258 28:50 < 28:50, 0.07 it/s, Epoch 1.00/2]\n",
        "    </div>\n",
        "    <table border=\"1\" class=\"dataframe\">\n",
        "  <thead>\n",
@@ -872,12 +872,17 @@
        "    </tr>\n",
        "  </thead>\n",
        "  <tbody>\n",
        "  </tbody>\n",
        "</table><p>\n",
        "    <div>\n",
        "      \n",
-       "      <progress value='117' max='130' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
-       "      [117/130 01:30 < 00:10, 1.27 it/s]\n",
        "    </div>\n",
        "    "
       ],
@@ -887,6 +892,18 @@
      },
      "metadata": {},
      "output_type": "display_data"
     }
    ],
    "source": [

        "\n",
        "    <div>\n",
        "      \n",
+       "      <progress value='259' max='258' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
+       "      [258/258 59:40, Epoch 1.99/2]\n",
        "    </div>\n",
        "    <table border=\"1\" class=\"dataframe\">\n",
        "  <thead>\n",
        "    </tr>\n",
        "  </thead>\n",
        "  <tbody>\n",
+       "    <tr>\n",
+       "      <td>0</td>\n",
+       "      <td>1.475100</td>\n",
+       "      <td>0.000765</td>\n",
+       "    </tr>\n",
        "  </tbody>\n",
        "</table><p>\n",
        "    <div>\n",
        "      \n",
+       "      <progress value='78' max='130' style='width:300px; height:20px; vertical-align: middle;'></progress>\n",
+       "      [ 78/130 01:00 < 00:40, 1.28 it/s]\n",
        "    </div>\n",
        "    "
       ],
      },
      "metadata": {},
      "output_type": "display_data"
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "/usr/local/lib/python3.10/dist-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.\n",
+      "  warnings.warn(\n",
+      "/usr/local/lib/python3.10/dist-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.\n",
+      "  warnings.warn(\n",
+      "/usr/local/lib/python3.10/dist-packages/torch/utils/checkpoint.py:61: UserWarning: None of the inputs have requires_grad=True. Gradients will be None\n",
+      "  warnings.warn(\n"
+     ]
     }
    ],
    "source": [

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b05fce35a49cac791224fdcaac81a84989be4422c06c3abfd0b81226071f51e5
 size 45258384

 version https://git-lfs.github.com/spec/v1
+oid sha256:b3de0ea06c5f4ebd61eb07d272f091c40e58730148eb3faed31db50b47bd8c8a
 size 45258384