End of training

Files changed (9)

README.md +75 -0
config.json +42 -0
model.safetensors +3 -0
preprocessor_config.json +23 -0
runs/Jan29_13-55-32_modal/events.out.tfevents.1738158933.modal.2.0 +3 -0
runs/Jan29_13-55-32_modal/events.out.tfevents.1738158933.modal.2.1 +3 -0
runs/Jan29_13-55-32_modal/events.out.tfevents.1738159015.modal.2.2 +3 -0
runs/Jan29_13-55-32_modal/events.out.tfevents.1738159015.modal.2.3 +3 -0
training_args.bin +3 -0

README.md ADDED

	@@ -0,0 +1,75 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: google/vit-base-patch16-224-in21k
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+model-index:
+- name: test_model_88
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# test_model_88
+This model is a fine-tuned version of [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) on the corranm/first_vote_100_per_new2 dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.8934
+- F1 Macro: 0.0606
+- F1 Micro: 0.1591
+- F1 Weighted: 0.0846
+- Precision Macro: 0.0421
+- Precision Micro: 0.1591
+- Precision Weighted: 0.0586
+- Recall Macro: 0.1132
+- Recall Micro: 0.1591
+- Recall Weighted: 0.1591
+- Accuracy: 0.1591
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 32
+- eval_batch_size: 32
+- seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 128
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 3
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1 Macro | F1 Micro | F1 Weighted | Precision Macro | Precision Micro | Precision Weighted | Recall Macro | Recall Micro | Recall Weighted | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:-----------:|:---------------:|:---------------:|:------------------:|:------------:|:------------:|:---------------:|:--------:|
+| 1.9682        | 0.8   | 3    | 1.9070          | 0.0599   | 0.2121   | 0.0848      | 0.0661          | 0.2121          | 0.0908             | 0.1486       | 0.2121       | 0.2121          | 0.2121   |
+| 1.8993        | 1.8   | 6    | 1.8860          | 0.0902   | 0.2197   | 0.1243      | 0.0630          | 0.2197          | 0.0867             | 0.1594       | 0.2197       | 0.2197          | 0.2197   |
+| 2.3539        | 2.8   | 9    | 1.8915          | 0.0637   | 0.1591   | 0.0887      | 0.0443          | 0.1591          | 0.0616             | 0.1141       | 0.1591       | 0.1591          | 0.1591   |
+### Framework versions
+- Transformers 4.48.1
+- Pytorch 2.5.1+cu124
+- Datasets 3.2.0
+- Tokenizers 0.21.0
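Note on the metrics in the model card: Precision Micro, Recall Micro, F1 Micro, Recall Weighted, and Accuracy carry the same value in every row (for example 0.1591 in the final evaluation). That is expected for single-label classification, where each wrong prediction counts as exactly one false positive and one false negative. The sketch below illustrates the identity in plain Python; the toy labels are hypothetical and are not taken from the evaluation set, and the helper names are illustrative only.

```python
from collections import Counter

def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def micro_scores(y_true, y_pred):
    """Micro-averaged (precision, recall, F1) for single-label predictions."""
    tp = sum(t == p for t, p in zip(y_true, y_pred))
    # Each wrong prediction is one false positive (for the predicted class)
    # and one false negative (for the true class), so the totals coincide.
    fp = len(y_true) - tp
    fn = fp
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall) if tp else 0.0
    return precision, recall, f1

def weighted_recall(y_true, y_pred):
    """Per-class recall averaged with class support as the weight."""
    support = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return sum(support[c] * (correct[c] / support[c]) for c in support) / len(y_true)

# Hypothetical labels drawn from the seven class names in config.json.
y_true = ["-", "0", "1", "2", "2", "3", "4", "5"]
y_pred = ["2", "2", "2", "2", "2", "2", "4", "0"]

acc = accuracy(y_true, y_pred)
precision, recall, f1 = micro_scores(y_true, y_pred)
w_recall = weighted_recall(y_true, y_pred)
print(acc, precision, recall, f1, w_recall)
```

The macro-averaged columns differ from accuracy because they average per-class scores without weighting by support, so classes the model never predicts pull them down.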

config.json ADDED

	@@ -0,0 +1,42 @@

+{
+  "_name_or_path": "google/vit-base-patch16-224-in21k",
+  "architectures": [
+    "ViTForImageClassification"
+  ],
+  "attention_probs_dropout_prob": 0.0,
+  "encoder_stride": 16,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.0,
+  "hidden_size": 768,
+  "id2label": {
+    "0": "-",
+    "1": "0",
+    "2": "1",
+    "3": "2",
+    "4": "3",
+    "5": "4",
+    "6": "5"
+  },
+  "image_size": 224,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "label2id": {
+    "-": "0",
+    "0": "1",
+    "1": "2",
+    "2": "3",
+    "3": "4",
+    "4": "5",
+    "5": "6"
+  },
+  "layer_norm_eps": 1e-12,
+  "model_type": "vit",
+  "num_attention_heads": 12,
+  "num_channels": 3,
+  "num_hidden_layers": 12,
+  "patch_size": 16,
+  "problem_type": "single_label_classification",
+  "qkv_bias": true,
+  "torch_dtype": "float32",
+  "transformers_version": "4.48.1"
+}
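Note on reading this config: the sketch below derives a few quantities from the values above (patch grid, token sequence length, per-head width) and builds integer-keyed label maps. It uses only an excerpt of the config embedded as a string; the variable names and the `decode` helper are illustrative. Two details are easy to miss: the `label2id` values are stored as strings in this file, and class index `1` maps to the label `"0"`, so indices and label names are offset by one.

```python
import json

# Excerpt of the config.json added in this commit.
CONFIG_EXCERPT = """
{
  "hidden_size": 768,
  "image_size": 224,
  "num_attention_heads": 12,
  "patch_size": 16,
  "id2label": {"0": "-", "1": "0", "2": "1", "3": "2", "4": "3", "5": "4", "6": "5"},
  "label2id": {"-": "0", "0": "1", "1": "2", "2": "3", "3": "4", "4": "5", "5": "6"}
}
"""

config = json.loads(CONFIG_EXCERPT)

# A 224x224 image cut into 16x16 patches gives a 14x14 grid.
patches_per_side = config["image_size"] // config["patch_size"]
num_patches = patches_per_side ** 2
# ViT prepends one class token to the patch sequence.
sequence_length = num_patches + 1
# Width of each attention head.
head_dim = config["hidden_size"] // config["num_attention_heads"]

# JSON object keys are always strings; convert to ints for lookup by class index.
id2label = {int(index): label for index, label in config["id2label"].items()}
# In this file the label2id values are strings too, so convert them as well.
label2id = {label: int(index) for label, index in config["label2id"].items()}

def decode(class_index):
    """Map a predicted class index to its label string."""
    return id2label[class_index]

print(num_patches, sequence_length, head_dim, decode(1))
```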

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fd3cd7b8e144e0436bf3d14cce67ba5c32104db453b530509463535ffb571c83
+size 343239356
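Note on the file size: the pointer above records 343239356 bytes, which can be sanity-checked against the architecture in config.json. The sketch below counts parameters under stated assumptions: a standard ViT encoder layout with biases on every linear layer (`qkv_bias` is true), two layer norms per block plus a final one, no pooling layer, and a 7-way linear classifier. The function name is hypothetical. With `float32` weights the tensors would take about 343.2 MB, leaving roughly 23 KB that would be consistent with the safetensors header, though the header size itself is not shown in this commit.

```python
def linear_params(n_in, n_out):
    """Weights plus bias of a dense layer."""
    return n_in * n_out + n_out

def vit_classifier_param_count(hidden=768, intermediate=3072, layers=12,
                               patch=16, channels=3, image=224, num_labels=7):
    """Estimate the parameter count from the config.json values (assumed layout)."""
    tokens = (image // patch) ** 2 + 1  # patches plus the class token
    embeddings = (
        linear_params(channels * patch * patch, hidden)  # patch projection
        + hidden                                         # class token
        + tokens * hidden                                # position embeddings
    )
    per_layer = (
        4 * linear_params(hidden, hidden)        # query, key, value, attention output
        + linear_params(hidden, intermediate)    # MLP up-projection
        + linear_params(intermediate, hidden)    # MLP down-projection
        + 2 * 2 * hidden                         # two layer norms (scale and shift)
    )
    final_norm = 2 * hidden
    classifier = linear_params(hidden, num_labels)
    return embeddings + layers * per_layer + final_norm + classifier

LFS_SIZE_BYTES = 343239356  # from the pointer file above
BYTES_PER_FLOAT32 = 4

param_count = vit_classifier_param_count()
tensor_bytes = param_count * BYTES_PER_FLOAT32
remainder = LFS_SIZE_BYTES - tensor_bytes  # presumably header and metadata
print(param_count, tensor_bytes, remainder)
```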

preprocessor_config.json ADDED

	@@ -0,0 +1,23 @@

+{
+  "do_convert_rgb": null,
+  "do_normalize": true,
+  "do_rescale": true,
+  "do_resize": true,
+  "image_mean": [
+    0.5,
+    0.5,
+    0.5
+  ],
+  "image_processor_type": "ViTImageProcessor",
+  "image_std": [
+    0.5,
+    0.5,
+    0.5
+  ],
+  "resample": 2,
+  "rescale_factor": 0.00392156862745098,
+  "size": {
+    "height": 224,
+    "width": 224
+  }
+}
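Note on the preprocessing values: `rescale_factor` is 1/255, and a mean and standard deviation of 0.5 then map pixel values from [0, 255] to [-1, 1]. The `resample` value 2 corresponds to bilinear interpolation in Pillow's resampling enum. The sketch below reproduces only the per-pixel arithmetic in plain Python; it is not the `ViTImageProcessor` implementation, and the function name is illustrative.

```python
# Values copied from preprocessor_config.json.
RESCALE_FACTOR = 0.00392156862745098  # 1 / 255
IMAGE_MEAN = 0.5
IMAGE_STD = 0.5

def normalize_pixel(value):
    """Apply rescale (do_rescale) then normalize (do_normalize) to one channel value."""
    rescaled = value * RESCALE_FACTOR          # [0, 255] -> [0, 1]
    return (rescaled - IMAGE_MEAN) / IMAGE_STD  # [0, 1]   -> [-1, 1]

darkest = normalize_pixel(0)
brightest = normalize_pixel(255)
print(darkest, brightest)
```

The same mean and standard deviation apply to all three channels, so the mapping is identical for R, G, and B.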

runs/Jan29_13-55-32_modal/events.out.tfevents.1738158933.modal.2.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:880f4b8240982a90a5dda0693920ce52aae560c40f761822750de84cc82c965e
+size 8766

runs/Jan29_13-55-32_modal/events.out.tfevents.1738158933.modal.2.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d0cbbd69cfd4c3d48864e1062ff4a3e89f2a469c8022ae19b8a1143d534c34d2
+size 8766

runs/Jan29_13-55-32_modal/events.out.tfevents.1738159015.modal.2.2 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2166e548642971fe00679c4ce2c3d2273af9e537ce2c945f39621c1919a4e8a3
+size 906

runs/Jan29_13-55-32_modal/events.out.tfevents.1738159015.modal.2.3 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e78b8c19fcb40f6af0f54703aa536b593bed9a070c141a83bd84e11188c55588
+size 906
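Note on the pointer files: the binary files in this commit are stored through Git LFS, so each diff shows a three-line pointer (spec version, SHA-256 object ID, size in bytes) rather than the file contents. A minimal sketch of parsing such a pointer follows, assuming the `key value` line layout shown above; `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer into a dict with version, hash algorithm, oid, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algorithm": algorithm,
        "oid": digest,
        "size": int(fields["size"]),
    }

# Pointer for the last TensorBoard event file in this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e78b8c19fcb40f6af0f54703aa536b593bed9a070c141a83bd84e11188c55588
size 906
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["oid_algorithm"], pointer["size"])
```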

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:984de6a9d3da3da7102e0dbfc44f6ad01de01e45df770859e0009a0a4c8fd644
+size 5368
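Note on the training arguments: `training_args.bin` holds the serialized training arguments, whose main values are listed in the model card. The sketch below works through the arithmetic they imply: the effective batch size, the warmup step count, and a linear warmup-then-decay learning-rate curve. Two inputs are assumptions rather than facts from this commit: a single training device, and 9 total optimizer steps (taken from the last step logged in the results table). The schedule function is a plain-Python illustration of the usual linear shape, not a call into the library.

```python
import math

# From the model card.
LEARNING_RATE = 5e-05
TRAIN_BATCH_SIZE = 32
GRADIENT_ACCUMULATION_STEPS = 4
WARMUP_RATIO = 0.1

# Assumptions, not stated in the commit.
NUM_DEVICES = 1   # assumed single device
TOTAL_STEPS = 9   # assumed from the last logged step in the results table

# Samples contributing to each optimizer update.
effective_batch_size = TRAIN_BATCH_SIZE * GRADIENT_ACCUMULATION_STEPS * NUM_DEVICES

# Warmup length as a fraction of total steps, rounded up.
warmup_steps = math.ceil(TOTAL_STEPS * WARMUP_RATIO)

def linear_lr(step):
    """Learning rate at an optimizer step: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        return LEARNING_RATE * step / max(1, warmup_steps)
    remaining = max(0, TOTAL_STEPS - step)
    return LEARNING_RATE * remaining / max(1, TOTAL_STEPS - warmup_steps)

schedule = [linear_lr(step) for step in range(TOTAL_STEPS + 1)]
print(effective_batch_size, warmup_steps, schedule)
```

Under these assumptions the run has only one warmup step and eight decay steps, so the learning rate spends almost the whole run below its peak.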