ZhangNy/reward-mimic

Files changed (4)

README.md +63 -0
config.json +11 -0
model.safetensors +3 -0
training_args.bin +3 -0

README.md ADDED

	@@ -0,0 +1,63 @@

+---
+library_name: transformers
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+model-index:
+- name: 2024-11-18_10-58-28
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# 2024-11-18_10-58-28
+This model is a fine-tuned version of an unspecified base model on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.5595
+- Accuracy: 0.7301
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 3072
+- eval_batch_size: 3072
+- seed: 42
+- optimizer: adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 (no additional optimizer arguments)
+- lr_scheduler_type: linear
+- num_epochs: 2
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.6789        | 0.4   | 20   | 0.6699          | 0.7086   |
+| 0.6399        | 0.8   | 40   | 0.6259          | 0.7125   |
+| 0.5977        | 1.2   | 60   | 0.5863          | 0.7184   |
+| 0.5721        | 1.6   | 80   | 0.5655          | 0.7272   |
+| 0.5639        | 2.0   | 100  | 0.5595          | 0.7301   |
+### Framework versions
+- Transformers 4.46.2
+- Pytorch 2.1.2+cu121
+- Datasets 3.1.0
+- Tokenizers 0.20.3
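
The hyperparameters and results table in the card imply a few quantities it does not state outright. The following is a minimal sketch, not the training code from this commit. It assumes no warmup (the card lists none), that the linear scheduler decays to zero at the final step, and that 3072 is the effective batch size (the card lists no gradient accumulation or multi-device total). The function name `linear_lr` is hypothetical.

```python
# Values copied from the model card.
LEARNING_RATE = 5e-05
TRAIN_BATCH_SIZE = 3072
NUM_EPOCHS = 2
TOTAL_STEPS = 100  # last row of the training results table


def linear_lr(step: int,
              base_lr: float = LEARNING_RATE,
              total_steps: int = TOTAL_STEPS,
              warmup_steps: int = 0) -> float:
    """Linear warmup followed by linear decay to zero.

    warmup_steps=0 is an assumption; the card does not mention warmup.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


steps_per_epoch = TOTAL_STEPS // NUM_EPOCHS
# Upper bound only: assumes every batch, including the last, is full.
max_examples_per_epoch = steps_per_epoch * TRAIN_BATCH_SIZE

for step in (0, 20, 40, 60, 80, 100):
    print(f"step {step:3d}: lr = {linear_lr(step):.2e}")
print(f"steps per epoch: {steps_per_epoch}")
print(f"examples per epoch (at most): {max_examples_per_epoch}")
```

Under these assumptions, each logged row in the table (every 20 steps) corresponds to 0.4 of an epoch, which matches the Epoch column.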

config.json ADDED

	@@ -0,0 +1,11 @@

+{
+  "architectures": [
+    "RewardModel"
+  ],
+  "mlp_hidden_dim": 2048,
+  "model_type": "reward",
+  "textual_output_dim": 512,
+  "torch_dtype": "float32",
+  "transformers_version": "4.46.2",
+  "visual_output_dim": 512
+}
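
The `RewardModel` architecture and the `reward` model type appear to be custom code rather than a stock Transformers architecture, and the defining class is not part of this commit. As a rough illustration of how the three dimension fields might be consumed, here is a sketch that parses the config into a hypothetical `RewardConfig` dataclass. The `fused_dim` property assumes the visual and textual embeddings are concatenated before the MLP, which the source does not confirm.

```python
import json
from dataclasses import dataclass

# config.json contents copied from the diff.
CONFIG_JSON = """
{
  "architectures": ["RewardModel"],
  "mlp_hidden_dim": 2048,
  "model_type": "reward",
  "textual_output_dim": 512,
  "torch_dtype": "float32",
  "transformers_version": "4.46.2",
  "visual_output_dim": 512
}
"""


@dataclass(frozen=True)
class RewardConfig:
    """Hypothetical container; not the repository's actual config class."""
    visual_output_dim: int
    textual_output_dim: int
    mlp_hidden_dim: int
    torch_dtype: str = "float32"

    @property
    def fused_dim(self) -> int:
        # Assumption: the two embeddings are concatenated before the MLP.
        return self.visual_output_dim + self.textual_output_dim


raw = json.loads(CONFIG_JSON)
config = RewardConfig(
    visual_output_dim=raw["visual_output_dim"],
    textual_output_dim=raw["textual_output_dim"],
    mlp_hidden_dim=raw["mlp_hidden_dim"],
    torch_dtype=raw["torch_dtype"],
)
print(f"MLP input width (if concatenated): {config.fused_dim}")
print(f"MLP hidden width: {config.mlp_hidden_dim}")
```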

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9d3bb69540bddf484020dd1570ce5b266428b5d5605db7ac1bb351c5c09bb349
+size 808845424
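
The pointer's `size` field, combined with the `float32` dtype declared in config.json, gives a quick upper bound on the parameter count. This is a back-of-the-envelope sketch: it assumes every stored tensor is float32 and ignores the safetensors header, so the true count is slightly lower.

```python
SAFETENSORS_SIZE_BYTES = 808_845_424  # "size" field of the LFS pointer
BYTES_PER_FLOAT32 = 4                 # config.json declares torch_dtype float32

# Upper bound: the file also holds a small header, so this overestimates a little.
max_params = SAFETENSORS_SIZE_BYTES // BYTES_PER_FLOAT32
print(f"at most {max_params:,} parameters (~{max_params / 1e6:.1f}M)")
```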

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a6bda65b1f97fc3d662c03be13ec30c632b27b1edc0e0e152ec502ed420de51a
+size 5496
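
Both binary files are stored as Git LFS pointers, which record only a SHA-256 digest and a byte size. A downloaded file can be checked against its pointer with the standard library alone. The helper names `parse_lfs_pointer` and `matches_pointer` below are hypothetical, and the sketch handles only the three-line pointer layout shown in this diff.

```python
import hashlib
from pathlib import Path

# Pointer text for training_args.bin, copied from the diff.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:a6bda65b1f97fc3d662c03be13ec30c632b27b1edc0e0e152ec502ed420de51a
size 5496
"""


def parse_lfs_pointer(text: str) -> dict[str, str]:
    """Split each 'key value' line of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer: dict[str, str]) -> bool:
    """Return True if the file's size and SHA-256 digest match the pointer."""
    algo, _, expected = pointer["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    if path.stat().st_size != int(pointer["size"]):
        return False
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["oid"], pointer["size"])
```

After downloading the real file, a call such as `matches_pointer(Path("training_args.bin"), pointer)` would confirm integrity; the local path here is only an example.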