End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ tags:
 - axolotl
 - generated_from_trainer
 model-index:
-- name: Llama-3-8B-tulu-human
   results: []
 ---
@@ -28,8 +28,8 @@ strict: false
 datasets:
   - path: penfever/tulu-v2-flan-v2-cot-science
     type: sharegpt.load_ultrachat
-chat_template: llama3
 dataset_prepared_path: ./datasets/tulu-human
 output_dir: ./outputs/tulu-human
@@ -37,14 +37,13 @@ sequence_len: 8192
 sample_packing: true
 pad_to_sequence_len: true
-shuffle_merged_datasets: true
 wandb_project: lm-evals
 wandb_entity:
 wandb_watch:
 wandb_name: Llama-3-8B-tulu-human
 wandb_log_model:
-hub_model_id: penfever/Llama-3-8B-tulu-human
 gradient_accumulation_steps: 8
 micro_batch_size: 1
@@ -83,8 +82,8 @@ special_tokens:
 </details><br>
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/nyu-dice-lab/lm-evals/runs/g4os1g4h)
-# Llama-3-8B-tulu-human
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the None dataset.

 - axolotl
 - generated_from_trainer
 model-index:
+- name: Llama-3-8B-tulu-human-v2
   results: []
 ---
 datasets:
   - path: penfever/tulu-v2-flan-v2-cot-science
     type: sharegpt.load_ultrachat
+    conversation: llama3
 dataset_prepared_path: ./datasets/tulu-human
 output_dir: ./outputs/tulu-human
 sample_packing: true
 pad_to_sequence_len: true
 wandb_project: lm-evals
 wandb_entity:
 wandb_watch:
 wandb_name: Llama-3-8B-tulu-human
 wandb_log_model:
+hub_model_id: penfever/Llama-3-8B-tulu-human-v2
 gradient_accumulation_steps: 8
 micro_batch_size: 1
 </details><br>
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/nyu-dice-lab/lm-evals/runs/rpepckaq)
+# Llama-3-8B-tulu-human-v2
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the None dataset.

pytorch_model-00001-of-00004.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2d429d1bd5e25c319ab854bf91c5770717f0ea3090cd16b30795d31b61aa1fc5
 size 4976718466

 version https://git-lfs.github.com/spec/v1
+oid sha256:eda0e48bd701e5c119343bcad0190e08a6abf1b9a933e0c3f2fc10e779d1a1d3
 size 4976718466

pytorch_model-00002-of-00004.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ec16e1c3bc0cc23d37e9370a614882239e752168a836da413101111fb0262cfa
 size 4999827718

 version https://git-lfs.github.com/spec/v1
+oid sha256:a0ce61ccaacc7d5e3cf36d5487c37512e8dcab7befed251a4342337e06fc67ad
 size 4999827718

pytorch_model-00003-of-00004.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e670f15553e39a1b73806858fdd873d3243c7d33d025ad8a5a7589776865018a
 size 4915940170

 version https://git-lfs.github.com/spec/v1
+oid sha256:8e9ee8f0fe08a3e592c175d9c274beb21c8e1919ffc59b52ca75e396a3f76605
 size 4915940170

pytorch_model-00004-of-00004.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7b6aa6c9bb17b4376a7a26deaf3d53d525b11fcaf44284dbb31caeddf386cc43
 size 1168140873

 version https://git-lfs.github.com/spec/v1
+oid sha256:b4ffbc8f50fc62ecad6c8e9e75d9497a027dbfd117fe4f80df83347ceb55d50b
 size 1168140873