JunxiongWang/llama3_mamba_0_5_sft

Junxiong Wang committed on Jul 20, 2024

Commit 1c0a562 · 1 Parent(s): a871be3

add models

Files changed (1)

README.md +4 -4


@@ -1,21 +1,21 @@
 ---
-base_model: /data/junxiong/llama3_0.50_mamba_progressive/
 tags:
 - alignment-handbook
 - generated_from_trainer
 datasets:
 - JunxiongWang/sftdataset
 model-index:
-- name: llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05
-This model is a fine-tuned version of [/data/junxiong/llama3_0.50_mamba_progressive/](https://huggingface.co//data/junxiong/llama3_0.50_mamba_progressive/) on the JunxiongWang/sftdataset dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.6645

 ---
+base_model: JunxiongWang/llama3_0.50_mamba_progressive
 tags:
 - alignment-handbook
 - generated_from_trainer
 datasets:
 - JunxiongWang/sftdataset
 model-index:
+- name: JunxiongWang/llama3_mamba_0_5_sft
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# JunxiongWang/llama3_mamba_0_5_sft
+This model is a fine-tuned version of [JunxiongWang/llama3_0.50_mamba_progressive](https://huggingface.co/JunxiongWang/llama3_0.50_mamba_progressive/) on the JunxiongWang/sftdataset dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.6645