Junxiong Wang committed on
Commit 1c0a562 · 1 Parent(s): a871be3

add models

Files changed (1)
  1. README.md +4 -4
README.md CHANGED
@@ -1,21 +1,21 @@
  ---
- base_model: /data/junxiong/llama3_0.50_mamba_progressive/
+ base_model: JunxiongWang/llama3_0.50_mamba_progressive
  tags:
  - alignment-handbook
  - generated_from_trainer
  datasets:
  - JunxiongWang/sftdataset
  model-index:
- - name: llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05
+ - name: JunxiongWang/llama3_mamba_0_5_sft
  results: []
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05

+ # JunxiongWang/llama3_mamba_0_5_sft

- This model is a fine-tuned version of [/data/junxiong/llama3_0.50_mamba_progressive/](https://huggingface.co//data/junxiong/llama3_0.50_mamba_progressive/) on the JunxiongWang/sftdataset dataset.
+ This model is a fine-tuned version of [JunxiongWang/llama3_0.50_mamba_progressive](https://huggingface.co/JunxiongWang/llama3_0.50_mamba_progressive/) on the JunxiongWang/sftdataset dataset.
  It achieves the following results on the evaluation set:
  - Loss: 0.6645
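
The diff replaces a local filesystem path in `base_model` with a `namespace/name` style identifier. As a rough illustration of that distinction, the sketch below checks whether a value has the `namespace/name` shape. The helper `looks_like_hub_repo_id` and its regular expression are hypothetical and simplified; they are not part of this commit and do not reproduce the Hub's actual validation rules.

```python
import re

# Hypothetical, simplified pattern (an assumption, not the Hub's real rules):
# exactly two segments separated by one "/", each starting with an
# alphanumeric character and containing only letters, digits, ".", "_", "-".
_SEGMENT = r"[A-Za-z0-9][A-Za-z0-9._-]*"
_REPO_ID = re.compile(_SEGMENT + "/" + _SEGMENT)


def looks_like_hub_repo_id(value: str) -> bool:
    """Return True if value has the 'namespace/name' shape, False otherwise."""
    return _REPO_ID.fullmatch(value) is not None


# Values taken from the removed and added `base_model` lines in the diff.
old_value = "/data/junxiong/llama3_0.50_mamba_progressive/"
new_value = "JunxiongWang/llama3_0.50_mamba_progressive"

print(looks_like_hub_repo_id(old_value))  # False: absolute local path
print(looks_like_hub_repo_id(new_value))  # True: namespace/name form
```

A check along these lines could be run before publishing a model card, since an auto-generated card may record whatever local directory the trainer was pointed at.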