Uploaded better trained version.
- README.md +3 -1
- model-00001-of-00002.safetensors +1 -1
- model-00002-of-00002.safetensors +1 -1
README.md
CHANGED
````diff
@@ -10,9 +10,11 @@ pipeline_tag: text-generation
 library_name: transformers
 ---
 # GPT4chan 24B AWQ
+
+
 This model is [v2ray/GPT4chan-24B](https://huggingface.co/v2ray/GPT4chan-24B) quantized to int4 using [casper-hansen/AutoAWQ](https://github.com/casper-hansen/AutoAWQ).
 
-Trained using 8x H100 with global batch size 64, using 2e-4 learning rate, for
+Trained using 8x H100 with global batch size 64, using 2e-4 learning rate, for 4000 steps, which is approximately 5 epochs.
 ## Prompt Format
 ```
 board<|start_header_id|>id<|end_header_id|>content<|start_header_id|>id<|end_header_id|>content...<|start_header_id|>id<|end_header_id|>
````
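The prompt format in the README is a flat concatenation of a board name followed by alternating id/content segments. A minimal sketch of how such a prompt could be assembled — the `build_prompt` helper and all sample ids and contents are hypothetical, not from the model card:

```python
# Special tokens as shown in the README's prompt format.
START = "<|start_header_id|>"
END = "<|end_header_id|>"

def build_prompt(board, posts, next_id):
    """Concatenate board name, then each (id, content) pair, then a
    trailing id header — the model is expected to generate the next
    post's content after it. Names and structure here are an
    illustrative reading of the README, not an official API."""
    out = board
    for post_id, content in posts:
        out += f"{START}{post_id}{END}{content}"
    out += f"{START}{next_id}{END}"
    return out

prompt = build_prompt("g", [("1001", "first post"), ("1002", "reply")], "1003")
```

This mirrors the documented pattern `board<|start_header_id|>id<|end_header_id|>content...` with the final header left open for generation.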
model-00001-of-00002.safetensors
CHANGED
```diff
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:b58bc6e6f998bb4cc4d28cce276591fe4c5351b88a7d7aa79359dc9dd01074e5
 size 9917518176
```
model-00002-of-00002.safetensors
CHANGED
```diff
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:ea4b0ab5829553d135964b44a234dae8493c7f875ef50d550848a618bf81189e
 size 4316893568
```
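The two `.safetensors` entries above are Git LFS pointer files: the `oid` is the SHA-256 digest of the actual shard and `size` is its byte length. After downloading a shard, its integrity can be checked against the pointer's oid; a minimal sketch (the file path and the commented digest comparison reflect this commit's pointers, everything else is generic stdlib code):

```python
import hashlib

def sha256_of(path, chunk=1 << 20):
    """Stream a file in 1 MiB chunks and return its SHA-256 hex digest,
    suitable for comparing against a Git LFS pointer's oid."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        while block := f.read(chunk):
            h.update(block)
    return h.hexdigest()

# Example usage with the first shard from this commit:
# assert sha256_of("model-00001-of-00002.safetensors") == \
#     "b58bc6e6f998bb4cc4d28cce276591fe4c5351b88a7d7aa79359dc9dd01074e5"
```

Streaming rather than reading the whole file at once matters here: the shards are roughly 9.9 GB and 4.3 GB per the `size` fields.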