PJMixers-Dev
/

LLaMa-3.2-Instruct-JankMix-v0.2-SFT-3B

Model card Files Files and versions Community

xzuyn commited on Oct 14

Commit

d9c49e1

•

1 Parent(s): 7449dfc

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -5,6 +5,6 @@ base_model:
 - unsloth/Llama-3.2-3B-Instruct
 license: llama3.2
 ---
-A much further trained version, this time done with full finetuning instead of DoRA.
 Note: This likely has refusals like [PJMixers-Dev/LLaMa-3.2-Instruct-JankMix-v0.1-SFT-3B](https://huggingface.co/PJMixers-Dev/LLaMa-3.2-Instruct-JankMix-v0.1-SFT-3B) since no focus was put on removing refusals. I'm working on a KTO DoRA to solve this, and possibly improve roleplay performance.

 - unsloth/Llama-3.2-3B-Instruct
 license: llama3.2
 ---
+A much further trained version, this time done with full finetuning instead of DoRA. Similar ~50/50 mix of completion and instruct data.
 Note: This likely has refusals like [PJMixers-Dev/LLaMa-3.2-Instruct-JankMix-v0.1-SFT-3B](https://huggingface.co/PJMixers-Dev/LLaMa-3.2-Instruct-JankMix-v0.1-SFT-3B) since no focus was put on removing refusals. I'm working on a KTO DoRA to solve this, and possibly improve roleplay performance.