Sao10K
/

MN-12B-Lyra-v4a1-Old

Model card Files Files and versions Community

Sao10K commited on Sep 6, 2024

Commit

6988a4a

·

verified ·

1 Parent(s): 4781ffb

Update README.md

Files changed (1) hide show

README.md +5 -6

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ Lyra-v2a2
   v
 Lyra-v3
   |
-  | [Backmerge + LoRA Extraction + Low Rank SFT Step]
   v
 Lyra-v4
 ```
@@ -64,9 +64,8 @@ min_p: 0.1 - 0.2 # Crucial for NeMo
 # Notes
 \- Some people have been having issues with run-on generations for Lyra-v3. Kind of weird, when I never had issues.
-<br>\- Anyway, make sure not to skip special tokens, or ban EOS tokens. I think this is the main problem that happens when v3 was to be quanted. The special tokens map is fucked in v3, Quantizing tools likely spazzed out seeing it. I blame llamafactory for it. It ran fine unquantised.
 <br>\- I like long generations, though I can control it easily to create short ones. If you're struggling, prompt better. Fix your system prompts, use an Author's Note, use a prefill. They are there for a reason.
-<br>\- Lyra passes my internal benchmark suite, hence why I'm releasing it. Do I like it? Yes? it's out. that's it. They are my models for my enjoyment.
-<br>\- Issues like roleplay format are what I consider worthless, as it follows few-shot examples fine. This is not a priority for me to 'fix', as I see no isses with it.
-<br>\- If you don't like it, just try another model? Plenty of other choices. Ymmv, I like it.
-<br>\- This may sound rough, but Roleplay is subjective. What I like, you may not like. It's fine.

   v
 Lyra-v3
   |
+  | [Backmerge to v2a1 + LoRA Extraction + Low Rank SFT Step for Coherency]
   v
 Lyra-v4
 ```
 # Notes
 \- Some people have been having issues with run-on generations for Lyra-v3. Kind of weird, when I never had issues.
+<br>\- Anyway, make sure not to skip special tokens, or ban EOS tokens. I think this is the main problem that happens when v3 was to be quanted. The special tokens map config is fucked in v3, Quantizing tools likely spazzed out seeing it. I blame llamafactory for it. It ran fine unquantised.
 <br>\- I like long generations, though I can control it easily to create short ones. If you're struggling, prompt better. Fix your system prompts, use an Author's Note, use a prefill. They are there for a reason.
+<br>\- Lyra passes my internal benchmark suite, hence why I'm releasing it. Do I like it? Yes? it's out. that's it. They are my models for my personal enjoyment first.
+<br>\- Issues like roleplay format are what I consider worthless, as it follows few-shot examples fine. This is not a priority for me to 'fix', as I see no isses with it. Same with excessive generations. Its easy to cut out.
+<br>\- If you don't like it, just try another model? Plenty of other choices. Ymmv, I like it.