Sao10K commited on
Commit
6988a4a
·
verified ·
1 Parent(s): 4781ffb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -6
README.md CHANGED
@@ -22,7 +22,7 @@ Lyra-v2a2
22
  v
23
  Lyra-v3
24
  |
25
- | [Backmerge + LoRA Extraction + Low Rank SFT Step]
26
  v
27
  Lyra-v4
28
  ```
@@ -64,9 +64,8 @@ min_p: 0.1 - 0.2 # Crucial for NeMo
64
  # Notes
65
 
66
  \- Some people have been having issues with run-on generations for Lyra-v3. Kind of weird, when I never had issues.
67
- <br>\- Anyway, make sure not to skip special tokens, or ban EOS tokens. I think this is the main problem that happens when v3 was to be quanted. The special tokens map is fucked in v3, Quantizing tools likely spazzed out seeing it. I blame llamafactory for it. It ran fine unquantised.
68
  <br>\- I like long generations, though I can control it easily to create short ones. If you're struggling, prompt better. Fix your system prompts, use an Author's Note, use a prefill. They are there for a reason.
69
- <br>\- Lyra passes my internal benchmark suite, hence why I'm releasing it. Do I like it? Yes? it's out. that's it. They are my models for my enjoyment.
70
- <br>\- Issues like roleplay format are what I consider worthless, as it follows few-shot examples fine. This is not a priority for me to 'fix', as I see no isses with it.
71
- <br>\- If you don't like it, just try another model? Plenty of other choices. Ymmv, I like it.
72
- <br>\- This may sound rough, but Roleplay is subjective. What I like, you may not like. It's fine.
 
22
  v
23
  Lyra-v3
24
  |
25
+ | [Backmerge to v2a1 + LoRA Extraction + Low Rank SFT Step for Coherency]
26
  v
27
  Lyra-v4
28
  ```
 
64
  # Notes
65
 
66
  \- Some people have been having issues with run-on generations for Lyra-v3. Kind of weird, when I never had issues.
67
+ <br>\- Anyway, make sure not to skip special tokens, or ban EOS tokens. I think this is the main problem that happens when v3 was to be quanted. The special tokens map config is fucked in v3, Quantizing tools likely spazzed out seeing it. I blame llamafactory for it. It ran fine unquantised.
68
  <br>\- I like long generations, though I can control it easily to create short ones. If you're struggling, prompt better. Fix your system prompts, use an Author's Note, use a prefill. They are there for a reason.
69
+ <br>\- Lyra passes my internal benchmark suite, hence why I'm releasing it. Do I like it? Yes? it's out. that's it. They are my models for my personal enjoyment first.
70
+ <br>\- Issues like roleplay format are what I consider worthless, as it follows few-shot examples fine. This is not a priority for me to 'fix', as I see no isses with it. Same with excessive generations. Its easy to cut out.
71
+ <br>\- If you don't like it, just try another model? Plenty of other choices. Ymmv, I like it.