something-else
committed on
Commit b20be4e
1 Parent(s): 47e3a73
Update README.md
README.md
CHANGED
@@ -40,4 +40,5 @@ tags:
 - rwkv-9Q-Final-N8-1k.pth: Using rwkv-9Q-stp1447-N8.pth I added 2569 steps of N8, which are 106 Gtokens, with a loss of 1.801.
 - rwkv-9Q-1k-stp706-N8-0.pth: Using rwkv-9Q-1k-stp706-N8-0.pth I added 706 new steps and 29.13 Gtokens of N8-0 with a loss of 1.78.
 - rwkv-9Q-4k-stp248.pth: Using rwkv-9Q-1k-stp706-N8-0.pth I added 2048 new steps with 40.66 Gtokens, with a loss of 1.717, the Nathan-0 dataset and Ctx=4096.
-- rwkv-9Q-16k-step6-0-4.pth: Using rwkv-9Q-4k-stp248.pth I added N-0 and N-8 with Ctx=16384, loss=1.65. This model appears to chat better.
+- rwkv-9Q-16k-step6-0-4.pth: Using rwkv-9Q-4k-stp248.pth I added N-0 and N-8 with Ctx=16384, loss=1.65. This model appears to chat better.
+- rwkv-9Q-step607-N8-3.pth: Using rwkv-9Q-16k-step6-0-4.pth I added 100 Gtokens of N8-3.
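
Several README entries above state both a step count and a token count, so the average tokens per training step can be checked with simple arithmetic. The following is a minimal sketch, assuming "Gtokens" means 10^9 tokens; the function name `tokens_per_step` and the `ENTRIES` list are hypothetical helpers for illustration and are not part of the repository.

```python
# Figures copied from the README entries: (checkpoint, added steps, added Gtokens).
# Assumption: 1 Gtoken = 1e9 tokens.
ENTRIES = [
    ("rwkv-9Q-Final-N8-1k.pth", 2569, 106.0),
    ("rwkv-9Q-1k-stp706-N8-0.pth", 706, 29.13),
    ("rwkv-9Q-4k-stp248.pth", 2048, 40.66),
]


def tokens_per_step(steps: int, gtokens: float) -> float:
    """Return the average number of tokens consumed per training step."""
    return gtokens * 1e9 / steps


if __name__ == "__main__":
    for name, steps, gtokens in ENTRIES:
        millions = tokens_per_step(steps, gtokens) / 1e6
        print(f"{name}: {millions:.2f} M tokens/step")
```

Comparing the per-step figures across entries is a quick way to spot a step count or token count that may have been mistyped in the README.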