BlinkDL commited on
Commit
36de6d1
·
1 Parent(s): 7719662

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -1
README.md CHANGED
@@ -24,6 +24,15 @@ ctx_len = 896
24
  n_layer = 24
25
  n_embd = 2048
26
 
27
- 20220708-1905.pth : Trained on the Pile for 68B tokens. Pile loss 2.148, LAMBADA ppl 8.41, acc 53.17%.
 
 
 
 
 
 
 
 
 
28
 
29
  (I am still training it)
 
24
  n_layer = 24
25
  n_embd = 2048
26
 
27
+ Preview checkpoint: RWKV-3-Pile-20220723-3542.pth : Trained on the Pile for 127B tokens.
28
+ * Pile loss 2.102
29
+ * LAMBADA ppl 7.52, acc 54.71%
30
+ * PIQA acc 71.11%
31
+ * SC2016 acc 67.24%
32
+ * Hellaswag acc_norm 50.45%
33
+
34
+ Preview checkpoint: 20220708-1905.pth : Trained on the Pile for 68B tokens.
35
+ * Pile loss 2.148
36
+ * LAMBADA ppl 8.41, acc 53.17%
37
 
38
  (I am still training it)