jploski commited on
Commit
0be1fc0
1 Parent(s): 741346b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -14,14 +14,14 @@ This model was trained from scratch on "tinyshakespeare" text file.
14
 
15
  ## Model description
16
 
17
- A tiny model similar to jploski/falcon-mini-shakespeare, to demonstrate training and recurrent inference using a retention network (https://arxiv.org/pdf/2307.08621.pdf).
18
- The code utilizes Sehyun Choi's implementation of retention network (https://github.com/syncdoth/RetNet) with configuration parameters changed to make it a very tiny model.
19
 
20
  - **License:** Apache 2.0.
21
 
22
  ## Intended uses & limitations
23
 
24
- Intended to demonstrate training and (recurrent O(1)) inference using a retention network
25
 
26
  ## Training and evaluation data
27
 
 
14
 
15
  ## Model description
16
 
17
+ A tiny model similar to jploski/falcon-mini-shakespeare, to demonstrate training and recurrent inference using a retentive network (https://arxiv.org/pdf/2307.08621.pdf).
18
+ The code utilizes Sehyun Choi's implementation of retentive network (https://github.com/syncdoth/RetNet) with configuration parameters changed to make it a very tiny model.
19
 
20
  - **License:** Apache 2.0.
21
 
22
  ## Intended uses & limitations
23
 
24
+ Intended to demonstrate training and (recurrent O(1)) inference using a retentive network
25
 
26
  ## Training and evaluation data
27