Update README.md
Browse files
README.md
CHANGED
@@ -14,14 +14,14 @@ This model was trained from scratch on "tinyshakespeare" text file.
|
|
14 |
|
15 |
## Model description
|
16 |
|
17 |
-
A tiny model similar to jploski/falcon-mini-shakespeare, to demonstrate training and recurrent inference using a
|
18 |
-
The code utilizes Sehyun Choi's implementation of
|
19 |
|
20 |
- **License:** Apache 2.0.
|
21 |
|
22 |
## Intended uses & limitations
|
23 |
|
24 |
-
Intended to demonstrate training and (recurrent O(1)) inference using a
|
25 |
|
26 |
## Training and evaluation data
|
27 |
|
|
|
14 |
|
15 |
## Model description
|
16 |
|
17 |
+
A tiny model similar to jploski/falcon-mini-shakespeare, to demonstrate training and recurrent inference using a retentive network (https://arxiv.org/pdf/2307.08621.pdf).
|
18 |
+
The code utilizes Sehyun Choi's implementation of retentive network (https://github.com/syncdoth/RetNet) with configuration parameters changed to make it a very tiny model.
|
19 |
|
20 |
- **License:** Apache 2.0.
|
21 |
|
22 |
## Intended uses & limitations
|
23 |
|
24 |
+
Intended to demonstrate training and (recurrent O(1)) inference using a retentive network
|
25 |
|
26 |
## Training and evaluation data
|
27 |
|