jploski/retnet-mini-shakespeare

Text Generation

Generated from Trainer


jploski committed on Aug 5, 2023

Commit 0be1fc0 · 1 Parent(s): 741346b

Update README.md

Files changed (1)

README.md +3 -3


@@ -14,14 +14,14 @@ This model was trained from scratch on "tinyshakespeare" text file.
 ## Model description
-A tiny model similar to jploski/falcon-mini-shakespeare, to demonstrate training and recurrent inference using a retention network (https://arxiv.org/pdf/2307.08621.pdf).
-The code utilizes Sehyun Choi's implementation of retention network (https://github.com/syncdoth/RetNet) with configuration parameters changed to make it a very tiny model.
+A tiny model similar to jploski/falcon-mini-shakespeare, to demonstrate training and recurrent inference using a retentive network (https://arxiv.org/pdf/2307.08621.pdf).
+The code utilizes Sehyun Choi's implementation of retentive network (https://github.com/syncdoth/RetNet) with configuration parameters changed to make it a very tiny model.
 - **License:** Apache 2.0.
 ## Intended uses & limitations
-Intended to demonstrate training and (recurrent O(1)) inference using a retention network
+Intended to demonstrate training and (recurrent O(1)) inference using a retentive network
 ## Training and evaluation data
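
Note on the "(recurrent O(1)) inference" wording in the changed lines: the sketch below shows, under simplifying assumptions, why a retentive network can generate each token with a cost that does not grow with the sequence length. It is not code from this repository or from syncdoth/RetNet. The function names (`retention_parallel`, `retention_recurrent`), the single decay value `gamma`, and the toy inputs are all hypothetical, and the sketch leaves out the position rotation, multi-scale decay, normalization, and gating that a full retention layer uses. It only compares the parallel form, which revisits all earlier tokens, with the recurrent form, which keeps one fixed-size state matrix.

```python
# Minimal single-head retention sketch in pure Python (no dependencies).
# Assumption: one head, one scalar decay `gamma`, no rotation/normalization/gating.
# All names below are illustrative and are not taken from syncdoth/RetNet.

def retention_parallel(q, k, v, gamma):
    """o_n = sum over m <= n of gamma^(n-m) * (q_n . k_m) * v_m."""
    d_v = len(v[0])
    out = []
    for n in range(len(q)):
        o = [0.0] * d_v
        for m in range(n + 1):  # work grows with the position n
            score = gamma ** (n - m) * sum(a * b for a, b in zip(q[n], k[m]))
            for j in range(d_v):
                o[j] += score * v[m][j]
        out.append(o)
    return out


def retention_recurrent(q, k, v, gamma):
    """Same outputs, carrying a fixed-size d_k x d_v state between tokens."""
    d_k, d_v = len(k[0]), len(v[0])
    state = [[0.0] * d_v for _ in range(d_k)]
    out = []
    for q_n, k_n, v_n in zip(q, k, v):
        # S_n = gamma * S_{n-1} + k_n^T v_n  (constant work per token)
        for i in range(d_k):
            for j in range(d_v):
                state[i][j] = gamma * state[i][j] + k_n[i] * v_n[j]
        # o_n = q_n S_n
        out.append([sum(q_n[i] * state[i][j] for i in range(d_k))
                    for j in range(d_v)])
    return out


# Toy inputs: 3 tokens, d_k = d_v = 2 (made-up values for illustration).
q = [[1.0, 0.0], [0.5, 0.5], [0.0, 2.0]]
k = [[1.0, 1.0], [0.0, 1.0], [2.0, 0.0]]
v = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

par = retention_parallel(q, k, v, gamma=0.9)
rec = retention_recurrent(q, k, v, gamma=0.9)

# Largest element-wise difference between the two forms.
max_diff = max(abs(a - b) for row_p, row_r in zip(par, rec)
               for a, b in zip(row_p, row_r))
print("max difference between parallel and recurrent outputs:", max_diff)
```

The two functions agree up to floating-point rounding. The recurrent version only ever touches the `d_k x d_v` state, so its per-token time and memory are independent of how many tokens came before, which is what "O(1)" refers to here.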