MinzaKhan commited on
Commit
f5d2d9e
1 Parent(s): 4afae7b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -0
README.md CHANGED
@@ -15,3 +15,20 @@ The limitations of this model are that it can only generate text in the style of
15
 
16
  I created my own dataset to train this model. I chose 14 novels written by H G Wells for my dataset. Most of the novels in the dataset are of the genre science fiction. The dataset contains more than 1 million tokens.
17
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
15
 
16
  I created my own dataset to train this model. I chose 14 novels written by H G Wells for my dataset. Most of the novels in the dataset are of the genre science fiction. The dataset contains more than 1 million tokens.
17
 
18
+ The texts included in the corpus are novels written by H G Wells. The novels in the corpus are:
19
+ The Time Machine
20
+ In the Days of the Comet
21
+ The Food of the Gods
22
+ Tales of Space and Time
23
+ The World Set Free
24
+ The War of the Worlds
25
+ The First Men in the Moon
26
+ The Invisible Man
27
+ The Island of Doctor Moreau
28
+ The Sleeper Awakes
29
+ The War in the Air
30
+ The Research Magnificient
31
+ The Udying Fire
32
+ The Red Room
33
+
34
+ The total number of tokens in the corpus is 1043588