Update README.md
Browse files
README.md
CHANGED
@@ -44,6 +44,7 @@ In the second stage pre-processing, we do the following pre-processing tasks:
|
|
44 |
- remove sentences with token length less than 4,
|
45 |
- convert ‘@username’ to ‘@USER’,
|
46 |
- convert URL to HTTPURL.
|
|
|
47 |
Finally, we have 28,121,693 sentences for our pre-training task.
|
48 |
|
49 |
## Model
|
|
|
44 |
- remove sentences with token length less than 4,
|
45 |
- convert ‘@username’ to ‘@USER’,
|
46 |
- convert URL to HTTPURL.
|
47 |
+
|
48 |
Finally, we have 28,121,693 sentences for our pre-training task.
|
49 |
|
50 |
## Model
|