Update README.md
README.md
CHANGED
@@ -51,9 +51,9 @@ Finally, we have 28,121,693 sentences for the training process.
 This pretraining data will not be opened to public due to Twitter policy.
 
 ## Model
-| Model name |
+| Model name | Base model | Size of training data | Size of validation data |
 |-------------------------------------------|-----------------|----------------------------|-------------------------|
-| `indojave-codemixed-indobertweet-base` |
+| `indojave-codemixed-indobertweet-base` | IndoBERTweet | 2.24 GB of text | 249 MB of text |
 
 ## Evaluation Results
 We train the data with 3 epochs and total steps of 296K for 4 days.