Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,8 @@ library_name: fasttext
|
|
11 |
---
|
12 |
# maths-fasttext-classifier
|
13 |
|
14 |
-
[Dataset](https://huggingface.co/datasets/kenhktsui/math-classifiers-data)
|
|
|
15 |
This is part of my [fasttext classifier collection](https://huggingface.co/collections/kenhktsui/fasttext-model-for-pretraining-data-curation-67220374c8acb97a1839553c) for curating pretraining dataset.
|
16 |
This classifier classifies a text into Maths or Others.
|
17 |
The model is trained over 1.6M records, which is a 50:50 mix of maths and non maths in website and achieved a test F1 score of 0.99 (too good to be true?). It is an intended upsampling of maths data.
|
|
|
11 |
---
|
12 |
# maths-fasttext-classifier
|
13 |
|
14 |
+
[Dataset](https://huggingface.co/datasets/kenhktsui/math-classifiers-data)
|
15 |
+
|
16 |
This is part of my [fasttext classifier collection](https://huggingface.co/collections/kenhktsui/fasttext-model-for-pretraining-data-curation-67220374c8acb97a1839553c) for curating pretraining dataset.
|
17 |
This classifier classifies a text into Maths or Others.
|
18 |
The model is trained over 1.6M records, which is a 50:50 mix of maths and non maths in website and achieved a test F1 score of 0.99 (too good to be true?). It is an intended upsampling of maths data.
|