Commit
·
f920e6e
1
Parent(s):
ffb59f0
Update README.md
Browse files
README.md
CHANGED
@@ -10,6 +10,8 @@ pipeline_tag: text-classification
|
|
10 |
# librarian-bots/is_new_dataset_student_model
|
11 |
|
12 |
This is a [SetFit model](https://github.com/huggingface/setfit) that can be used for text classification. The model is trained to predict whether a title + abstract for a paper on arXiv introduces a new dataset.
|
|
|
|
|
13 |
|
14 |
## Usage
|
15 |
|
@@ -42,7 +44,7 @@ You probably want to use the same format when running inference for this model.
|
|
42 |
|
43 |
## BibTeX entry and citation info
|
44 |
|
45 |
-
To cite the SetFit approach used to train this model please use this citation:
|
46 |
|
47 |
```bibtex
|
48 |
@article{https://doi.org/10.48550/arxiv.2209.11055,
|
|
|
10 |
# librarian-bots/is_new_dataset_student_model
|
11 |
|
12 |
This is a [SetFit model](https://github.com/huggingface/setfit) that can be used for text classification. The model is trained to predict whether a title + abstract for a paper on arXiv introduces a new dataset.
|
13 |
+
The model was trained on Arxiv papers returned from the search `dataset`. The model, therefore, aims to disambiguate papers about datasets vs papers which introduce a new dataset.
|
14 |
+
This model was trained through distillation training using a larger model [`librarian-bots/is_new_dataset_teacher_model`](https://huggingface.co/librarian-bots/is_new_dataset_teacher_model).
|
15 |
|
16 |
## Usage
|
17 |
|
|
|
44 |
|
45 |
## BibTeX entry and citation info
|
46 |
|
47 |
+
To cite the SetFit approach used to train this model, please use this citation:
|
48 |
|
49 |
```bibtex
|
50 |
@article{https://doi.org/10.48550/arxiv.2209.11055,
|