tomaarsen HF staff commited on
Commit
94273b1
·
verified ·
1 Parent(s): c5094d0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -4,7 +4,7 @@ tags:
4
  - cross-encoder
5
  - text-classification
6
  - generated_from_trainer
7
- - dataset_size:79561408
8
  - loss:MSELoss
9
  base_model: microsoft/MiniLM-L12-H384-uncased
10
  datasets:
@@ -154,7 +154,7 @@ You can finetune this model on your own dataset.
154
  #### ms-marco-shuffled
155
 
156
  * Dataset: [ms-marco-shuffled](https://huggingface.co/datasets/tomaarsen/ms-marco-shuffled) at [0e80192](https://huggingface.co/datasets/tomaarsen/ms-marco-shuffled/tree/0e8019214fbbb17845d8fa1e4594882944716633)
157
- * Size: 79,561,408 training samples
158
  * Columns: <code>score</code>, <code>query</code>, and <code>passage</code>
159
  * Approximate statistics based on the first 1000 samples:
160
  | | score | query | passage |
@@ -174,7 +174,7 @@ You can finetune this model on your own dataset.
174
  #### ms-marco-shuffled
175
 
176
  * Dataset: [ms-marco-shuffled](https://huggingface.co/datasets/tomaarsen/ms-marco-shuffled) at [0e80192](https://huggingface.co/datasets/tomaarsen/ms-marco-shuffled/tree/0e8019214fbbb17845d8fa1e4594882944716633)
177
- * Size: 79,561,408 evaluation samples
178
  * Columns: <code>score</code>, <code>query</code>, and <code>passage</code>
179
  * Approximate statistics based on the first 1000 samples:
180
  | | score | query | passage |
 
4
  - cross-encoder
5
  - text-classification
6
  - generated_from_trainer
7
+ - dataset_size:1990000
8
  - loss:MSELoss
9
  base_model: microsoft/MiniLM-L12-H384-uncased
10
  datasets:
 
154
  #### ms-marco-shuffled
155
 
156
  * Dataset: [ms-marco-shuffled](https://huggingface.co/datasets/tomaarsen/ms-marco-shuffled) at [0e80192](https://huggingface.co/datasets/tomaarsen/ms-marco-shuffled/tree/0e8019214fbbb17845d8fa1e4594882944716633)
157
+ * Size: 1,990,000 training samples
158
  * Columns: <code>score</code>, <code>query</code>, and <code>passage</code>
159
  * Approximate statistics based on the first 1000 samples:
160
  | | score | query | passage |
 
174
  #### ms-marco-shuffled
175
 
176
  * Dataset: [ms-marco-shuffled](https://huggingface.co/datasets/tomaarsen/ms-marco-shuffled) at [0e80192](https://huggingface.co/datasets/tomaarsen/ms-marco-shuffled/tree/0e8019214fbbb17845d8fa1e4594882944716633)
177
+ * Size: 10,000 evaluation samples
178
  * Columns: <code>score</code>, <code>query</code>, and <code>passage</code>
179
  * Approximate statistics based on the first 1000 samples:
180
  | | score | query | passage |