AdrienB134
commited on
Commit
•
884ff07
1
Parent(s):
318bd55
Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,7 @@ tags:
|
|
15 |
|
16 |
#### Details
|
17 |
|
18 |
-
The model is initialized from the [ricardoz/BERTugues-base-portuguese-cased](https://huggingface.co/ricardoz/BERTugues-base-portuguese-cased)
|
19 |
|
20 |
#### Data
|
21 |
|
|
|
15 |
|
16 |
#### Details
|
17 |
|
18 |
+
The model is initialized from the [ricardoz/BERTugues-base-portuguese-cased](https://huggingface.co/ricardoz/BERTugues-base-portuguese-cased) model and fine-tuned on 10M triples via pairwise softmax cross-entropy loss over the computed scores of the positive and negative passages associated to a query. It was trained on a single Tesla A100 GPU with 40GBs of memory during 200k steps with 10% of warmup steps using a batch size of 96 and the AdamW optimizer with a constant learning rate of 3e-06. Total training time was around 12 hours.
|
19 |
|
20 |
#### Data
|
21 |
|