cmarkea
/

bloomz-3b-reranking

Sentence Similarity

text-classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Cyrile commited on Apr 11, 2024

Commit

51e0e3f

·

verified ·

1 Parent(s): dc73419

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ Developed to be language-agnostic, this model supports both French and English.
 influenced by its behavior in a monolingual context (English or French).
 ## Dataset
-The training dataset is composed of the [mMARCO dataset](https://huggingface.co/datasets/unicamp-dl/mmarco), consisting of query/positive/hard negative triplets.
 Additionally, we have included [SQuAD](https://huggingface.co/datasets/rajpurkar/squad) data from the "train" split, forming query/positive/hard negative triplets. In
 order to generate hard negative data for SQuAD, we considered contexts from the same theme as the query but from a different set of queries. Hence, the negative
 observations belong to the same themes as the queries but presumably do not contain the answer to the question.

 influenced by its behavior in a monolingual context (English or French).
 ## Dataset
+The training dataset is composed of the [mMARCO](https://huggingface.co/datasets/unicamp-dl/mmarco) dataset, consisting of query/positive/hard negative triplets.
 Additionally, we have included [SQuAD](https://huggingface.co/datasets/rajpurkar/squad) data from the "train" split, forming query/positive/hard negative triplets. In
 order to generate hard negative data for SQuAD, we considered contexts from the same theme as the query but from a different set of queries. Hence, the negative
 observations belong to the same themes as the queries but presumably do not contain the answer to the question.