pkshatech
/

m-ST5

yano0 commited on Jun 26, 2023

Commit

a5bf60d

1 Parent(s): 52bb5f0

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,5 +1,7 @@
 ---
 library_name: peft
 ---
 These are LoRA adaption weights for [mT5](https://huggingface.co/google/mt5-xxl) encoder.
@@ -7,6 +9,9 @@ These are LoRA adaption weights for [mT5](https://huggingface.co/google/mt5-xxl)
 This model is a multilingual extension of Sentence T5 and was created using the [mT5](https://huggingface.co/google/mt5-xxl) encoder. It is proposed in this [paper](hoge).
 It is an encoder for sentence embedding, and its performance has been verified in cross-lingual STS and sentence retrieval.
 ### Framework versions
@@ -44,4 +49,4 @@ last_hidden_state = outputs.last_hidden_state
 last_hidden_state[inputs.attention_mask == 0, :] = 0
 sent_len = inputs.attention_mask.sum(dim=1, keepdim=True)
 sent_emb = last_hidden_state.sum(dim=1) / sent_len
-```

 ---
 library_name: peft
+datasets:
+- xnli
 ---
 These are LoRA adaption weights for [mT5](https://huggingface.co/google/mt5-xxl) encoder.
 This model is a multilingual extension of Sentence T5 and was created using the [mT5](https://huggingface.co/google/mt5-xxl) encoder. It is proposed in this [paper](hoge).
 It is an encoder for sentence embedding, and its performance has been verified in cross-lingual STS and sentence retrieval.
+### Traning Data
+The model was trained on the XNLI dataset.
 ### Framework versions
 last_hidden_state[inputs.attention_mask == 0, :] = 0
 sent_len = inputs.attention_mask.sum(dim=1, keepdim=True)
 sent_emb = last_hidden_state.sum(dim=1) / sent_len
+```