Update README.md
README.md CHANGED
@@ -8,7 +8,7 @@ pipeline_tag: sentence-similarity
These are LoRA adaptation weights for the [mT5](https://huggingface.co/google/mt5-xxl) encoder.

## Multilingual Sentence T5
-This model is a multilingual extension of Sentence T5 and was created using the [mT5](https://huggingface.co/google/mt5-xxl) encoder. It is proposed in this [paper](
+This model is a multilingual extension of Sentence T5 and was created using the [mT5](https://huggingface.co/google/mt5-xxl) encoder. It is proposed in this [paper](https://arxiv.org/abs/2403.17528).
It is an encoder for sentence embeddings, and its performance has been verified on cross-lingual STS and sentence retrieval.

### Training Data

@@ -51,4 +51,12 @@ last_hidden_state = outputs.last_hidden_state
last_hidden_state[inputs.attention_mask == 0, :] = 0
sent_len = inputs.attention_mask.sum(dim=1, keepdim=True)
sent_emb = last_hidden_state.sum(dim=1) / sent_len
-```
+```
+
+## Benchmarks
+Please check the paper for details.
+
+|       | Tatoeba-14 | Tatoeba-36 | BUCC | XSTS (ar-ar) | XSTS (ar-en) | XSTS (es-es) | XSTS (es-en) | XSTS (tr-en) |
+| ----- | :--------: | :--------: | :--: | :----------: | :----------: | :----------: | :----------: | :----------: |
+| m-ST5 | 96.3 | 94.7 | 97.6 | 76.2 | 78.6 | 84.4 | 76.2 | 75.1 |
+| LaBSE | 95.3 | 95.0 | 93.5 | 69.1 | 74.5 | 80.8 | 65.5 | 72.0 |
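For readers of this diff, the sketch below shows one way the pieces above could fit together: loading the mT5 encoder, attaching these LoRA adapter weights, and mean-pooling the encoder output as in the snippet touched by the second hunk. It is a minimal, untested example under stated assumptions; the adapter path, the sample sentences, and the choice of PEFT's `PeftModel` are illustrative placeholders, not instructions from this README.

```python
# Minimal sketch (assumptions noted in comments) -- not the card's official usage example.
import torch
from transformers import AutoTokenizer, MT5EncoderModel
from peft import PeftModel

tokenizer = AutoTokenizer.from_pretrained("google/mt5-xxl")
base_model = MT5EncoderModel.from_pretrained("google/mt5-xxl")

# Placeholder path: point this at the LoRA adapter weights from this repository.
model = PeftModel.from_pretrained(base_model, "path/to/m-st5-lora-adapter")
model.eval()

sentences = ["I like apples.", "Me gustan las manzanas."]  # illustrative cross-lingual pair
inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")

with torch.no_grad():
    last_hidden_state = model(**inputs).last_hidden_state

# Mean pooling over non-padding tokens, mirroring the snippet in the diff above.
last_hidden_state[inputs.attention_mask == 0, :] = 0
sent_len = inputs.attention_mask.sum(dim=1, keepdim=True)
sent_emb = last_hidden_state.sum(dim=1) / sent_len

# Cosine similarity between the two sentence embeddings, as used for cross-lingual STS.
similarity = torch.nn.functional.cosine_similarity(sent_emb[0], sent_emb[1], dim=0)
print(similarity.item())
```

Zeroing the padded positions before summing keeps padding tokens from diluting the average, which is why the snippet divides by the attention-mask sum rather than the padded sequence length.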