kevinkrahn committed
Commit: 83c5c5a
Parent(s): c11b0cc
Update README.md

README.md CHANGED
````diff
@@ -7,7 +7,11 @@ tags:
 - sentence-similarity
 - transformers
 - semantic-search
-
+- character-transformer
+- hierarchical-transformer
+language:
+- en
+- grc
 ---
 
 # shlm-grc-en
@@ -16,9 +20,9 @@ tags:
 
 This model creates sentence embeddings in a shared vector space for Ancient Greek and English text.
 
-The base model uses a modified version of the HLM architecture described in [Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers](https://aclanthology.org/2024.sigtyp-1.16/)
+The base model uses a modified version of the HLM architecture described in [Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers](https://aclanthology.org/2024.sigtyp-1.16/) ([arXiv](https://arxiv.org/abs/2405.20145))
 
-This model is trained to produce sentence embeddings using the multilingual knowledge distillation method and datasets described in [Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge Distillation](https://aclanthology.org/2023.alp-1.2/).
+This model is trained to produce sentence embeddings using the multilingual knowledge distillation method and datasets described in [Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge Distillation](https://aclanthology.org/2023.alp-1.2/) ([arXiv](https://arxiv.org/abs/2308.13116)).
 
 This model was distilled from `BAAI/bge-base-en-v1.5` for embedding English and Ancient Greek text.
 
@@ -78,6 +82,8 @@ print(sentence_embeddings)
 
 ## Citing & Authors
 
+If you use this model please cite the following papers:
+
 ```
 @inproceedings{riemenschneider-krahn-2024-heidelberg,
     title = "Heidelberg-Boston @ {SIGTYP} 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers",
````