Update README.md
README.md
CHANGED
@@ -81,4 +81,18 @@ The pre-training and fine-tuning were conducted on 512 NVIDIA Ampere (64GB) GPUs
 |Multi-layer loss | yes |
 
 ## Licence
-The model is licensed under the BigCode OpenRAIL-M v1 license agreement. You can find the full agreement [here](https://huggingface.co/spaces/bigcode/bigcode-model-license-agreement).
+The model is licensed under the BigCode OpenRAIL-M v1 license agreement. You can find the full agreement [here](https://huggingface.co/spaces/bigcode/bigcode-model-license-agreement).
+
+
+# Citation
+```
+@article{gurioli2025modeltrainallhierarchical,
+  title={One Model to Train them All: Hierarchical Self-Distillation for Enhanced Early Layer Embeddings},
+  author={Andrea Gurioli and Federico Pennino and João Monteiro and Maurizio Gabbrielli},
+  year={2025},
+  eprint={2503.03008},
+  archivePrefix={arXiv},
+  primaryClass={cs.CL},
+  url={https://arxiv.org/abs/2503.03008},
+}
+```