lightonai/modernbert-embed-large · Add new SentenceTransformer model

Add new SentenceTransformer model (8b4c685c)

Jan 13

Hello!

This pull request was automatically generated by the push_to_hub method of the Sentence Transformers library.

Full Model Architecture:

SentenceTransformer(
  (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: ModernBertModel 
  (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
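For readers unfamiliar with the last two modules, here is a minimal pure-Python sketch of what mean pooling (pooling_mode_mean_tokens) followed by L2 normalization computes. It is an illustration only, not the library's implementation: the helper names mean_pool and l2_normalize and the toy 2-dimensional token vectors are assumptions made for this example (the real model produces 1024-dimensional vectors).

```python
import math

def mean_pool(token_embeddings, attention_mask):
    """Average the token vectors whose mask entry is 1; padding is ignored."""
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vector):
                total[i] += value
    # Guard against an all-padding input to avoid dividing by zero.
    return [value / max(count, 1) for value in total]

def l2_normalize(vector):
    """Scale the vector to unit Euclidean length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(v * v for v in vector))
    return [v / norm for v in vector] if norm > 0 else list(vector)

# Hypothetical toy input: three token vectors, the last one being padding.
tokens = [[1.0, 2.0], [5.0, 6.0], [9.0, 9.0]]
mask = [1, 1, 0]

pooled = mean_pool(tokens, mask)      # mean of the two unmasked tokens
embedding = l2_normalize(pooled)      # unit-length sentence embedding
print(pooled, embedding)
```

Because the final embedding has unit length, a dot product between two such embeddings equals their cosine similarity.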

Tip:

Consider testing this pull request before merging by loading the model from this PR with the revision argument:

from sentence_transformers import SentenceTransformer

# TODO: Fill in the PR number
pr_number = 2
model = SentenceTransformer(
    "lightonai/modernbert-embed-large",
    revision=f"refs/pr/{pr_number}",
    backend="torch",
)

# Verify that everything works as expected
embeddings = model.encode(["The weather is lovely today.", "It's so sunny outside!", "He drove to the stadium."])
print(embeddings.shape)

similarities = model.similarity(embeddings, embeddings)
print(similarities)
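To make the similarity output easier to sanity-check: unless the model is configured otherwise, model.similarity should compute pairwise cosine similarity, so with three input sentences the result is a 3 x 3 matrix with ones on the diagonal. The following pure-Python sketch shows the same computation on made-up vectors. The function name cosine_similarity_matrix and the toy data are assumptions for illustration, not part of the Sentence Transformers API.

```python
import math

def cosine_similarity_matrix(a, b):
    """Return the matrix of cosine similarities between rows of a and rows of b."""
    def norm(v):
        return math.sqrt(sum(x * x for x in v))

    def dot(u, v):
        return sum(x * y for x, y in zip(u, v))

    return [[dot(u, v) / (norm(u) * norm(v)) for v in b] for u in a]

# Hypothetical toy "embeddings" standing in for the three encoded sentences.
vectors = [[1.0, 0.0], [0.0, 1.0], [3.0, 4.0]]

sims = cosine_similarity_matrix(vectors, vectors)
for row in sims:
    print(row)
```

When checking the real output, the diagonal entries should be close to 1 and the two weather sentences would be expected to score higher with each other than with the stadium sentence.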

Undo README changes (ad207b7b)

Update README outputs + dim (768 -> 1024) (96fbb87a)

NohTow

LightOn AI org Jan 14

Thanks for the PR! I forgot to add those files (and thanks for updating the snippets as well!)

NohTow changed pull request status to merged Jan 14