Add knowledge to your txtai agents and processes.
NeuML
company
AI & ML interests
Applying machine learning to solve everyday problems
Recent Activity
Datasets with medical and scientific literature.
Models distilled with Model2Vec - 100K / 500K / 1M / 2M / 8M parameter variants.
-
NeuML/pubmedbert-base-embeddings-8M
Sentence Similarity • Updated • 816 • 8 -
NeuML/pubmedbert-base-embeddings-2M
Sentence Similarity • Updated • 87 • 3 -
NeuML/pubmedbert-base-embeddings-1M
Sentence Similarity • Updated • 35 • 2 -
NeuML/pubmedbert-base-embeddings-500K
Sentence Similarity • Updated • 25 • 2
StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText
Embeddings indexes and datasets for Wikipedia data.
Models for working with medical and scientific literature.
-
NeuML/pubmedbert-base-embeddings
Sentence Similarity • 0.1B • Updated • 155k • • 142 -
NeuML/pubmedbert-base-embeddings-matryoshka
Sentence Similarity • 0.1B • Updated • 6.71k • 22 -
NeuML/pubmedbert-base-embeddings-8M
Sentence Similarity • Updated • 816 • 8 -
NeuML/pubmedbert-base-splade
Sentence Similarity • 0.1B • Updated • 619 • 4
Text to Speech (TTS) models compatible with txtai's TextToSpeech pipeline.
Legacy word vectors (FastText, GloVe, Word2Vec) stored in the StaticVectors format
Add knowledge to your txtai agents and processes.
Embeddings indexes and datasets for Wikipedia data.
Datasets with medical and scientific literature.
Models for working with medical and scientific literature.
-
NeuML/pubmedbert-base-embeddings
Sentence Similarity • 0.1B • Updated • 155k • • 142 -
NeuML/pubmedbert-base-embeddings-matryoshka
Sentence Similarity • 0.1B • Updated • 6.71k • 22 -
NeuML/pubmedbert-base-embeddings-8M
Sentence Similarity • Updated • 816 • 8 -
NeuML/pubmedbert-base-splade
Sentence Similarity • 0.1B • Updated • 619 • 4
Models distilled with Model2Vec - 100K / 500K / 1M / 2M / 8M parameter variants.
-
NeuML/pubmedbert-base-embeddings-8M
Sentence Similarity • Updated • 816 • 8 -
NeuML/pubmedbert-base-embeddings-2M
Sentence Similarity • Updated • 87 • 3 -
NeuML/pubmedbert-base-embeddings-1M
Sentence Similarity • Updated • 35 • 2 -
NeuML/pubmedbert-base-embeddings-500K
Sentence Similarity • Updated • 25 • 2
Text to Speech (TTS) models compatible with txtai's TextToSpeech pipeline.
StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText
Legacy word vectors (FastText, GloVe, Word2Vec) stored in the StaticVectors format