Model Description

This repo contains ONNX exports of the multilingual CLIP model M-CLIP/XLM-Roberta-Large-Vit-B-32. The visual and textual encoders are exported as separate models, so image embeddings and text embeddings can be generated independently.
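As a rough illustration of how the two kinds of embeddings are typically used together, the sketch below ranks images against a text query by cosine similarity, the usual comparison for CLIP-style models. It does not load the ONNX files: the file layout and input names of the exports are not described here, so the embeddings are small made-up vectors, and the function names (`l2_normalize`, `cosine_similarity`, `rank_images`) are hypothetical helpers rather than part of this repo or of Immich. Real embeddings would come from running each encoder (for example with ONNX Runtime) and would have far more dimensions.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity: dot product of the two unit-length vectors."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def rank_images(text_embedding, image_embeddings):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [
        (name, cosine_similarity(text_embedding, emb))
        for name, emb in image_embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy 3-dimensional stand-ins for encoder outputs (assumed values).
text_embedding = [0.9, 0.1, 0.0]          # would come from the textual encoder
image_embeddings = {                      # would come from the visual encoder
    "beach.jpg": [1.0, 0.0, 0.0],
    "forest.jpg": [0.0, 1.0, 0.0],
    "city.jpg": [0.0, 0.0, 1.0],
}

ranking = rank_images(text_embedding, image_embeddings)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Because the text and image encoders map into the same embedding space, image embeddings can be computed once and stored, and only the text query needs to be encoded at search time.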

This repo is specifically intended for use with Immich, a self-hosted photo library.

immich-app/XLM-Roberta-Large-Vit-B-32 is part of the Multilingual CLIP collection (4 items, updated Sep 5, 2024): models that can be used for multilingual Smart Search in Immich, sorted by average recall@k in descending order. The best model will vary by language.
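
For readers unfamiliar with the recall@k metric mentioned above, here is a minimal sketch of one common definition for retrieval: the fraction of queries whose correct item appears among the top k results. It assumes exactly one relevant item per query; the benchmark and exact protocol behind the collection's ordering are not stated here, and the function name `recall_at_k` and the toy data are hypothetical.

```python
def recall_at_k(ranked_results, relevant, k):
    """Fraction of queries whose relevant item is in the top-k results.

    ranked_results: dict mapping query -> list of items, best match first
    relevant:       dict mapping query -> the single correct item (assumption)
    """
    if not ranked_results:
        raise ValueError("need at least one query")
    hits = sum(
        1 for query, ranking in ranked_results.items()
        if relevant[query] in ranking[:k]
    )
    return hits / len(ranked_results)


# Toy retrieval results for four queries (made-up data).
ranked_results = {
    "q1": ["a", "b", "c"],
    "q2": ["b", "c", "a"],
    "q3": ["c", "a", "b"],
    "q4": ["b", "a", "c"],
}
relevant = {"q1": "a", "q2": "a", "q3": "a", "q4": "a"}

for k in (1, 2, 3):
    print(f"recall@{k} = {recall_at_k(ranked_results, relevant, k):.2f}")
```

Averaging such scores over several values of k, or over several languages, gives a single number per model, which is one plausible reading of "average recall@k".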