Model Description

This repo contains ONNX exports for the multilingual CLIP model M-CLIP/XLM-Roberta-Large-Vit-B-32. It separates the visual and textual encoders into separate models for the purpose of generating image and text embeddings.

This repo is specifically intended for use with Immich, a self-hosted photo library.

Downloads last month
202
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model’s pipeline type.

Collection including immich-app/XLM-Roberta-Large-Vit-B-32