Model Description

This repo contains ONNX exports of the multilingual CLIP model M-CLIP/XLM-Roberta-Large-Vit-L-14. The visual and textual encoders are exported as separate models, so image embeddings and text embeddings can be generated independently.
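Because the two encoders produce embeddings of the same kind, a common way to use them is to compare a text embedding against image embeddings by cosine similarity. The sketch below is illustrative only: the function names `cosine_similarity` and `rank_images`, the file names, and the three-dimensional toy vectors are all assumptions standing in for real encoder outputs, and it does not load the ONNX models themselves.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_images(text_embedding, image_embeddings):
    """Sort (name, score) pairs from most to least similar to the text."""
    scored = [
        (name, cosine_similarity(text_embedding, emb))
        for name, emb in image_embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy placeholder vectors (hypothetical); real embeddings would come from
# running the textual and visual encoders on a query and on images.
text_embedding = [0.9, 0.1, 0.0]
image_embeddings = {
    "beach.jpg": [0.8, 0.2, 0.1],
    "invoice.png": [0.0, 0.1, 0.9],
}

ranking = rank_images(text_embedding, image_embeddings)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In practice the embeddings would be precomputed for every image and only the text query would be encoded at search time, which is one reason separate encoder models are convenient.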

This repo is specifically intended for use with Immich, a self-hosted photo library.
