Model Description

This repo contains ONNX exports for the corresponding ViT-based multilingual CLIP model by OpenCLIP. See the OpenCLIP repo for more info. Visual and textual encoders are separated into separate models for the purpose of generating image and text embeddings.

This repo is specifically intended for use with Immich, a self-hosted photo library.

Downloads last month: 55

Inference Providers NEW

This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model’s pipeline type.