CLIP ViT-L/14 finetune: SAE-informed adversarial training

  • Interesting things to try for adversarial robustness: right-click and download the individual images: Image 1 -- Image 2 -- Image 3
  • Upload each one into the zero-shot classification widget [hopefully available soon, on the right here ->]
  • Try labels (class names): a photo of a cat, a photo of a dog, a photo of a text
  • Repeat the same with e.g. my GmP models and see what happens. =)
  • I'm really hoping the conversion to HF-format .safetensors didn't mess anything up (it happens!). Just in case it did, or if there's no inference API available to use:
  • I put a script that does the same thing (with the non-converted model) in my GitHub repo. You can also reproduce the fine-tune yourself, as that code is available too! 🤗
  • 👉 All training info & code: github.com/zer0int/CLIP-SAE-finetune
  • Buy me a coffee
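
If the hosted widget is unavailable, the zero-shot test from the list above can be approximated locally. This is a minimal sketch, not the author's script from the GitHub repo. It assumes the converted checkpoint loads with the standard `CLIPModel` and `CLIPProcessor` classes from `transformers`, and it uses the repo id `zer0int/CLIP-SAE-ViT-L-14` as shown on this model page. The helper names `rank`, `load_clip`, and `classify` are hypothetical, and the image paths are whatever files you downloaded.

```python
import sys

# Assumption: repo id taken from this model page; swap in another CLIP
# checkpoint (e.g. a GmP model) to compare results.
MODEL_ID = "zer0int/CLIP-SAE-ViT-L-14"
# Labels (class names) suggested in the list above.
LABELS = ["a photo of a cat", "a photo of a dog", "a photo of a text"]


def rank(scores):
    """Sort a {label: probability} mapping from most to least likely."""
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


def load_clip(model_id=MODEL_ID):
    """Load model and processor; assumes a standard HF CLIP checkpoint."""
    # Imported here so the helpers above work without transformers installed.
    from transformers import CLIPModel, CLIPProcessor

    model = CLIPModel.from_pretrained(model_id)
    processor = CLIPProcessor.from_pretrained(model_id)
    return model, processor


def classify(image_path, model, processor, labels=LABELS):
    """Return {label: probability} for one image via zero-shot matching."""
    import torch
    from PIL import Image

    image = Image.open(image_path).convert("RGB")
    inputs = processor(text=labels, images=image,
                       return_tensors="pt", padding=True)
    with torch.no_grad():
        # logits_per_image has shape (1, len(labels)): image-text similarity.
        logits = model(**inputs).logits_per_image
    probs = logits.softmax(dim=-1)[0].tolist()
    return dict(zip(labels, probs))


if __name__ == "__main__":
    paths = sys.argv[1:]  # the downloaded image files
    if paths:
        model, processor = load_clip()
        for path in paths:
            print(path)
            for label, prob in rank(classify(path, model, processor)):
                print(f"  {prob:6.2%}  {label}")
```

Saved under a name of your choice, it would be run with the downloaded images as arguments. Passing a different `model_id` to `load_clip` repeats the comparison with another model.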

Model size: 428M params (Safetensors), tensor type: F32

Model tree for zer0int/CLIP-SAE-ViT-L-14: this model is a finetune; 1 model has been finetuned from it.