BlinkDL
/

clip-guided-binary-autoencoder

Model card Files Files and versions Community

BlinkDL commited on Sep 27, 2022

Commit

784bfc9

·

1 Parent(s): 935cfa8

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -6,4 +6,8 @@ This model can encode 224x224 RGB image into 28x28x13bit (1274 bytes) latent. Th
 12M params for Encoder + Decoder. Trained on LAION-Aesthetics V2 5+ for 60M images.
 (still training. final checkpt will be better)

 12M params for Encoder + Decoder. Trained on LAION-Aesthetics V2 5+ for 60M images.
+Guided by https://huggingface.co/laion/CLIP-ViT-B-32-laion2B-s34B-b79K (it's great. better than OpenAI CLIP B/32) and https://github.com/dingkeyan93/DISTS.
+No GAN loss. So probably the image is slightly blurred in some cases?
 (still training. final checkpt will be better)