Masked Autoencoders Enable Efficient Knowledge Distillers Paper • 2208.12256 • Published Aug 25, 2022
Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification Paper • 2210.12843 • Published Oct 23, 2022
CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection Paper • 2301.00785 • Published Jan 2, 2023
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17 • 50
ViTamin Family Collection Designing Scalable Vision Models in the Vision-language Era. The best performing model is 'jienengchen/ViTamin-XL-384px'. • 16 items • Updated Apr 11 • 8
PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter Paper • 2402.10896 • Published Feb 16 • 15
PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter Paper • 2402.10896 • Published Feb 16 • 15
Rejuvenating image-GPT as Strong Visual Representation Learners Paper • 2312.02147 • Published Dec 4, 2023 • 4