REDUCIO! Generating 1024$\times$1024 Video within 16 Seconds using Extremely Compressed Motion Latents Paper • 2411.13552 • Published Nov 20, 2024
Region-Adaptive Sampling for Diffusion Transformers Paper • 2502.10389 • Published 13 days ago • 52 • 3
microsoft/LLM2CLIP-Llama3.2-1B-EVA02-L-14-336 Zero-Shot Image Classification • Updated Dec 12, 2024 • 10
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 10 items • Updated Jan 8 • 55
microsoft/LLM2CLIP-Llama3.2-1B-EVA02-L-14-336 Zero-Shot Image Classification • Updated Dec 12, 2024 • 10
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 10 items • Updated Jan 8 • 55
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 10 items • Updated Jan 8 • 55
microsoft/LLM2CLIP-Llama-3-8B-Instruct-CC-Finetuned Zero-Shot Classification • Updated Nov 19, 2024 • 4.16k • 32