InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models Paper • 2312.05849 • Published Dec 10, 2023 • 1
One Loss for All: Deep Hashing with a Single Cosine Similarity based Learning Objective Paper • 2109.14449 • Published Sep 29, 2021 • 1
Unsupervised Hashing with Similarity Distribution Calibration Paper • 2302.07669 • Published Feb 15, 2023 • 1
InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images Paper • 2503.09130 • Published Mar 12, 2025
Post: Self-Forcing, a real-time video model distilled from Wan 2.1 by @adobe, is out, and they open sourced it 🐐 I've built a live real-time demo on Spaces 📹💨 multimodalart/self-forcing
Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps Paper • 2406.14539 • Published Jun 20, 2024 • 28
MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization Paper • 2405.17873 • Published May 28, 2024 • 3
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Paper • 2406.02540 • Published Jun 4, 2024 • 3
E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling Paper • 2412.14170 • Published Dec 18, 2024
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published Feb 3, 2025 • 42
PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models Paper • 2312.08290 • Published Dec 13, 2023 • 3
World-consistent Video Diffusion with Explicit 3D Modeling Paper • 2412.01821 • Published Dec 2, 2024 • 4
Pathways on the Image Manifold: Image Editing via Video Generation Paper • 2411.16819 • Published Nov 25, 2024 • 38
Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis Paper • 2404.19622 • Published Apr 30, 2024 • 2
MM-Conv: A Multi-modal Conversational Dataset for Virtual Humans Paper • 2410.00253 • Published Sep 30, 2024
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective Paper • 2310.11451 • Published Oct 17, 2023