Monet: Mixture of Monosemantic Experts for Transformers Paper • 2412.04139 • Published 21 days ago • 10
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models Paper • 2312.00845 • Published Dec 1, 2023 • 36
Bytes Are All You Need: Transformers Operating Directly On File Bytes Paper • 2306.00238 • Published May 31, 2023 • 6