facebook/dinov2-with-registers-giant-imagenet1k-1-layer Image Classification • Updated 2 days ago • 3 • 1
Paper - Multimodal Collection Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated about 15 hours ago • 1
Paper - Multimodal Collection Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated about 15 hours ago • 1
Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published 1 day ago • 18
Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators? Paper • 2307.14023 • Published Jul 26, 2023 • 1
Paper - Multimodal Collection Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated about 15 hours ago • 1
Paper - Multimodal Collection Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated about 15 hours ago • 1
A Touch, Vision, and Language Dataset for Multimodal Alignment Paper • 2402.13232 • Published Feb 20 • 14
Paper - Multimodal Collection Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated about 15 hours ago • 1
Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features? Paper • 2402.00340 • Published Feb 1 • 1
Paper - Multimodal Collection Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated about 15 hours ago • 1
Paper - Multimodal Collection Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 92 items • Updated about 15 hours ago • 1
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper • 2404.05719 • Published Apr 8 • 82