Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Paper • 2412.04432 • Published 20 days ago • 14
Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration Paper • 2412.13180 • Published 8 days ago • 12
Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers Paper • 2412.12276 • Published 9 days ago • 15
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment Paper • 2411.10606 • Published Nov 15 • 1
MaestroMotif: Skill Design from Artificial Intelligence Feedback Paper • 2412.08542 • Published 14 days ago • 1
CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models Paper • 2412.07393 • Published 15 days ago • 2
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published 20 days ago • 48
Moto: Latent Motion Token as the Bridging Language for Robot Manipulation Paper • 2412.04445 • Published 20 days ago • 21
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published 16 days ago • 62
Efficient Long Video Tokenization via Coordinated-based Patch Reconstruction Paper • 2411.14762 • Published Nov 22 • 11
Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows Paper • 2406.16218 • Published Jun 23 • 2
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper • 2411.19943 • Published 26 days ago • 55