Large Action Models: From Inception to Implementation Paper • 2412.10047 • Published 12 days ago • 28
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 12 days ago • 131
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 13 days ago • 74
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Paper • 2412.14161 • Published 7 days ago • 43
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 7 days ago • 103
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks Paper • 2412.15204 • Published 6 days ago • 31
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Paper • 2412.14475 • Published 7 days ago • 51
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 5 days ago • 30
SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation Paper • 2412.13649 • Published 7 days ago • 17
ROICtrl: Boosting Instance Control for Visual Generation Paper • 2411.17949 • Published 29 days ago • 82
GRAPE: Generalizing Robot Policy via Preference Alignment Paper • 2411.19309 • Published 27 days ago • 42
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 21 days ago • 43