Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos Paper • 2507.15597 • Published 11 days ago • 33
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs Paper • 2507.11097 • Published 17 days ago • 62
Article Asynchronous Robot Inference: Decoupling Action Prediction and Execution By fracapuano and 7 others • 22 days ago • 38
Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 24 days ago • 601
Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others • Jun 3 • 213
ATLAS: Learning to Optimally Memorize the Context at Test Time Paper • 2505.23735 • Published May 29 • 23
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • May 21 • 199
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 491
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 9 items • Updated Apr 28 • 23
OpenCodeReasoning Collection Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding • 10 items • Updated 11 days ago • 17
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning Paper • 2406.06469 • Published Jun 10, 2024 • 30
Article Train your first Decision Transformer By edbeeching and 1 other • Sep 8, 2022 • 13
Physical AI Collection Collection of commercial-grade datasets for physical AI developers • 22 items • Updated 11 days ago • 66
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control By danaaubakirova and 3 others • Feb 4 • 167