Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 1 day ago • 16
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 15 days ago • 101
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published 9 days ago • 31
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 11 days ago • 107
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 5 items • Updated 12 days ago • 33
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published 20 days ago • 24
Article Janus Pro: DeepSeek's Revolutionary Multimodal AI Model By LLMhacker • 22 days ago • 31
Albertina Collection Albertina family of encoders for Portuguese • 9 items • Updated Jul 26, 2024 • 2
Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 27 days ago • 62
🍃 MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024 • 58
GTE models Collection General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated 29 days ago • 23
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 8 days ago • 59