Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published 13 days ago • 59
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 3 days ago • 47
GeoPixel Collection Pixel Grounding Large Multimodal Model in Remote Sensing • 3 items • Updated 4 days ago • 1
ArTST - Arabic Text Speech Transformer Collection Open source project for Arabic Speech Recognition and Generation • 9 items • Updated 2 days ago • 6
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 6 days ago • 26
The Ultimate Collection of Code Classifiers Collection 🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated 3 days ago • 10
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? Paper • 2502.12115 • Published 6 days ago • 41
view article Article Aurora-M: The First Open Source Biden-Harris Executive Order Red teamed Multilingual Language Model By mayank-mishra • Apr 2, 2024 • 7
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 20 days ago • 106
view article Article CinePile 2.0 - making stronger datasets with adversarial refinement Oct 23, 2024 • 14
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk • Oct 7, 2024 • 17
Ultravox v0.5 Collection Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (fine-tuned) backbone. • 3 items • Updated 13 days ago • 5