BioMamba: A Pre-trained Biomedical Language Representation Model Leveraging Mamba • Paper • 2408.02600 • Published Aug 5, 2024 • 11
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning • Paper • 2503.05592 • Published 17 days ago • 25
steiner-preview • Collection • Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 32