ulab
university
AI & ML interests
None defined yet.
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
Paper • 2506.09033 • Published • 7
ulab-ai/Router-R1-Qwen2.5-3B-Instruct
3B • Updated • 14 • 1
ulab-ai/Router-R1-Qwen2.5-3B-Instruct-Alpha0.9
3B • Updated • 5
ulab-ai/Router-R1-Llama-3.2-3B-Instruct
4B • Updated • 4
Sotopia-RL: Reward Design for Social Intelligence
IRanker: Towards Ranking Foundation Model
Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation).