-
TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning
Paper • 2505.14625 • Published • 13 -
1
TinyV
💬Verify model answers against ground truth
-
zhangchenxu/TinyV-Qwen3-1.7B
Text Generation • 2B • Updated • 7 -
zhangchenxu/TinyV-Qwen3-1.7B-Think
Text Generation • 2B • Updated • 17 • 1
Zhangchen Xu PRO
zhangchenxu
AI & ML interests
LLM Data, Alignment, Post-Training, Safety
Recent Activity
liked
a model
3 days ago
Qwen/Qwen3-235B-A22B-Instruct-2507
liked
a model
7 days ago
mistralai/Devstral-Small-2507
upvoted
an
article
16 days ago
SmolLM3: smol, multilingual, long-context reasoner