CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction Paper • 2508.03159 • Published 11 days ago • 22
Med-PRM Collection This collection hosts Med-PRM series introduced in paper, Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards • 6 items • Updated Jul 11 • 1
Outlier-Safe Pre-Training (OSP) Collection A collection of ablation and final models trained on the Outlier-Safe Pre-Training (OSP) framework. • 11 items • Updated Jun 26 • 4
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models Paper • 2506.19697 • Published Jun 24 • 44