arxiv:2411.16489
shijie xia
seven-cat
AI & ML interests
LLMs
Recent Activity
upvoted
a
paper
4 days ago
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level
Reward Models
liked
a model
about 2 months ago
Qwen/QwQ-32B-Preview
authored
a paper
about 2 months ago
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple
Distillation, Big Progress or Bitter Lesson?