shijie xia

seven-cat

https://shijie-xia.github.io/

AI & ML interests

LLMs

Recent Activity

upvoted a paper 4 days ago

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

liked a model about 2 months ago

Qwen/QwQ-32B-Preview

authored a paper about 2 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

View all activity

Organizations

seven-cat's activity

upvoted a paper 4 days ago

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Paper • 2501.03124 • Published 6 days ago • 12

liked a model about 2 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated about 23 hours ago • 134k • 1.53k

authored a paper about 2 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 41

liked a Space 3 months ago

Running

259

🚀

Chat-with-OpenAI-o1-mini

liked a Space 5 months ago

Runtime error

📚

Rebiber

liked a Space 6 months ago

Running on Zero

🏆

Anole

authored 2 papers 7 months ago

OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?

Paper • 2406.16772 • Published Jun 24, 2024 • 2

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Paper • 2406.12753 • Published Jun 18, 2024 • 14

liked a dataset 7 months ago

RyokoAI/ShareGPT52K

Preview • Updated Apr 2, 2023 • 131 • 311

upvoted a collection 7 months ago

Long Context

Collection

50 items • Updated 7 days ago • 6

liked a dataset 8 months ago

bigcode/commitpackft

Viewer • Updated Aug 20, 2023 • 702k • 5.55k • 62

liked a dataset 9 months ago

anon8231489123/ShareGPT_Vicuna_unfiltered

Updated Apr 12, 2023 • 11.3k • 761

updated 2 models 9 months ago

GAIR/ReasonEval-34B

Text Classification • Updated Apr 9, 2024 • 5 • 4

GAIR/ReasonEval-7B

Text Classification • Updated Apr 9, 2024 • 60 • 2