Zhibin Gou

zubingou

https://zubingou.github.io/

zubingou's activity

liked a dataset about 2 months ago

cais/hle

Viewer • Updated 28 days ago • 2.7k • 7.11k • 272

authored a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 18 days ago • 2.52M • 11.3k

liked a dataset 3 months ago

HuggingFaceTB/finemath

Viewer • Updated Feb 6 • 48.3M • 10.5k • 292

New activity in TIGER-Lab/MMLU-STEM 3 months ago

🚩 Report: Spam

#2 opened 12 months ago by zubingou

authored a paper 7 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 56

liked a model 8 months ago

AI-MO/NuminaMath-7B-TIR

Text Generation • Updated Aug 14, 2024 • 26.1k • 339

authored a paper 9 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 63

upvoted a paper 9 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 63

liked a dataset 10 months ago

ZenMoore/RoleBench

Preview • Updated Nov 23, 2023 • 842 • 76

New activity in jetmoe/jetmoe-8b 11 months ago

When can we have the training code as illustrated in the paper.

#5 opened 11 months ago by Shamane

liked a dataset 11 months ago

HuggingFaceFW/fineweb

Viewer • Updated Jan 31 • 25B • 307k • 2.03k

New activity in microsoft/rho-math-1b-v0.1 11 months ago

Update README.md

#1 opened 11 months ago by zubingou

commented 2 papers 11 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 90

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 90

liked a model 11 months ago

microsoft/rho-math-7b-interpreter-v0.1

Text Generation • Updated Apr 18, 2024 • 157 • 35

commented 2 papers 11 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 90

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 90

liked a model 11 months ago

microsoft/rho-math-7b-v0.1

Text Generation • Updated Apr 18, 2024 • 386 • 19

upvoted a paper 11 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 90