Shuaijie She's picture

Shuaijie She

kevinpro

·

https://ricardokevins.github.io/

AI & ML interests

Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization

Recent Activity

updated a Space about 6 hours ago

kevinpro/R-PRM-Demo

new activity about 9 hours ago

nvidia/OpenReasoning-Nemotron-7B:Has the model undergone the RL process or just SFT on R1-0558 reasoning trajectory?

new activity 2 days ago

kevinpro/R-PRM-Demo:Apply for community grant: Academic project (gpu)

View all activity

Organizations

Collections 2

Papers 1

arxiv:2401.06838

spaces 6

Running on Zero

DEMO

Verify math steps with explanations

Uni API

Geminitest

Enter token to log in

Pro2a

Testgeminispace

验证令牌并登录

Open Multilingual Reasoning Leaderboard

Display and search a leaderboard of math models

models 16

kevinpro/Seed-X-PPO-7B-Q8_0-GGUF

8B • Updated 4 days ago • 12

kevinpro/R-PRM-7B-DPO

Text Generation • 8B • Updated Mar 28 • 6

kevinpro/Hydra-LLaMA3-8B-0531-preview-Q4_K_M-GGUF

Text Generation • 8B • Updated May 31, 2024 • 1

kevinpro/MistralMathOctopus-7B

Text Generation • 7B • Updated Mar 26, 2024 • 1

kevinpro/MetaMathOctopus-MAPO-DPO-13B

Text Generation • 13B • Updated Mar 26, 2024 • 1

kevinpro/MathOctopus-MAPO-DPO-7B

Text Generation • 7B • Updated Mar 26, 2024 • 2

kevinpro/MetaMathOctopus-13B

Text Generation • 13B • Updated Mar 26, 2024 • 2

kevinpro/MetaMathOctopus-MAPO-DPO-7B

Text Generation • 7B • Updated Mar 26, 2024 • 2

kevinpro/MetaMathOctopus-7B

Text Generation • 7B • Updated Mar 26, 2024 • 2

kevinpro/MathOctopus-MAPO-DPO-13B

Text Generation • 13B • Updated Mar 26, 2024 • 6

datasets 5

kevinpro/R-PRM

Viewer • Updated Mar 28 • 594k • 185

kevinpro/WildChat-1M-GPT4-1Turn

Viewer • Updated May 6, 2024 • 121k • 15

kevinpro/WildChat-1M-GPT4-strict

Updated May 6, 2024 • 4

kevinpro/WildChat-1M-GPT4

Viewer • Updated May 6, 2024 • 221k • 31

kevinpro/MNumGLUESub

Updated Mar 11, 2024 • 7