Safetensors
qwen2
hanbin committed on
Commit 3ff3d7e · verified · 1 Parent(s): 3c18b63

add paper link

Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -6,6 +6,7 @@ license: apache-2.0
 
 ## Links
 
+- 📜 [Paper](https://arxiv.org/abs/2502.01456)
 - 📜 [Blog](https://curvy-check-498.notion.site/Process-Reinforcement-through-Implicit-Rewards-15f4fcb9c42180f1b498cc9b2eaf896f)
 - 🤗 [PRIME Collection](https://huggingface.co/PRIME-RL)
 - 🤗 [SFT Data](https://huggingface.co/datasets/PRIME-RL/Eurus-2-SFT-Data)