chaoscodes committed on Feb 5

Commit 501a263 (verified) · 1 Parent(s): fce4773

Update README.md

Files changed (1)

README.md CHANGED

@@ -18,15 +18,18 @@ stage leveraging reinforcement learning (RL). Our approach results in Satori, a
 # **Resources**
 Please refer to our blog and research paper for more technical details of Satori.
  - [Blog](https://satori-reasoning.github.io/blog/satori/)
- - [Paper](https://satori-reasoning.github.io/blog/satori/)
+ - [Paper](https://arxiv.org/pdf/2502.02508)
 # **Citation**
 If you find our model and data helpful, please cite our paper:
 ```
-@article{TBD,
-  title={Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search},
-  author={Maohao Shen and Guangtao Zeng and Zhenting Qi and Zhang-Wei Hong and Zhenfang Chen and Wei Lu and Gregory Wornell and Subhro Das and David Cox and Chuang Gan},
-  journal={arXiv preprint arXiv: TBD},
-  year={2025}
+@misc{shen2025satorireinforcementlearningchainofactionthought,
+      title={Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search},
+      author={Maohao Shen and Guangtao Zeng and Zhenting Qi and Zhang-Wei Hong and Zhenfang Chen and Wei Lu and Gregory Wornell and Subhro Das and David Cox and Chuang Gan},
+      year={2025},
+      eprint={2502.02508},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2502.02508},
 }
 ```