Spaces:
Running
Running
Update app.py
Browse files
app.py
CHANGED
@@ -40,15 +40,6 @@ with gr.Blocks(css="""
|
|
40 |
""")
|
41 |
gr.Markdown(
|
42 |
"""
|
43 |
-
[](https://arxiv.org/pdf/2505.05315)
|
44 |
-
[](https://huggingface.co/collections/Salesforce/elastic-reasoning-682b4bba108d6ea0a8bab275)
|
45 |
-
[](https://github.com/SalesforceAIResearch/Elastic-Reasoning)
|
46 |
-
|
47 |
-
## Table of Contents
|
48 |
-
- [Introduction](#introduction)
|
49 |
-
- [Environment Setup](#environment-setup)
|
50 |
-
- [Training](#training)
|
51 |
-
- [Evaluation](#evaluation)
|
52 |
|
53 |
## Introduction
|
54 |
We propose **Elastic Reasoning**, a novel framework for scalable chain of thoughts
|
@@ -75,7 +66,40 @@ training.
|
|
75 |
gr.Image("figs/aime.png", label="Framework", show_label=False, elem_id="small")
|
76 |
gr.Image("figs/livecode.png", label="Framework", show_label=False, elem_id="small")
|
77 |
gr.Image("figs/codetable.png", label="Framework", show_label=False, elem_id="my-img")
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
78 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
79 |
gr.Markdown(
|
80 |
"""
|
81 |
## Citation
|
@@ -88,6 +112,16 @@ training.
|
|
88 |
journal={arXiv preprint arXiv:2505.05315},
|
89 |
year={2025}
|
90 |
}
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
91 |
```
|
92 |
""")
|
93 |
|
|
|
40 |
""")
|
41 |
gr.Markdown(
|
42 |
"""
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
43 |
|
44 |
## Introduction
|
45 |
We propose **Elastic Reasoning**, a novel framework for scalable chain of thoughts
|
|
|
66 |
gr.Image("figs/aime.png", label="Framework", show_label=False, elem_id="small")
|
67 |
gr.Image("figs/livecode.png", label="Framework", show_label=False, elem_id="small")
|
68 |
gr.Image("figs/codetable.png", label="Framework", show_label=False, elem_id="my-img")
|
69 |
+
gr.HTML("""
|
70 |
+
<div align="center">
|
71 |
+
<div>
|
72 |
+
<h3>Part 1:Fractured Chain-of-Thought (Scalable Chain of Thoughts via Elastic Reasoning) 🌟
|
73 |
+
</div>
|
74 |
+
<br>
|
75 |
+
</div>
|
76 |
+
""")
|
77 |
+
gr.HTML("""
|
78 |
+
<div style="display: flex; justify-content: center; gap: 8px; flex-wrap: wrap;">
|
79 |
+
<a href="https://arxiv.org/pdf/2505.12992">
|
80 |
+
<img src="https://img.shields.io/badge/paper-A42C25?style=for-the-badge&logo=arxiv&logoColor=white" />
|
81 |
+
</a>
|
82 |
+
<a href="https://github.com/BaohaoLiao/frac-cot">
|
83 |
+
<img src="https://img.shields.io/badge/frac-cot-000000?style=for-the-badge&logo=github&logoColor=white" />
|
84 |
+
</a>
|
85 |
+
</div>
|
86 |
+
""")
|
87 |
+
|
88 |
+
gr.Markdown(
|
89 |
+
"""
|
90 |
|
91 |
+
## Introduction
|
92 |
+
We propose **Elastic Reasoning**, a novel framework for scalable chain of thoughts
|
93 |
+
that explicitly separates reasoning into two phases—`thinking and solution`—with
|
94 |
+
independently allocated budgets. At test time, Elastic Reasoning prioritize that
|
95 |
+
completeness of solution segments, significantly improving reliability under tight
|
96 |
+
resource constraints. To train models that are robust to truncated thinking, we
|
97 |
+
introduce a lightweight `budget-constrained rollout` strategy, integrated into GRPO,
|
98 |
+
which teaches the model to reason adaptively when the thinking process is cut
|
99 |
+
short and generalizes effectively to unseen budget constraints without additional
|
100 |
+
training.
|
101 |
+
""")
|
102 |
+
gr.Image("figs/framework.png", label="Framework", show_label=False, elem_id="my-img")
|
103 |
gr.Markdown(
|
104 |
"""
|
105 |
## Citation
|
|
|
112 |
journal={arXiv preprint arXiv:2505.05315},
|
113 |
year={2025}
|
114 |
}
|
115 |
+
|
116 |
+
@misc{liao2025fracturedchainofthoughtreasoning,
|
117 |
+
title={Fractured Chain-of-Thought Reasoning},
|
118 |
+
author={Baohao Liao and Hanze Dong and Yuhui Xu and Doyen Sahoo and Christof Monz and Junnan Li and Caiming Xiong},
|
119 |
+
year={2025},
|
120 |
+
eprint={2505.12992},
|
121 |
+
archivePrefix={arXiv},
|
122 |
+
primaryClass={cs.LG},
|
123 |
+
url={https://arxiv.org/abs/2505.12992},
|
124 |
+
}
|
125 |
```
|
126 |
""")
|
127 |
|