Kaushik Rajan committed
Commit 0417f51 · 1 parent: eb8fa29

Language edits to app.py

app.py CHANGED
@@ -3,9 +3,6 @@ SPIRAL: Strategic Business Competition Simulator
 
 This demo has been updated to more intuitively demonstrate the key concepts from the
 "Self-Play in Zero-Sum Games Incentivizes Reasoning" (SPIRAL) research paper.
-
-Instead of Tic-Tac-Toe, this simulation uses a zero-sum business competition to showcase
-complex, multi-turn strategic reasoning in a more practical and relatable context.
 """
 
 import gradio as gr
@@ -291,9 +288,9 @@ def create_interface():
 This demo is inspired by the SPIRAL framework from the research paper: [SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning](https://arxiv.org/abs/2506.24119).
 
 ### Key Links to SPIRAL Paper Takeaways
-- **Transferable Reasoning:** Your R&D investments build long-term planning skills, transferable to real-world logic problems
-- **Diverse Skills:** Marketing encourages probabilistic thinking (like Poker), while Sales focuses on resource foresight
-- **Synergy from Multi-Game Training:** Combining these creates a well-rounded strategy, better than focusing on one area
+- **Transferable Reasoning:** Your R&D investments build long-term planning skills, transferable to real-world logic problems.
+- **Diverse Skills:** Marketing encourages probabilistic thinking (like Poker), while Sales focuses on resource foresight.
+- **Synergy from Multi-Game Training:** Combining these creates a well-rounded strategy, better than focusing on one area.
 
 ### How to Use the App
 