Kaushik Rajan committed on
Commit 14c2f3e · 1 Parent(s): 52e6594

docs: update README with business game info and git remotes

Files changed (1)
  1. README.md +39 -44
README.md CHANGED
@@ -1,42 +1,39 @@
  ---
- title: Reasoning Simulator
- emoji: 🏆
- colorFrom: gray
- colorTo: blue
  sdk: gradio
- sdk_version: 5.36.2
  app_file: app.py
  pinned: false
  license: apache-2.0
- short_description: An interactive reasoning game simulator
  ---

- # SPIRAL: Self-Play Reasoning Demo

- **Demonstrating how strategic reasoning emerges from self-play in zero-sum games**

- Based on: *"Self-Play in Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning"*

- ## 🎮 Interactive Demo

- This simplified demo showcases the key concepts from the SPIRAL research through an interactive TicTacToe game. Watch as the AI demonstrates strategic reasoning using minimax tree search and explains its decision-making process.

- ## 🧠 Key Concepts Demonstrated

- ### Strategic Reasoning
- - AI uses minimax tree search to evaluate all possible future moves
- - Demonstrates how optimal strategies emerge from competitive gameplay
- - Shows explicit reasoning explanations for each move

- ### Self-Play Learning Principles
- - Zero-sum games create competitive pressure that incentivizes strategic thinking
- - Multi-agent interactions naturally develop intelligent behavior
- - Strategic patterns emerge from repeated competitive gameplay

- ### Tree Search & Planning
- - Minimax algorithm demonstrates formalized strategic reasoning
- - Look-ahead planning to evaluate future game states
- - Optimal decision-making under competitive constraints
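
For readers unfamiliar with the technique named in the removed text, the following is a minimal sketch of minimax search on a TicTacToe board. It is not the code from `app.py`, which this diff does not show. The board encoding (a list of nine cells) and the names `winner`, `minimax`, and `best_move` are assumptions made for illustration only.

```python
from typing import List, Optional

Board = List[str]  # assumed encoding: 9 cells, each "X", "O", or " "

WIN_LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),   # rows
             (0, 3, 6), (1, 4, 7), (2, 5, 8),   # columns
             (0, 4, 8), (2, 4, 6)]              # diagonals


def winner(board: Board) -> Optional[str]:
    """Return "X" or "O" if that player has three in a row, else None."""
    for a, b, c in WIN_LINES:
        if board[a] != " " and board[a] == board[b] == board[c]:
            return board[a]
    return None


def minimax(board: Board, player: str, ai: str = "O") -> int:
    """Score a position for the AI: +1 win, 0 draw, -1 loss.

    `player` is the side to move. The AI maximizes the score and the
    opponent minimizes it.
    """
    won = winner(board)
    if won is not None:
        return 1 if won == ai else -1
    if " " not in board:
        return 0  # board full, no winner: draw
    other = "X" if player == "O" else "O"
    scores = []
    for i in range(9):
        if board[i] == " ":
            board[i] = player              # try the move
            scores.append(minimax(board, other, ai))
            board[i] = " "                 # undo it
    return max(scores) if player == ai else min(scores)


def best_move(board: Board, ai: str = "O") -> int:
    """Return the index of the empty cell with the highest minimax score."""
    human = "X" if ai == "O" else "O"
    best_score, best_idx = -2, -1
    for i in range(9):
        if board[i] == " ":
            board[i] = ai
            score = minimax(board, human, ai)
            board[i] = " "
            if score > best_score:
                best_score, best_idx = score, i
    return best_idx
```

Calling `best_move(["X", "X", " ", " ", "O", " ", " ", " ", " "])` picks cell 2, because every other reply lets X complete the top row. Exhaustive search is practical here only because the TicTacToe game tree is small.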

  ## 🚀 Running the Demo

@@ -59,34 +56,32 @@ The demo is deployed and ready to use at:

  ## 📝 How It Works

- 1. **Human Move**: Click any square to make your move as X
- 2. **AI Analysis**: The AI analyzes the game tree using minimax search
- 3. **Strategic Reasoning**: Watch the AI explain its decision-making process
- 4. **Optimal Play**: The AI chooses the move that maximizes its winning probability

  ## 🔬 Research Connection

- This demo illustrates core findings from the SPIRAL methodology:

- - **Zero-sum competitive environments** naturally incentivize strategic reasoning
- - **Multi-turn planning** emerges from the need to anticipate opponent moves
- - **Strategic reasoning capabilities** developed through self-play can transfer to general reasoning tasks
- - **Tree search algorithms** formalize the strategic reasoning process

- ## 🎯 Educational Value

- Perfect for:
- - Understanding strategic AI decision-making
- - Learning about game theory and minimax algorithms
- - Exploring the connection between competition and intelligence
- - Visualizing how reasoning emerges from strategic gameplay

- ## 📊 Technical Details

- - **Game Environment**: Clean TicTacToe implementation with proper state management
- - **AI Strategy**: Minimax algorithm with optimal move selection
- - **Reasoning Display**: Generated explanations of AI strategic thinking
- - **Interactive Interface**: Real-time game state updates and move explanations

  ---
  ---
+ title: "SPIRAL: Strategic Business Competition"
+ emoji: 📈
+ colorFrom: blue
+ colorTo: green
  sdk: gradio
+ sdk_version: 4.29.0
  app_file: app.py
  pinned: false
  license: apache-2.0
+ short_description: An interactive simulator for strategic business competition.
  ---

+ # SPIRAL: Strategic Business Competition Simulator

+ **An interactive demo inspired by the paper: *"Self-Play in Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning"***

+ This demo has been updated to more intuitively demonstrate the core concepts of the SPIRAL research. Instead of a simple board game, it uses a **strategic business competition** to showcase how competitive pressures in a zero-sum environment can lead to complex, multi-turn reasoning and planning.

+ ## 🎮 The Game: A Zero-Sum Business Battle

+ You and an AI competitor are in charge of rival companies. Over 12 quarters (turns), you must make critical budget allocation decisions to win market share. The company with the highest market share at the end of the game wins.

+ Your goal is to strategically allocate your quarterly budget across three key areas:
+ - **Research & Development (R&D):** Increases your product quality, providing a long-term competitive advantage.
+ - **Marketing:** Directly captures market share from your opponent in the short term.
+ - **Sales:** Generates revenue for your next quarter's budget, fueling future growth.

+ ## 🧠 Key Concepts Demonstrated

+ This simulator illustrates how principles from the SPIRAL framework emerge in a dynamic system:

+ - **Strategic Reasoning:** The AI analyzes your moves and market conditions to make counter-moves, balancing short-term gains (Marketing) with long-term investments (R&D).
+ - **Multi-Turn Planning:** A decision to over-invest in marketing for a quick win might leave you with a poor product and low budget in later quarters. You must plan ahead.
+ - **Emergent Strategies:** There is no single "best" move. The optimal strategy depends on your opponent's actions, forcing you to adapt and reason about their potential choices.
+ - **Resource Management:** In this zero-sum game, every percentage of market share you gain, the AI loses. Efficiently managing your budget is critical to victory.
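
The three budget areas and the zero-sum share rule described above can be sketched as a single quarterly update. This is a toy model under stated assumptions, not the simulation in `app.py`, which this diff does not show. Every constant and formula here (`BASE_BUDGET`, `QUALITY_GAIN`, `MAX_SHIFT`, `SALES_RETURN`, and the quality-weighted marketing rule) is hypothetical; only the roles of R&D, Marketing, and Sales come from the README text.

```python
from dataclasses import dataclass
from typing import Tuple

Allocation = Tuple[float, float, float]  # (rnd, marketing, sales) fractions

# All constants below are assumptions chosen for illustration.
BASE_BUDGET = 1000.0    # budget every company receives each quarter
QUALITY_GAIN = 0.0005   # product quality gained per unit of R&D spend
MAX_SHIFT = 10.0        # most share points that can move in one quarter
SALES_RETURN = 1.5      # next-quarter revenue per unit of sales spend


@dataclass
class Company:
    budget: float = BASE_BUDGET
    quality: float = 1.0
    share: float = 50.0  # market share in percent


def validate(alloc: Allocation) -> None:
    """Reject allocations that are negative or do not sum to 1."""
    if any(x < 0 for x in alloc) or abs(sum(alloc) - 1.0) > 1e-9:
        raise ValueError("allocation must be non-negative and sum to 1")


def play_quarter(player: Company, ai: Company,
                 player_alloc: Allocation, ai_alloc: Allocation) -> None:
    """Advance both companies by one quarter, in place."""
    validate(player_alloc)
    validate(ai_alloc)
    p_rnd, p_mkt, p_sales = (player.budget * f for f in player_alloc)
    a_rnd, a_mkt, a_sales = (ai.budget * f for f in ai_alloc)

    # R&D: a lasting quality improvement.
    player.quality += QUALITY_GAIN * p_rnd
    ai.quality += QUALITY_GAIN * a_rnd

    # Marketing: pull on market share, scaled by product quality.
    p_pull = p_mkt * player.quality
    a_pull = a_mkt * ai.quality
    total = p_pull + a_pull
    shift = 0.0 if total == 0 else MAX_SHIFT * (p_pull - a_pull) / total

    # Zero-sum: whatever one side gains, the other loses.
    player.share = min(100.0, max(0.0, player.share + shift))
    ai.share = 100.0 - player.share

    # Sales: funds the next quarter's budget.
    player.budget = BASE_BUDGET + SALES_RETURN * p_sales
    ai.budget = BASE_BUDGET + SALES_RETURN * a_sales
```

In this toy model, a player who puts 60% into marketing against an AI that puts 60% into sales gains share immediately but starts the next quarter with a smaller budget, which is the short-term versus long-term trade-off the list describes.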

  ## 🚀 Running the Demo

@@ -59,34 +56,32 @@ The demo is deployed and ready to use at:

  ## 📝 How It Works

+ 1. **Allocate Your Budget:** Use the sliders to decide how to allocate your budget for the quarter.
+ 2. **Submit Your Move:** Once you finalize your allocation, submit it.
+ 3. **AI Analysis & Counter-Move:** The AI evaluates the game state and your strategy, then makes its own budget allocation. The AI's reasoning is printed for you to see.
+ 4. **Quarterly Results:** The simulation advances one quarter. Market share shifts, product quality improves, and new budgets are calculated based on both of your decisions.
+ 5. **Review and Adapt:** Analyze the results on the dashboard and adapt your strategy for the next quarter.
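
Step 3 above (the AI picks an allocation and prints its reasoning) could look roughly like the rule-based sketch below. The diff does not show how the actual AI in `app.py` decides, so the function name `ai_counter_move`, its parameters, the thresholds, and the returned allocations are all hypothetical. Only the 12-quarter game length comes from the README.

```python
from typing import Tuple

Allocation = Tuple[float, float, float]  # (rnd, marketing, sales) fractions

TOTAL_QUARTERS = 12  # from the README: the game lasts 12 quarters


def ai_counter_move(quarter: int, ai_share: float,
                    player_marketing: float) -> Tuple[Allocation, str]:
    """Return a hypothetical AI allocation plus a one-line explanation.

    quarter:          current quarter, 1-indexed
    ai_share:         AI market share in percent
    player_marketing: fraction the player just put into marketing
    """
    remaining = TOTAL_QUARTERS - quarter
    if remaining <= 2:
        # Long-term investments can no longer pay off before the game ends.
        return (0.0, 1.0, 0.0), (
            "Few quarters remain, so I am spending everything on marketing."
        )
    if ai_share < 45.0 or player_marketing > 0.5:
        # Respond to pressure on market share with a marketing-heavy split.
        return (0.2, 0.5, 0.3), (
            "You are pushing hard on market share, so I am matching your "
            "marketing while keeping some R&D and sales."
        )
    # Default: build quality and budget for later quarters.
    return (0.4, 0.2, 0.4), (
        "The market is stable, so I am investing in product quality and "
        "next quarter's budget."
    )


if __name__ == "__main__":
    allocation, reasoning = ai_counter_move(
        quarter=3, ai_share=42.0, player_marketing=0.6
    )
    print(allocation, reasoning)
```

A fixed rule table like this is easy to read but cannot adapt the way a learned or search-based policy would; it is shown only to make the "allocation plus printed reasoning" contract concrete.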

  ## 🔬 Research Connection

+ This demo connects directly to the core findings of the SPIRAL methodology:
+ - **Zero-Sum Environments Drive Strategy:** The business competition is a zero-sum game for market share, creating the competitive pressure needed for strategic reasoning to emerge.
+ - **Anticipatory Planning:** Success requires you to anticipate how your investments will pay off over several turns and how your opponent will react.
+ - **Transferable Reasoning:** The skills developed in this complex game (balancing priorities, managing resources, and predicting opponent behavior) are forms of general strategic reasoning.

+ ## _Dual Git Remotes_

+ This repository is configured with two remotes:

+ - **`origin`**: Pushes to the Hugging Face Space for deployment.
+ - **`github`**: Pushes to the public GitHub repository for development and version control.

+ To push changes to both platforms, you can use the following commands:

+ ```bash
+ git push origin main
+ git push github main
+ ```

  ---