Spaces:
Build error
Build error
Kaushik Rajan
commited on
Commit
·
0417f51
1
Parent(s):
eb8fa29
Language edits to app.py
Browse files
app.py
CHANGED
|
@@ -3,9 +3,6 @@ SPIRAL: Strategic Business Competition Simulator
|
|
| 3 |
|
| 4 |
This demo has been updated to more intuitively demonstrate the key concepts from the
|
| 5 |
"Self-Play in Zero-Sum Games Incentivizes Reasoning" (SPIRAL) research paper.
|
| 6 |
-
|
| 7 |
-
Instead of Tic-Tac-Toe, this simulation uses a zero-sum business competition to showcase
|
| 8 |
-
complex, multi-turn strategic reasoning in a more practical and relatable context.
|
| 9 |
"""
|
| 10 |
|
| 11 |
import gradio as gr
|
|
@@ -291,9 +288,9 @@ def create_interface():
|
|
| 291 |
This demo is inspired by the SPIRAL framework from the research paper: [SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning](https://arxiv.org/abs/2506.24119).
|
| 292 |
|
| 293 |
### Key Links to SPIRAL Paper Takeaways
|
| 294 |
-
- **Transferable Reasoning:** Your R&D investments build long-term planning skills, transferable to real-world logic problems
|
| 295 |
-
- **Diverse Skills:** Marketing encourages probabilistic thinking (like Poker), while Sales focuses on resource foresight
|
| 296 |
-
- **Synergy from Multi-Game Training:** Combining these creates a well-rounded strategy, better than focusing on one area
|
| 297 |
|
| 298 |
### How to Use the App
|
| 299 |
|
|
|
|
| 3 |
|
| 4 |
This demo has been updated to more intuitively demonstrate the key concepts from the
|
| 5 |
"Self-Play in Zero-Sum Games Incentivizes Reasoning" (SPIRAL) research paper.
|
|
|
|
|
|
|
|
|
|
| 6 |
"""
|
| 7 |
|
| 8 |
import gradio as gr
|
|
|
|
| 288 |
This demo is inspired by the SPIRAL framework from the research paper: [SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning](https://arxiv.org/abs/2506.24119).
|
| 289 |
|
| 290 |
### Key Links to SPIRAL Paper Takeaways
|
| 291 |
+
- **Transferable Reasoning:** Your R&D investments build long-term planning skills, transferable to real-world logic problems.
|
| 292 |
+
- **Diverse Skills:** Marketing encourages probabilistic thinking (like Poker), while Sales focuses on resource foresight.
|
| 293 |
+
- **Synergy from Multi-Game Training:** Combining these creates a well-rounded strategy, better than focusing on one area.
|
| 294 |
|
| 295 |
### How to Use the App
|
| 296 |
|