Upload README.md with huggingface_hub
README.md
CHANGED
@@ -1,10 +1,32 @@
 ---
-title: Tetris
-emoji:
-colorFrom:
-colorTo:
+title: Tetris OpenEnv
+emoji: 🎮
+colorFrom: purple
+colorTo: blue
 sdk: docker
-
+app_port: 7860
 ---
 
-
+# Tetris OpenEnv
+
+A Tetris RL environment for LLM agent training, built on the OpenEnv spec.
+
+LLM agents receive a text-based board representation and must choose spatial actions (left, right, rotate, drop) to play Tetris. Features combo scoring where clearing multiple lines simultaneously gives disproportionately higher rewards.
+
+## API
+
+- `POST /reset` — Start new episode, returns session_id + initial observation
+- `POST /step/{session_id}` — Take an action, returns observation + reward + done
+- `GET /state/{session_id}` — Get current state without acting
+- `GET /info` — Environment metadata
+
+## Reward Structure
+
+| Lines Cleared | Reward | Multiplier |
+|---|---|---|
+| 1 | +100 | x1 |
+| 2 | +300 | x3 |
+| 3 | +700 | x7 |
+| 4 (Tetris!) | +1500 | x15 |
+
+Penalties: -1/step, -2*height, -5*holes, -500 game over.
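
The numbers in the added `## Reward Structure` section can be put together as a rough sketch of a per-step reward. The table values and penalty coefficients below are taken from the README. Everything else is an assumption made for illustration: the function name `step_reward`, the idea that all terms are summed on every step, and the meaning of `height` (the README does not say whether it is the maximum column height or some aggregate).

```python
# Line-clear rewards copied from the README table (0 lines cleared earns nothing).
LINE_CLEAR_REWARD = {0: 0, 1: 100, 2: 300, 3: 700, 4: 1500}

# Penalty coefficients copied from the README "Penalties" line.
STEP_PENALTY = 1
HEIGHT_PENALTY = 2
HOLE_PENALTY = 5
GAME_OVER_PENALTY = 500

# The listed multipliers (x1, x3, x7, x15) happen to follow 2**n - 1.
assert all(LINE_CLEAR_REWARD[n] == 100 * (2**n - 1) for n in range(1, 5))


def step_reward(lines_cleared, height, holes, game_over):
    """Hypothetical per-step reward: sums every term on each step (an assumption)."""
    reward = LINE_CLEAR_REWARD[lines_cleared]
    reward -= STEP_PENALTY
    reward -= HEIGHT_PENALTY * height
    reward -= HOLE_PENALTY * holes
    if game_over:
        reward -= GAME_OVER_PENALTY
    return reward
```

Under these assumptions, a four-line clear on an empty, hole-free board nets 1500 minus the step penalty, while the same step on a tall, hole-ridden board is worth noticeably less. That matches the README's description of combo scoring as disproportionately rewarding.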
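
The four endpoints listed under `## API` suggest a simple reset/step loop. Below is a minimal client sketch using only the Python standard library. The endpoint paths and the port (`app_port: 7860`) come from the README. The host, the JSON key names (`session_id`, `observation`, `reward`, `done`), and the `{"action": ...}` request body are assumptions, since the README names the returned values but not the wire format.

```python
import json
import urllib.request

# Assumption: the Space is running locally on the app_port from the front matter.
BASE_URL = "http://localhost:7860"


def build_request(method, path, payload=None, base_url=BASE_URL):
    """Build a request for one of the documented endpoints, with an optional JSON body."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    headers = {"Content-Type": "application/json"} if data is not None else {}
    return urllib.request.Request(base_url + path, data=data, headers=headers, method=method)


def call(method, path, payload=None, base_url=BASE_URL):
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(build_request(method, path, payload, base_url)) as resp:
        return json.loads(resp.read().decode("utf-8"))


def play_episode(policy, max_steps=1000, base_url=BASE_URL):
    """Run one episode; `policy` maps an observation to an action string."""
    start = call("POST", "/reset", base_url=base_url)
    session_id = start["session_id"]      # assumed key name
    observation = start["observation"]    # assumed key name
    total = 0.0
    for _ in range(max_steps):
        # Assumed request body shape; the README does not specify it.
        action = policy(observation)
        result = call("POST", f"/step/{session_id}", {"action": action}, base_url)
        observation = result["observation"]
        total += result["reward"]
        if result["done"]:
            break
    return total
```

A trivial baseline would be `play_episode(lambda obs: "drop")`. Here `"drop"` is one of the four actions the README names, though the exact string the server accepts is not confirmed.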