VortexedSquirrel commited on
Commit
c59c739
·
verified ·
1 Parent(s): c694b6f

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +28 -6
README.md CHANGED
@@ -1,10 +1,32 @@
1
  ---
2
- title: Tetris Env
3
- emoji:
4
- colorFrom: indigo
5
- colorTo: pink
6
  sdk: docker
7
- pinned: false
8
  ---
9
 
10
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: Tetris OpenEnv
3
+ emoji: 🎮
4
+ colorFrom: purple
5
+ colorTo: blue
6
  sdk: docker
7
+ app_port: 7860
8
  ---
9
 
10
+ # Tetris OpenEnv
11
+
12
+ A Tetris RL environment for LLM agent training, built on the OpenEnv spec.
13
+
14
+ LLM agents receive a text-based board representation and must choose spatial actions (left, right, rotate, drop) to play Tetris. Features combo scoring where clearing multiple lines simultaneously gives disproportionately higher rewards.
15
+
16
+ ## API
17
+
18
+ - `POST /reset` — Start new episode, returns session_id + initial observation
19
+ - `POST /step/{session_id}` — Take an action, returns observation + reward + done
20
+ - `GET /state/{session_id}` — Get current state without acting
21
+ - `GET /info` — Environment metadata
22
+
23
+ ## Reward Structure
24
+
25
+ | Lines Cleared | Reward | Multiplier |
26
+ |---|---|---|
27
+ | 1 | +100 | x1 |
28
+ | 2 | +300 | x3 |
29
+ | 3 | +700 | x7 |
30
+ | 4 (Tetris!) | +1500 | x15 |
31
+
32
+ Penalties: -1/step, -2*height, -5*holes, -500 game over.