Spaces:
Sleeping
Sleeping
aded
Browse files- DataCleanser +1 -0
- README.md +12 -10
- __pycache__/inference.cpython-314.pyc +0 -0
DataCleanser
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
Subproject commit 3e6e35f31acfb8a4a3df2a0310b429d1134626e2
|
README.md
CHANGED
|
@@ -1,6 +1,8 @@
|
|
| 1 |
---
|
| 2 |
-
|
| 3 |
-
|
|
|
|
|
|
|
| 4 |
colorFrom: blue
|
| 5 |
colorTo: green
|
| 6 |
sdk: docker
|
|
@@ -12,7 +14,6 @@ tags:
|
|
| 12 |
- openenv
|
| 13 |
- data-cleaning
|
| 14 |
- data-validation
|
| 15 |
-
---
|
| 16 |
|
| 17 |
## What this is
|
| 18 |
|
|
@@ -39,7 +40,7 @@ Agents must iteratively apply safe transformations (imputation, deduplication, n
|
|
| 39 |
- **medium_001**: missing values + format inconsistencies + invalid ranges (employee table)
|
| 40 |
- **hard_001**: missing values + duplicates + mixed date/currency formats + cross-field constraints + outliers (sales table)
|
| 41 |
|
| 42 |
-
On `submit`, the grader returns a **score in
|
| 43 |
|
| 44 |
## Action space
|
| 45 |
|
|
@@ -144,15 +145,16 @@ docker exec -it $(docker ps -q --filter ancestor=datacleanser | head -n 1) \
|
|
| 144 |
|
| 145 |
1. Create a Space → **SDK: Docker**
|
| 146 |
2. Push these files to the Space repo:
|
| 147 |
-
|
| 148 |
-
|
| 149 |
-
|
| 150 |
-
|
| 151 |
-
|
| 152 |
-
|
| 153 |
3. The Space will build and start automatically on port **7860**.
|
| 154 |
|
| 155 |
## Notes
|
| 156 |
|
| 157 |
- The server generates datasets on startup (see `app.py` startup event).
|
| 158 |
- For baseline agent runs (outside Spaces), set `OPENAI_API_KEY` and use `inference.py`.
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
|
| 3 |
+
## title: OpenEnv Data Cleaning Environment
|
| 4 |
+
emoji: 🧼
|
| 5 |
+
|
| 6 |
colorFrom: blue
|
| 7 |
colorTo: green
|
| 8 |
sdk: docker
|
|
|
|
| 14 |
- openenv
|
| 15 |
- data-cleaning
|
| 16 |
- data-validation
|
|
|
|
| 17 |
|
| 18 |
## What this is
|
| 19 |
|
|
|
|
| 40 |
- **medium_001**: missing values + format inconsistencies + invalid ranges (employee table)
|
| 41 |
- **hard_001**: missing values + duplicates + mixed date/currency formats + cross-field constraints + outliers (sales table)
|
| 42 |
|
| 43 |
+
On `submit`, the grader returns a **score in [0.0, 1.0]** in `info.grade.final_score`.
|
| 44 |
|
| 45 |
## Action space
|
| 46 |
|
|
|
|
| 145 |
|
| 146 |
1. Create a Space → **SDK: Docker**
|
| 147 |
2. Push these files to the Space repo:
|
| 148 |
+
- `Dockerfile`
|
| 149 |
+
- `.dockerignore`
|
| 150 |
+
- `requirements.txt`
|
| 151 |
+
- `app.py`
|
| 152 |
+
- `env/`, `agent/`, `data/` (optional; datasets are generated on startup)
|
| 153 |
+
- `README.md` (this file)
|
| 154 |
3. The Space will build and start automatically on port **7860**.
|
| 155 |
|
| 156 |
## Notes
|
| 157 |
|
| 158 |
- The server generates datasets on startup (see `app.py` startup event).
|
| 159 |
- For baseline agent runs (outside Spaces), set `OPENAI_API_KEY` and use `inference.py`.
|
| 160 |
+
|
__pycache__/inference.cpython-314.pyc
ADDED
|
Binary file (22.7 kB). View file
|
|
|