Spaces:

sairaj2
/

DataCleanser

Sleeping

sairaj2 commited on 18 days ago

Commit

9696c2d

1 Parent(s): 1482893

Fix README.md frontmatter for HF Spaces

Files changed (1) hide show

README.md CHANGED Viewed

@@ -4,7 +4,6 @@ emoji: 🧼
 colorFrom: blue
 colorTo: green
 sdk: docker
-app_port: 7860
 pinned: false
 tags:
   - fastapi
@@ -12,6 +11,7 @@ tags:
   - openenv
   - data-cleaning
   - data-validation
 ## What this is
@@ -25,11 +25,11 @@ It is suitable for **Hugging Face Spaces (Docker)**. Inference Endpoints are not
 ## Web UI (optional)
-Open `\/web` for a lightweight dashboard to reset/step and view the table preview.
 ## Real-world task
-Simulates a common data engineering workflow: **cleaning a dirty table** so downstream analytics/ML won’t break.
 Agents must iteratively apply safe transformations (imputation, deduplication, normalization, format standardization, range/outlier handling) and then **submit**.
 ## Tasks (3 levels, deterministic grading)
@@ -121,7 +121,7 @@ The baseline script is `inference.py` (repo root). It uses an **OpenAI-compatibl
 Required environment variables (per submission rules):
 - `API_BASE_URL`: OpenAI-compatible endpoint base URL (optional if using OpenAI default)
-- `MODEL_NAME`: model id (e.g. `gpt-4.1-mini`, or your provider’s model name)
 - `OPENAI_API_KEY`: API key (preferred)
 - `HF_TOKEN`: API key fallback (used if `OPENAI_API_KEY` is not set)
@@ -154,5 +154,4 @@ docker exec -it $(docker ps -q --filter ancestor=datacleanser | head -n 1) \
 ## Notes
 - The server generates datasets on startup (see `app.py` startup event).
-- For baseline agent runs (outside Spaces), set `OPENAI_API_KEY` and use `inference.py`.

 colorFrom: blue
 colorTo: green
 sdk: docker
 pinned: false
 tags:
   - fastapi
   - openenv
   - data-cleaning
   - data-validation
+---
 ## What this is
 ## Web UI (optional)
+Open `/web` for a lightweight dashboard to reset/step and view the table preview.
 ## Real-world task
+Simulates a common data engineering workflow: **cleaning a dirty table** so downstream analytics/ML won't break.
 Agents must iteratively apply safe transformations (imputation, deduplication, normalization, format standardization, range/outlier handling) and then **submit**.
 ## Tasks (3 levels, deterministic grading)
 Required environment variables (per submission rules):
 - `API_BASE_URL`: OpenAI-compatible endpoint base URL (optional if using OpenAI default)
+- `MODEL_NAME`: model id (e.g. `gpt-4.1-mini`, or your provider's model name)
 - `OPENAI_API_KEY`: API key (preferred)
 - `HF_TOKEN`: API key fallback (used if `OPENAI_API_KEY` is not set)
 ## Notes
 - The server generates datasets on startup (see `app.py` startup event).
+- For baseline agent runs (outside Spaces), set `OPENAI_API_KEY` and use `inference.py`.