virustechhacks's picture
Upload folder using huggingface_hub
0c470ae verified
# Debug Scripts
This folder contains diagnostic and testing scripts used during development and debugging.
## Scripts
### `debug_score.py`
Focused debugging script to trace the final_score transmission issue.
- Tests metadata propagation through the environment
- Verifies observation serialization
- Checks final_score in both metadata and direct field
- **Usage**: `uv run python debug_scripts/debug_score.py`
### `quick_test.py`
Quick verification script for testing Docker builds and final_score fixes.
- Starts a fresh container from the latest image
- Runs a simple episode to completion
- Verifies final_score is properly transmitted
- **Usage**: `uv run python debug_scripts/quick_test.py`
## Purpose
These scripts were created to:
1. Diagnose the bug where `final_score` was always 0.00
2. Trace the root cause (OpenEnv's serialization excluding metadata)
3. Verify the fix (adding `final_score` as a direct field on ProjectObservation)
4. Ensure the Docker build and deployment work correctly
## Fix Summary
The final_score bug was fixed by:
- Adding `final_score: Optional[float]` to `ProjectObservation` in `models.py`
- Populating it in `server/hustlers_env_environment.py`
- Reading it in `client.py` and `inference.py`
This bypasses OpenEnv's metadata exclusion in serialization.