Spaces:
Running
Running
| name: sre-engineer-llm | |
| version: 2.0.0 | |
| description: > | |
| Honest narrow OpenEnv benchmark for incident diagnosis and safe remediation. | |
| Agents query evidence, choose bounded remediation actions, run explicit checks, | |
| and declare resolution only after objective recovery succeeds. | |
| author: Daksh Verma | |
| license: MIT | |
| environment: | |
| action_type: UnifiedIncidentAction | |
| observation_type: UnifiedIncidentObservation | |
| state_type: UnifiedIncidentState | |
| max_steps: 12 | |
| difficulties: [easy, medium, hard] | |
| reward_type: dense | |
| huggingface: | |
| space_id: dakshdoesdev/sre-gym | |
| sdk: docker | |
| hardware: cpu-basic | |