Spaces:

BAIBHAV1234
/

Sepsis-OpenEnv

Sleeping

Sepsis-OpenEnv / results_comparison.md

Upload folder using huggingface_hub

c655b32 verified 23 days ago

884 Bytes

ID3QNE Sepsis OpenEnv Results

Policy	Mean Score	Density	Steps	Safety
Heuristic	0.9867	1.00	9.7	100%
LLM (gpt-4o-mini)	0.9867	1.00	9.7	100%
ID3QNE	0.9867	1.00	9.7	100%

All verified policies achieved dense reward performance with zero safety violations in the local OpenEnv sepsis benchmark.

The OpenAI-backed policy was constrained to the environment action schema and guarded against unsupported outputs.
In this environment, the observed performance ceiling is 0.9867, and both the LLM-controlled run and ID3QNE matched that ceiling.