Clean up task3 and task8 simulations — only imperfect where reasoning warrants it 35a2bd3 ananya173147 commited on Mar 17
Fix task3 simulation: replace generic lab lookup with clinically relevant HR/conditions check bd8a41d ananya173147 commited on Mar 17
Make simulations realistic with redundancy penalties visible in trace adadb76 ananya173147 commited on Mar 17
Fix patient banner to show real FHIR names instead of hash-generated fakes 4abdc38 ananya173147 commited on Mar 17
Update reward.py for 5 in-scope task types and rebuild fhir_cache 7a4e779 ananya173147 commited on Mar 17
Fix baseline Qwen row: add lb-baseline-tasks id, fix chip label 0971fec verified amantra commited on Mar 13
UI: sidebar scroll fix, FHIR APIs tab, aligned reward breakdown 9ba12fc verified amantra commited on Mar 12
Rename quality score to reward score with penalty range note 5730b86 verified amantra commited on Mar 12
Dashboard: add SOTA leaderboard with per-task success rates 093b12b verified amantra commited on Mar 12
UI: white theme, PCP Clinic branding, auto-agent execution (correct path) f6ba458 verified amantra commited on Mar 12
UI: white theme, PCP Clinic branding, auto-agent execution 79734dc verified amantra commited on Mar 12
Remove ENABLE_WEB_INTERFACE: stops env from being instantiated at startup 91d8b9b ananya173147 commited on Mar 10
Slim Dockerfile: drop build-essential and git, only install curl 5e81f1b ananya173147 commited on Mar 10
Fix Dockerfile for HF Space: use python:3.11-slim, port 7860, install at root c22d8f0 ananya173147 commited on Mar 10