Clinical Trial Phase-Aware Workflow & Scoring
10-Phase Clinical Workflow
The agent learns this ordering from the reward signal; the ordering is not hard-coded.
| # | Phase | Description | Actions |
|---|---|---|---|
| 0 | `literature_review` | Understand disease and constraints | Review scenario |
| 1 | `hypothesis` | Form hypothesis about drug mechanism | Estimate expected effect |
| 2 | `phase_i_design` | Phase I safety/dose-finding | `run_dose_escalation`, `observe_safety_signal` |
| 3 | `phase_i_analysis` | Analyze Phase I results | `estimate_effect_size` |
| 4 | `phase_ii_design` | Design Phase II efficacy trial | `set_primary_endpoint`, `set_sample_size`, `set_inclusion_criteria`, etc. |
| 5 | `regulatory` | FDA review | `submit_to_fda_review`, `request_protocol_amendment` |
| 6 | `enrollment` | Enroll patients | (implicit after FDA approval) |
| 7 | `monitoring` | Interim analysis, adaptation | `run_interim_analysis`, `modify_sample_size`, `add_biomarker_stratification` |
| 8 | `analysis` | Final statistical test | `run_primary_analysis` |
| 9 | `conclusion` | Synthesize results | `synthesize_conclusion` |
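
The phase table can be mirrored in code as an ordered enum plus an action-to-phase lookup. This is only a sketch: the names `Phase`, `ACTION_PHASE`, and `phase_of` are hypothetical and not taken from the project, and only the actions explicitly named in the table are mapped.

```python
from enum import IntEnum


class Phase(IntEnum):
    """The 10 workflow phases, numbered as in the table above."""
    LITERATURE_REVIEW = 0
    HYPOTHESIS = 1
    PHASE_I_DESIGN = 2
    PHASE_I_ANALYSIS = 3
    PHASE_II_DESIGN = 4
    REGULATORY = 5
    ENROLLMENT = 6
    MONITORING = 7
    ANALYSIS = 8
    CONCLUSION = 9


# Hypothetical lookup: which phase each named action belongs to.
ACTION_PHASE = {
    "run_dose_escalation": Phase.PHASE_I_DESIGN,
    "observe_safety_signal": Phase.PHASE_I_DESIGN,
    "estimate_effect_size": Phase.PHASE_I_ANALYSIS,
    "set_primary_endpoint": Phase.PHASE_II_DESIGN,
    "set_sample_size": Phase.PHASE_II_DESIGN,
    "set_inclusion_criteria": Phase.PHASE_II_DESIGN,
    "submit_to_fda_review": Phase.REGULATORY,
    "request_protocol_amendment": Phase.REGULATORY,
    "run_interim_analysis": Phase.MONITORING,
    "modify_sample_size": Phase.MONITORING,
    "add_biomarker_stratification": Phase.MONITORING,
    "run_primary_analysis": Phase.ANALYSIS,
    "synthesize_conclusion": Phase.CONCLUSION,
}


def phase_of(action: str) -> Phase:
    """Return the phase an action belongs to; raises KeyError if unknown."""
    return ACTION_PHASE[action]
```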
Phase-Order Scoring
| Condition | Reward |
|---|---|
| Action in correct or next phase | +0.2 |
| Action stays in current phase | +0.2 |
| Action skips N phases ahead | −0.3 × N |
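
A minimal sketch of the base scoring rule in the table above. It rests on two assumptions that the table does not state: N is taken to be the number of phases jumped over (so moving from phase 2 to phase 4 skips one phase), and an action that belongs to an earlier phase scores 0. The function name `phase_order_reward` is hypothetical.

```python
def phase_order_reward(current_phase: int, action_phase: int,
                       forward_bonus: float = 0.2,
                       skip_penalty: float = 0.3) -> float:
    """Score one action by where its phase sits relative to the current phase."""
    delta = action_phase - current_phase
    if delta in (0, 1):
        # Action stays in the current phase or advances to the next one.
        return forward_bonus
    if delta > 1:
        # Assumption: N counts the phases jumped over, i.e. delta - 1.
        skipped = delta - 1
        return -skip_penalty * skipped
    # Assumption: backward moves are not covered by the table; score them 0.
    return 0.0
```

For example, an action in phase 5 taken while the agent is in phase 2 jumps over phases 3 and 4, so under these assumptions it scores −0.3 × 2.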
Judge persona scaling by tier:
| Tier | Persona | Forward Bonus | Skip Penalty | Extras |
|---|---|---|---|---|
| Warmup | Junior | +0.20 | −0.30/skip | Allows 1 skip free, gives hints |
| Beginner | Junior→Senior | +0.20 | −0.30/skip | Standard |
| Intermediate | Senior | +0.15 | −0.30/skip | Expects correct ordering |
| Advanced | Senior→Principal | +0.10 | −0.50/skip | Redundancy penalty −0.10 |
| Expert | Principal | +0.05 | −0.50/skip | Redundancy −0.15, efficiency penalty |
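
The tier table could be encoded as configuration, roughly as sketched below. All names (`TierConfig`, `TIERS`, `tier_reward`) are hypothetical. The sketch assumes that a free skip simply cancels one skipped phase, that the caller decides what counts as a redundant action, and that backward moves score 0. The Expert efficiency penalty is left out because the table gives no value for it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TierConfig:
    persona: str
    forward_bonus: float
    skip_penalty: float
    free_skips: int = 0              # skips forgiven before penalties apply
    redundancy_penalty: float = 0.0  # subtracted when an action is redundant


# Values copied from the tier table; the structure itself is an assumption.
TIERS = {
    "warmup": TierConfig("Junior", 0.20, 0.30, free_skips=1),
    "beginner": TierConfig("Junior→Senior", 0.20, 0.30),
    "intermediate": TierConfig("Senior", 0.15, 0.30),
    "advanced": TierConfig("Senior→Principal", 0.10, 0.50, redundancy_penalty=0.10),
    "expert": TierConfig("Principal", 0.05, 0.50, redundancy_penalty=0.15),
}


def tier_reward(tier: str, current_phase: int, action_phase: int,
                free_skips_used: int = 0, is_redundant: bool = False) -> float:
    """Phase-order reward for one action under a given tier."""
    cfg = TIERS[tier]
    delta = action_phase - current_phase
    if delta < 0:
        reward = 0.0  # assumption: backward moves are neutral
    elif delta <= 1:
        reward = cfg.forward_bonus
    else:
        skipped = delta - 1  # assumption: count phases jumped over
        free_left = max(0, cfg.free_skips - free_skips_used)
        reward = -cfg.skip_penalty * max(0, skipped - free_left)
    if is_redundant:
        reward -= cfg.redundancy_penalty
    return reward
```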
Hard Prerequisites
These block the action entirely. A blocked action returns an error rather than a negative reward:
| Action | Requires |
|---|---|
| `estimate_effect_size` | ≥1 `run_dose_escalation` |
| `set_sample_size` | `estimate_effect_size` |
| `submit_to_fda_review` | `set_primary_endpoint` + `set_sample_size` |
| `run_interim_analysis` | `submit_to_fda_review` passed |
| `run_primary_analysis` | `submit_to_fda_review` passed |
| `synthesize_conclusion` | `run_primary_analysis` |
| `modify_sample_size` | `run_interim_analysis` |
| `add_biomarker_stratification` | `estimate_effect_size` |
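
One way to express these gates is a lookup from each action to the set of things that must already have happened. The sketch below is illustrative only: `PREREQUISITES`, `missing_prerequisites`, and `try_action` are hypothetical names, and the `FDA_PASSED` marker is an invented way to distinguish a passed FDA review from a merely submitted one.

```python
from __future__ import annotations

# Hypothetical marker recorded once an FDA review has passed.
FDA_PASSED = "submit_to_fda_review:passed"

# Each action maps to the completed steps it requires (from the table above).
PREREQUISITES: dict[str, set[str]] = {
    "estimate_effect_size": {"run_dose_escalation"},
    "set_sample_size": {"estimate_effect_size"},
    "submit_to_fda_review": {"set_primary_endpoint", "set_sample_size"},
    "run_interim_analysis": {FDA_PASSED},
    "run_primary_analysis": {FDA_PASSED},
    "synthesize_conclusion": {"run_primary_analysis"},
    "modify_sample_size": {"run_interim_analysis"},
    "add_biomarker_stratification": {"estimate_effect_size"},
}


def missing_prerequisites(action: str, completed: set[str]) -> list[str]:
    """Return the unmet prerequisites for an action, sorted for stable output."""
    return sorted(PREREQUISITES.get(action, set()) - completed)


def try_action(action: str, completed: set[str]) -> dict:
    """Block the action with an error (not a reward) if anything is missing."""
    missing = missing_prerequisites(action, completed)
    if missing:
        return {"ok": False,
                "error": f"{action} blocked; missing: {', '.join(missing)}"}
    return {"ok": True, "error": None}
```

Actions without an entry, such as `run_dose_escalation`, are never blocked in this sketch.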
Protocol Amendment
- `request_protocol_amendment` allows recovery from FDA review failure
- Costs time and budget (realistic consequence)
- Successful recovery: +0.3 recovery bonus
- Maximum 2 amendments per episode
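
The amendment limit and recovery bonus could be tracked along these lines. The names `AmendmentState` and `recovery_reward` are hypothetical. The time and budget costs are omitted because their amounts are not given here, and raising an exception at the limit is an assumption about how the block is reported.

```python
from dataclasses import dataclass

MAX_AMENDMENTS = 2    # maximum amendments per episode
RECOVERY_BONUS = 0.3  # reward for a successful recovery


@dataclass
class AmendmentState:
    """Counts the protocol amendments used in one episode."""
    used: int = 0

    def can_amend(self) -> bool:
        return self.used < MAX_AMENDMENTS

    def request(self) -> None:
        """Record one amendment; refuse once the per-episode limit is hit."""
        if not self.can_amend():
            raise RuntimeError("amendment limit reached for this episode")
        self.used += 1


def recovery_reward(resubmission_passed: bool) -> float:
    """Bonus granted only if the review passes after an amendment."""
    return RECOVERY_BONUS if resubmission_passed else 0.0
```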