Spaces:

Roopalgn
/

openenv-clinical-trial

Sleeping

App Files Files Community

openenv-clinical-trial / docs /phase_workflow.md

Roopalgn

fix: add root route so HF Space doesn't show 404

6b6624d about 1 month ago

preview code

raw

history blame contribute delete

2.7 kB

	# Clinical Trial Phase-Aware Workflow & Scoring

	## 10-Phase Clinical Workflow

	The agent learns this ordering through reward signal, not hard-coding.

	\| # \| Phase \| Description \| Actions \|
	\|---\|-------\|-------------\|---------\|
	\| 0 \| `literature_review` \| Understand disease and constraints \| Review scenario \|
	\| 1 \| `hypothesis` \| Form hypothesis about drug mechanism \| Estimate expected effect \|
	\| 2 \| `phase_i_design` \| Phase I safety/dose-finding \| run_dose_escalation, observe_safety_signal \|
	\| 3 \| `phase_i_analysis` \| Analyze Phase I results \| estimate_effect_size \|
	\| 4 \| `phase_ii_design` \| Design Phase II efficacy trial \| set_primary_endpoint, set_sample_size, set_inclusion_criteria, etc. \|
	\| 5 \| `regulatory` \| FDA review \| submit_to_fda_review, request_protocol_amendment \|
	\| 6 \| `enrollment` \| Enroll patients \| (implicit after FDA approval) \|
	\| 7 \| `monitoring` \| Interim analysis, adaptation \| run_interim_analysis, modify_sample_size, add_biomarker_stratification \|
	\| 8 \| `analysis` \| Final statistical test \| run_primary_analysis \|
	\| 9 \| `conclusion` \| Synthesize results \| synthesize_conclusion \|

	---

	## Phase-Order Scoring

	\| Condition \| Reward \|
	\|-----------\|--------\|
	\| Action in correct or next phase \| +0.2 \|
	\| Action stays in current phase \| +0.2 \|
	\| Action skips N phases ahead \| −0.3 × N \|

	Judge persona scaling by tier:

	\| Tier \| Persona \| Forward Bonus \| Skip Penalty \| Extras \|
	\|------\|---------\|-------------\|-------------\|--------\|
	\| Warmup \| Junior \| +0.20 \| −0.30/skip \| Allows 1 skip free, gives hints \|
	\| Beginner \| Junior→Senior \| +0.20 \| −0.30/skip \| Standard \|
	\| Intermediate \| Senior \| +0.15 \| −0.30/skip \| Expects correct ordering \|
	\| Advanced \| Senior→Principal \| +0.10 \| −0.50/skip \| Redundancy penalty −0.10 \|
	\| Expert \| Principal \| +0.05 \| −0.50/skip \| Redundancy −0.15, efficiency penalty \|

	---

	## Hard Prerequisites

	These block the action entirely (not a reward signal — returns error):

	\| Action \| Requires \|
	\|--------\|----------\|
	\| `estimate_effect_size` \| ≥1 `run_dose_escalation` \|
	\| `set_sample_size` \| `estimate_effect_size` \|
	\| `submit_to_fda_review` \| `set_primary_endpoint` + `set_sample_size` \|
	\| `run_interim_analysis` \| `submit_to_fda_review` passed \|
	\| `run_primary_analysis` \| `submit_to_fda_review` passed \|
	\| `synthesize_conclusion` \| `run_primary_analysis` \|
	\| `modify_sample_size` \| `run_interim_analysis` \|
	\| `add_biomarker_stratification` \| `estimate_effect_size` \|

	---

	## Protocol Amendment

	- `request_protocol_amendment` allows recovery from FDA review failure
	- Costs time and budget (realistic consequence)
	- Successful recovery: +0.3 recovery bonus
	- Maximum 2 amendments per episode

	# Clinical Trial Phase-Aware Workflow & Scoring

	## 10-Phase Clinical Workflow

	The agent learns this ordering through reward signal, not hard-coding.

	\| # \| Phase \| Description \| Actions \|
	\|---\|-------\|-------------\|---------\|
	\| 0 \| `literature_review` \| Understand disease and constraints \| Review scenario \|
	\| 1 \| `hypothesis` \| Form hypothesis about drug mechanism \| Estimate expected effect \|
	\| 2 \| `phase_i_design` \| Phase I safety/dose-finding \| run_dose_escalation, observe_safety_signal \|
	\| 3 \| `phase_i_analysis` \| Analyze Phase I results \| estimate_effect_size \|
	\| 4 \| `phase_ii_design` \| Design Phase II efficacy trial \| set_primary_endpoint, set_sample_size, set_inclusion_criteria, etc. \|
	\| 5 \| `regulatory` \| FDA review \| submit_to_fda_review, request_protocol_amendment \|
	\| 6 \| `enrollment` \| Enroll patients \| (implicit after FDA approval) \|
	\| 7 \| `monitoring` \| Interim analysis, adaptation \| run_interim_analysis, modify_sample_size, add_biomarker_stratification \|
	\| 8 \| `analysis` \| Final statistical test \| run_primary_analysis \|
	\| 9 \| `conclusion` \| Synthesize results \| synthesize_conclusion \|

	---

	## Phase-Order Scoring

	\| Condition \| Reward \|
	\|-----------\|--------\|
	\| Action in correct or next phase \| +0.2 \|
	\| Action stays in current phase \| +0.2 \|
	\| Action skips N phases ahead \| −0.3 × N \|

	Judge persona scaling by tier:

	\| Tier \| Persona \| Forward Bonus \| Skip Penalty \| Extras \|
	\|------\|---------\|-------------\|-------------\|--------\|
	\| Warmup \| Junior \| +0.20 \| −0.30/skip \| Allows 1 skip free, gives hints \|
	\| Beginner \| Junior→Senior \| +0.20 \| −0.30/skip \| Standard \|
	\| Intermediate \| Senior \| +0.15 \| −0.30/skip \| Expects correct ordering \|
	\| Advanced \| Senior→Principal \| +0.10 \| −0.50/skip \| Redundancy penalty −0.10 \|
	\| Expert \| Principal \| +0.05 \| −0.50/skip \| Redundancy −0.15, efficiency penalty \|

	---

	## Hard Prerequisites

	These block the action entirely (not a reward signal — returns error):

	\| Action \| Requires \|
	\|--------\|----------\|
	\| `estimate_effect_size` \| ≥1 `run_dose_escalation` \|
	\| `set_sample_size` \| `estimate_effect_size` \|
	\| `submit_to_fda_review` \| `set_primary_endpoint` + `set_sample_size` \|
	\| `run_interim_analysis` \| `submit_to_fda_review` passed \|
	\| `run_primary_analysis` \| `submit_to_fda_review` passed \|
	\| `synthesize_conclusion` \| `run_primary_analysis` \|
	\| `modify_sample_size` \| `run_interim_analysis` \|
	\| `add_biomarker_stratification` \| `estimate_effect_size` \|

	---

	## Protocol Amendment

	- `request_protocol_amendment` allows recovery from FDA review failure
	- Costs time and budget (realistic consequence)
	- Successful recovery: +0.3 recovery bonus
	- Maximum 2 amendments per episode