# readCtrl_lambda/prompts/readability_revised.txt
# mshahidul, commit 030876e: Initial commit of readCtrl code without large models
def inference_prompt_revise_summary(fulltext, ref_summary, generated_summary, version, missing_subclaims):
    """Build the inference prompt that revises a generated summary at the given readability level."""
    prompt = f"""
You are a medical summarization model specialized in readability-controlled text revision.
Your task is to improve the **Generated Summary** by adding back the key missing clinical information listed under **Missing Subclaims**, while keeping the readability style defined for the level **{version}**.
Do not copy the reference summary. Keep coherence, brevity, and correctness.
---
### INPUT
**Full Text (for context):**
{fulltext}
**Reference Summary (for comparison only):**
{ref_summary}
**Generated Summary (to revise):**
{generated_summary}
**Missing Subclaims (to integrate naturally):**
{missing_subclaims}
---
### READABILITY STYLES
- **easy (FH 70–100, grade 5–7):**
  - Short sentences, familiar vocabulary, concrete ideas.
  - Avoid subordinate clauses and medical jargon.
  - Tone: explanatory, simple, and friendly.
- **intermediate (FH 50–69, grade 8–12):**
  - Moderate sentence complexity and domain vocabulary.
  - Clear and structured explanation.
- **hard (FH 0–49, university/professional):**
  - Use specialized terminology, formal and dense phrasing.
  - Include:
    - precise domain vocabulary;
    - causal or analytical connectors (por consiguiente, sin embargo, dado que…);
    - one definition, one process description, and one implication statement if possible;
    - optional subordinate clauses for academic rhythm.
---
### OUTPUT
Return **only the revised summary text**, coherent and medically correct, matching the {version} readability level.
"""
    return prompt
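The prompt above interpolates `version` and `missing_subclaims` directly into the text, so it can help to normalize both before calling the builder. The following is a minimal sketch, not part of the original file: `ALLOWED_VERSIONS`, `normalize_version`, and `format_subclaims` are hypothetical names, and the assumption that subclaims arrive as a list of strings is ours.

```python
from typing import Iterable

# Level names taken from the READABILITY STYLES section of the prompt.
ALLOWED_VERSIONS = ("easy", "intermediate", "hard")


def normalize_version(version: str) -> str:
    """Lower-case and validate a readability level name (hypothetical helper)."""
    cleaned = version.strip().lower()
    if cleaned not in ALLOWED_VERSIONS:
        raise ValueError(f"unknown readability level: {version!r}")
    return cleaned


def format_subclaims(subclaims: Iterable[str]) -> str:
    """Join non-empty subclaims into a dash-bulleted block, one per line."""
    cleaned = [s.strip() for s in subclaims if s and s.strip()]
    return "\n".join(f"- {s}" for s in cleaned)


if __name__ == "__main__":
    print(normalize_version(" Easy "))
    print(format_subclaims(["Patient had a fever.", "", "Treated with antibiotics."]))
```

The results of both helpers could then be passed as the `version` and `missing_subclaims` arguments, so that the level name in the prompt always matches one of the three documented styles.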
### Synthetic data generation (https://chatgpt.com/c/68f1c138-5a78-8332-8052-eeb65cca1bde)
# --------------------------------
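def generate_revised_summary_prompt(fulltext, ref_summary, generated_summary, version, missing_subclaims):
    """Build the synthetic-data prompt that restores missing subclaims while keeping the readability level."""
    prompt = f"""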
You are a medical summarization model that revises simplified summaries to restore important missing information
while keeping the same readability level.
---
### INPUT INFORMATION
**Readability Level:** {version}
**Full Medical Text (for context):**
{fulltext}
**Reference Summary (complete clinical version):**
{ref_summary}
**Generated Summary (current version, missing some information):**
{generated_summary}
**Important Subclaims Missing:**
{missing_subclaims}
---
### READABILITY STYLE GUIDE
- **easy (FH 70–100, grade 5–7):**
  - Short sentences, common vocabulary, concrete ideas.
  - Avoid subordinate clauses and technical terms.
  - Tone: explanatory, lively, and accessible.
- **intermediate (FH 50–69, grade 8–12):**
  - Moderate complexity, suitable for high school readers.
- **hard (FH 0–49, university/professional):**
  - Use specialized terminology, formal register, dense information packaging, and long multi-clause sentences.
  - Incorporate:
    - precise domain vocabulary;
    - causal or analytical connectors (por consiguiente, sin embargo, en virtud de, dado que…);
    - at least one definition, one process description, and one statement of implications or challenges;
    - optional parenthetical clarifications or subordinate relative clauses for academic rhythm.
---
### TASK
Revise the **Generated Summary** to make it more complete by integrating all the **Important Subclaims Missing**,
while preserving the tone, fluency, and readability level defined above.
- Do **not** copy the reference summary directly.
- Use your own phrasing consistent with the given readability level.
- Keep it concise, coherent, and medically accurate.
- Do not add new facts not supported by the text.
- Integrate subclaims *naturally*, not as a list.
---
### OUTPUT
Return **only the revised summary text**, with no explanation, notes, or formatting.
"""
    return prompt
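Both prompts describe the three levels by FH score bands (70–100, 50–69, 0–49). A revised summary could be checked against its target band once a score is available. The sketch below only maps a score to a band; it does not compute the score itself. The names `band_for_score` and `matches_level` are hypothetical, and assigning fractional scores between 69 and 70 to the intermediate band is an assumption, since the prompts list integer ranges only.

```python
def band_for_score(fh_score: float) -> str:
    """Map an FH readability score to the level names used in the prompts.

    Thresholds follow the style guide: 70 and above is easy, 50 up to
    (but not including) 70 is intermediate, anything lower is hard.
    """
    if fh_score >= 70:
        return "easy"
    if fh_score >= 50:
        return "intermediate"
    return "hard"


def matches_level(fh_score: float, version: str) -> bool:
    """Return True when the score falls in the band named by `version`."""
    return band_for_score(fh_score) == version.strip().lower()


if __name__ == "__main__":
    for score in (85.0, 62.5, 30.0):
        print(score, band_for_score(score))
```

A check like this could be used to flag revised summaries whose measured readability drifted away from the requested level after the missing subclaims were added.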