ClinAlign-4B / README.md
ShiweiLyu's picture
Update README.md
8373fc0 verified
metadata
license: apache-2.0
base_model:
  - Qwen/Qwen3-4B-Instruct-2507

ClinAlign

ClinAlign is a clinician-grounded healthcare alignment framework that scales from instance rubrics to a reusable library of clinical principles, enabling robust preference alignment and inference-time self-revision for medical LLMs.

📃 Paper

ClinAlign


🔥 Highlights

  • Clinician-verified rubrics (HealthRubrics): a physician-validated preference dataset built by having clinicians revise and finalize LLM-drafted, checkable rubrics.
  • Reusable principle library (HealthPrinciples): distilled clinician consensus as 119 broadly reusable principles organized by clinical dimensions (urgency / uncertainty / expertise / task type).
  • Scalable supervision: principles can be converted into per-question rubrics for new, unlabeled medical queries—scaling training data without per-instance clinician authoring.
  • Inference-time alignment tool: retrieve matched principles → generate rubric references → guide iterative self-revision at test time.

🥇 Results

🤗 Models: ClinAlign-4BClinAlign-30B-A3B

ClinAlign results


Acknowledgement