Henrychur
/

DiagAgent-8B

@@ -1,14 +1,17 @@
 ---
-license: apache-2.0
-language:
-- en
 base_model:
 - meta-llama/Llama-3.1-8B-Instruct
 tags:
 - medical
 - diagnosis
 - RL
 ---
 # DiagAgent-8B: RL-Optimized Diagnostic Agent
 <div align="center">
@@ -25,6 +28,7 @@ DiagAgent‑8B is a reinforcement learning‑optimized large language model for
 DiagAgent‑8B is trained end‑to‑end inside the `DiagGym` virtual clinical environment with multi‑turn RL (GRPO), enabling safe, closed‑loop learning without real‑world risk.
 Details can be found in our paper https://arxiv.org/abs/2510.24654
 ## Quickstart
@@ -58,27 +62,45 @@ def chat(messages, max_new_tokens=1024, temperature=0.0):
 SYSTEM_PROMPT = (
     "You are a medical AI assistant. Analyze patient information, suggest relevant tests, "
-    "and provide a final diagnosis when sufficient information is available.\n\n"
-    "RESPONSE FORMAT:\n"
-    "If more information is needed:\n"
-    "```\n"
-    "Current diagnosis: <your current best diagnosis>\n"
-    "Based on the patient's initial presentation, the following investigation(s) should be performed: <one additional test>\n"
-    "Reason: <reason for the test>\n"
-    "```\n"
-    "If sufficient information exists for diagnosis:\n"
-    "```\n"
-    "The available information is sufficient to make a diagnosis.\n"
-    "Diagnosis: <final diagnosis>\n"
-    "Reason: <brief justification>\n"
     "```"
 )
 initial_inquiry = (
-    "- Patient Information: ___ y/o F\n"
-    "- Chief Complaint: Early satiety, weight loss, abdominal pain\n"
-    "- HPI: 1-month weight loss (10 lbs), early satiety, fatigue; prior emesis; reduced intake; denies fever/chills.\n"
-    "- PMH: Asthma, hyperlipidemia, HTN, osteoarthritis, polymyalgia rheumatica, CAD (NSTEMI), osteoporosis, H. pylori, s/p TAH/USO.\n"
     "- Allergy: Lisinopril."
 )
@@ -191,6 +213,7 @@ The following tables are taken directly from the project evaluation. For evaluat
 | MDAgent          | -    | 2024.10| 21.64            |
 | **Our Method**   |      |        |                  |
 | DiagAgent-14B    | 14B  | -      | 32.86            |
 ## Training Details
@@ -207,7 +230,8 @@ DiagAgent‑8B is optimized with multi‑turn RL (GRPO) inside `DiagGym`.
 For implementation and scripts, see `DiagAgent/train/rl/` in the GitHub.
-## Citation
 ```
 @misc{qiu2025evolvingdiagnosticagentsvirtual,
       title={Evolving Diagnostic Agents in a Virtual Clinical Environment},
@@ -220,8 +244,4 @@ For implementation and scripts, see `DiagAgent/train/rl/` in the GitHub.
 }
 ```
-## Contact
-- Email: henrychur@sjtu.edu.cn
-- GitHub: https://github.com/MAGIC-AI4Med/DiagGym

 ---
 base_model:
 - meta-llama/Llama-3.1-8B-Instruct
+language:
+- en
+license: apache-2.0
 tags:
 - medical
 - diagnosis
 - RL
+library_name: transformers
+pipeline_tag: text-generation
 ---
 # DiagAgent-8B: RL-Optimized Diagnostic Agent
 <div align="center">
 DiagAgent‑8B is trained end‑to‑end inside the `DiagGym` virtual clinical environment with multi‑turn RL (GRPO), enabling safe, closed‑loop learning without real‑world risk.
 Details can be found in our paper https://arxiv.org/abs/2510.24654
+Code: https://github.com/MAGIC-AI4Med/DiagGym
 ## Quickstart
 SYSTEM_PROMPT = (
     "You are a medical AI assistant. Analyze patient information, suggest relevant tests, "
+    "and provide a final diagnosis when sufficient information is available.
+"
+    "RESPONSE FORMAT:
+"
+    "If more information is needed:
+"
+    "```
+"
+    "Current diagnosis: <your current best diagnosis>
+"
+    "Based on the patient's initial presentation, the following investigation(s) should be performed: <one additional test>
+"
+    "Reason: <reason for the test>
+"
+    "```
+"
+    "If sufficient information exists for diagnosis:
+"
+    "```
+"
+    "The available information is sufficient to make a diagnosis.
+"
+    "Diagnosis: <final diagnosis>
+"
+    "Reason: <brief justification>
+"
     "```"
 )
 initial_inquiry = (
+    "- Patient Information: ___ y/o F
+"
+    "- Chief Complaint: Early satiety, weight loss, abdominal pain
+"
+    "- HPI: 1-month weight loss (10 lbs), early satiety, fatigue; prior emesis; reduced intake; denies fever/chills.
+"
+    "- PMH: Asthma, hyperlipidemia, HTN, osteoarthritis, polymyalgia rheumatica, CAD (NSTEMI), osteoporosis, H. pylori, s/p TAH/USO.
+"
     "- Allergy: Lisinopril."
 )
 | MDAgent          | -    | 2024.10| 21.64            |
 | **Our Method**   |      |        |                  |
 | DiagAgent-14B    | 14B  | -      | 32.86            |
+---
 ## Training Details
 For implementation and scripts, see `DiagAgent/train/rl/` in the GitHub.
+## 📝Citation & Contact
 ```
 @misc{qiu2025evolvingdiagnosticagentsvirtual,
       title={Evolving Diagnostic Agents in a Virtual Clinical Environment},
 }
 ```
+For any inquiries or feedback, don’t hesitate to contact henrychur@sjtu.edu.cn.