| | --- |
| | license: apache-2.0 |
| | language: |
| | - ko |
| | - en |
| | base_model: |
| | - Qwen/Qwen2.5-72B |
| | tags: |
| | - medical |
| | - clinical |
| | - QA |
| | - benchmark |
| | - healthcare |
| | - korean |
| | --- |
| | |
| | ๐ง **Korean Medical LLM (QA-Finetuned) by Healthcare AI Research Institute of Seoul National University Hospital** |
| |
|
| | Welcome to the official repository of the **Korean Medical Large Language Model (LLM)** developed by the **Healthcare AI Research Institute (HARI)** at **Seoul National University Hospital (SNUH)**. |
| |
|
| | This model is **fine-tuned on Korean medical questionโanswering (QA) style data**, enabling robust performance in clinical reasoning, educational Q&A, and domain-specific medical inference. |
| |
|
| | --- |
| |
|
| | ## ๐ Model Overview |
| |
|
| | * **Model Name**: `snuh/hari-q2.5-thinking` |
| | * **Architecture**: Large Language Model (LLM) |
| | * **Fine-tuning Objective**: Medical QA (QuestionโAnswer) style generation |
| | * **Primary Language**: English, Korean |
| | * **Domain**: Clinical Medicine |
| | * **Performance**: Achieves **89.2% accuracy** on the **Korean Medical Licensing Examination (KMLE)** |
| | * **Key Applications**: |
| | * Clinical decision support (QA-style) |
| | * Medical education and self-assessment tools |
| | * Automated medical reasoning and documentation aid |
| |
|
| | --- |
| |
|
| | ## ๐ Training Data & Benchmark |
| |
|
| | This model was fine-tuned using a curated corpus of Korean medical QA-style data derived from **publicly available, de-identified sources**. The training data includes clinical guidelines, academic publications, exam-style questions, and synthetic prompts reflecting real-world clinical reasoning. |
| |
|
| | * **Training Data Characteristics**: |
| | - Focused on Korean-language questionโanswering formats relevant to clinical settings. |
| | - Includes guideline-derived questions, de-identified case descriptions, and physician-crafted synthetic queries. |
| | - Designed to reflect realistic diagnostic, therapeutic, and decision-making scenarios. |
| |
|
| | * **Benchmark Evaluation**: |
| | - **KMLE QA benchmark(KorMedMCQA 5-shot)** |
| | - Doctor: 89.20% |
| | - Nurse: 90.99% |
| | - Pharm: 90.94% |
| | - Dentist: 72.96% |
| | - **USMLE QA benchmark(MedQA-USMLE 0-shot)** |
| | - 88.36% |
| | - All evaluations were conducted on de-identified, non-clinical test sets, with no real patient data involved. |
| |
|
| | > โ ๏ธ These benchmarks are provided for research purposes only and do not imply clinical safety or efficacy. |
| |
|
| | --- |
| |
|
| | ## ๐ Privacy & Ethical Compliance |
| |
|
| | We strictly adhere to ethical AI development and privacy protection: |
| |
|
| | * โ
The model was trained exclusively on **publicly available and de-identified data**. |
| | * ๐ It does **not include any real patient data or personally identifiable information (PII)**. |
| | * โ๏ธ Designed for **safe, responsible, and research-oriented** use in healthcare AI. |
| |
|
| | > โ ๏ธ This model is intended for **research and educational purposes only** and should **not** be used to make clinical decisions. |
| |
|
| | --- |
| |
|
| | ## ๐ฅ About HARI โ Healthcare AI Research Institute |
| |
|
| | The **Healthcare AI Research Institute (HARI)** is a pioneering research group within **Seoul National University Hospital**, driving innovation in medical AI. |
| |
|
| | ### ๐ Vision & Mission |
| |
|
| | * **Vision**: Shaping a sustainable and healthy future through pioneering AI research. |
| | * **Mission**: |
| | * Develop clinically useful, trustworthy AI technologies. |
| | * Foster cross-disciplinary collaboration in medicine and AI. |
| | * Lead global healthcare AI commercialization and policy frameworks. |
| | * Educate the next generation of AI-powered medical professionals. |
| |
|
| | --- |
| |
|
| | ## ๐งช Research Platforms & Infrastructure |
| |
|
| | * **Platforms**: SUPREME, SNUHUB, DeView, VitalDB, KHDP |
| | * **Computing**: NVIDIA B200 / H100 / A100 GPUs |
| | * **Projects**: |
| | * Clinical note summarization |
| | * AI-powered diagnostics |
| | * EHR automation |
| | * Real-time monitoring via AI pipelines |
| |
|
| | --- |
| |
|
| | ## ๐ AI Education Programs |
| |
|
| | * **Basic AI for Healthcare**: Designed for clinicians and students |
| | * **Advanced AI Research**: Targeting senior researchers and specialists in clinical AI validation and deep learning |
| |
|
| | --- |
| |
|
| | ## ๐ค Collaborate with Us |
| |
|
| | We welcome collaboration with: |
| |
|
| | * AI research institutions and medical universities |
| | * Healthcare startups and technology partners |
| | * Policymakers shaping AI regulation in medicine |
| |
|
| | ๐ง **Contact**: [hhoon@snu.ac.kr](mailto:hhoon@snu.ac.kr) |
| | ๐ **Website**: [Seoul National University Hospital](https://www.snuh.org) |
| |
|
| | --- |
| |
|
| | ## ๐ค Model Usage Example |
| |
|
| | ```python |
| | from transformers import AutoTokenizer, AutoModelForCausalLM |
| | import torch |
| | |
| | # Load tokenizer and model |
| | model_name = "snuh/hari-q2.5-thinking" |
| | |
| | model = AutoModelForCausalLM.from_pretrained( |
| | model_name, |
| | torch_dtype="auto", |
| | device_map="auto" |
| | ) |
| | tokenizer = AutoTokenizer.from_pretrained(model_name) |
| | |
| | prompt = ''' |
| | ### Instruction: |
| | ๋น์ ์ ์์ ์ง์์ ๊ฐ์ถ ์ ๋ฅํ๊ณ ์ ๋ขฐํ ์ ์๋ ํ๊ตญ์ด ๊ธฐ๋ฐ ์๋ฃ ์ด์์คํดํธ์
๋๋ค. |
| | ์ฌ์ฉ์์ ์ง๋ฌธ์ ๋ํด ์ ํํ๊ณ ์ ์คํ ์์ ์ถ๋ก ์ ๋ฐํ์ผ๋ก ์ง๋จ ๊ฐ๋ฅ์ฑ์ ์ ์ํด ์ฃผ์ธ์. |
| | ๋ฐ๋์ ํ์์ ์ฐ๋ น, ์ฆ์, ๊ฒ์ฌ ๊ฒฐ๊ณผ, ํต์ฆ ๋ถ์ ๋ฑ ๋ชจ๋ ๋จ์๋ฅผ ์ข
ํฉ์ ์ผ๋ก ๊ณ ๋ คํ์ฌ ์ถ๋ก ๊ณผ์ ๊ณผ ์ง๋จ๋ช
์ ์ ์ํด์ผ ํฉ๋๋ค. |
| | ์ํ์ ์ผ๋ก ์ ํํ ์ฉ์ด๋ฅผ ์ฌ์ฉํ๋, ํ์ํ๋ค๋ฉด ์ผ๋ฐ์ธ์ด ์ดํดํ๊ธฐ ์ฌ์ด ์ฉ์ด๋ ๋ณํํด ์ค๋ช
ํด ์ฃผ์ธ์. |
| | |
| | ### Question: |
| | 60์ธ ๋จ์ฑ์ด ๋ณตํต๊ณผ ๋ฐ์ด์ ํธ์ํ๋ฉฐ ๋ด์ํ์์ต๋๋ค. |
| | ํ์ก ๊ฒ์ฌ ๊ฒฐ๊ณผ ๋ฐฑํ๊ตฌ ์์น๊ฐ ์์นํ๊ณ , ์ฐ์ธก ํ๋ณต๋ถ ์ํต์ด ํ์ธ๋์์ต๋๋ค. |
| | ๊ฐ์ฅ ๊ฐ๋ฅ์ฑ์ด ๋์ ์ง๋จ๋ช
์ ๋ฌด์์ธ๊ฐ์? |
| | '''.strip() |
| | |
| | messages = [ |
| | {"role": "user", "content": prompt} |
| | ] |
| | text = tokenizer.apply_chat_template( |
| | messages, |
| | tokenize=False, |
| | add_generation_prompt=True |
| | ) |
| | model_inputs = tokenizer([text], return_tensors="pt").to(model.device) |
| | |
| | generated_ids = model.generate( |
| | **model_inputs, |
| | max_new_tokens=512 |
| | ) |
| | generated_ids = [ |
| | output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids) |
| | ] |
| | |
| | response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0] |
| | print(response) |
| | ```` |
| |
|
| | --- |
| |
|
| | ## ๐ License |
| |
|
| | **Apache 2.0 License** โ Free for research and commercial use with attribution. |
| |
|
| | --- |
| |
|
| | ## ๐ข Citation |
| |
|
| | If you use this model in your work, please cite: |
| |
|
| | ``` |
| | @misc{hari-q2.5-thinking, |
| | title = {hari-q2.5-thinking}, |
| | url = {https://huggingface.co/snuh/hari-q2.5-thinking}, |
| | author = {Healthcare AI Research Institute(HARI) of Seoul National University Hospital(SNUH)}, |
| | month = {December}, |
| | year = {2025} |
| | } |
| | ``` |
| |
|
| | --- |
| | ## ๐ Together, we are shaping the future of AI-driven healthcare. |
| | --- |
| |
|
| | ## Acknowlegments |
| | This work was supported by Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korea government (MSIT) (RS-2025-02653113, High-Performance Research AI Computing Infrastructure Support at the 2 PFLOPS Scale) |