|
|
--- |
|
|
license: mit |
|
|
tags: |
|
|
- unsloth |
|
|
- trl |
|
|
- grpo |
|
|
datasets: |
|
|
- FreedomIntelligence/Medical-R1-Distill-Data |
|
|
language: |
|
|
- en |
|
|
base_model: |
|
|
- unsloth/phi-4 |
|
|
pipeline_tag: text-generation |
|
|
library_name: transformers |
|
|
--- |
|
|
|
|
|
This model is a [Phi-4](https://huggingface.co/unsloth/phi-4) Large Language Model (LLM) that has been fine-tuned for medical reasoning tasks. It leverages the architecture of the Phi-4 model and has been specifically adapted to understand and process medical information, answer medical questions, and potentially assist in clinical decision-making support (for research purposes only). |
|
|
|
|
|
The fine-tuning process was carried out using [Unsloth AI](https://unsloth.ai/), a library designed for efficient and accessible LLM fine-tuning. Unsloth AI simplifies the process of adapting powerful models like Phi-4 for specific downstream tasks, making it easier to create specialized LLMs like this medical reasoning model. |
|
|
|
|
|
**Key Features:** |
|
|
|
|
|
* Based on the Phi-4 architecture, known for its efficiency and performance. |
|
|
* Fine-tuned on a medical reasoning dataset to enhance its capabilities in the medical domain. |
|
|
* Trained using Unsloth AI for optimized fine-tuning. |
|
|
|
|
|
## Dataset |
|
|
|
|
|
The model was fine-tuned on [FreedomIntelligence/Medical-R1-Distill-Data](https://huggingface.co/datasets/FreedomIntelligence/Medical-R1-Distill-Data) dataset. |
|
|
|
|
|
* **Dataset Description:** SFT dataset distilled from Deepseek-R1 (Full Power Version), based on medical verifiable problems from HuatuoGPT-o1. |
|
|
* **Source:** [FreedomIntelligence/Medical-R1-Distill-Data](https://huggingface.co/datasets/FreedomIntelligence/Medical-R1-Distill-Data) |
|
|
* **Dataset Size:** 22k rows. |
|
|
|
|
|
**Important Note on Dataset Bias:** It's important to acknowledge that medical datasets can contain biases that reflect real-world healthcare disparities. This model may inherit and amplify such biases. Users should be aware of this limitation and interpret the model's outputs with caution. |
|
|
|
|
|
## Intended Use |
|
|
|
|
|
This model is intended for **research purposes only**. Potential use cases include: |
|
|
|
|
|
* **Medical Education:** Assisting medical students and professionals in learning and reviewing medical concepts. |
|
|
* **Clinical Decision Support Research:** Exploring the potential of LLMs in providing support for clinical decision-making (always under the supervision of qualified professionals and not for direct clinical use). |
|
|
* **Medical Information Retrieval:** Improving the accuracy and relevance of medical information retrieval systems. |
|
|
* **Medical Question Answering:** Developing systems that can answer complex medical questions. |
|
|
|
|
|
**Disclaimer:** This model is **not intended for clinical use** and should **not be used as a substitute for professional medical advice, diagnosis, or treatment.** Always consult with a qualified healthcare provider for any health concerns or before making any decisions related to your health or treatment. |
|
|
|