NorthernTribe-Research
/

NorthernTribe-Medical-Reasoning-7B

Text Generation

clinical-reasoning

chain-of-thought

Model card Files Files and versions

NorthernTribe-Research commited on 21 days ago

Commit

ca2e1db

·

verified ·

1 Parent(s): 2ec8c15

Add README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -47,6 +47,9 @@ This model is designed for:
 - **Base Model**: [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
 - **Dataset**: `NorthernTribe-Research/comprehensive-healthbench-v2` (1.4M examples)
 - **Method**: QLoRA Fine-tuning via [Unsloth](https://github.com/unslothai/unsloth)
 - **Infrastructure**: Trained on Nvidia A100/H100 GPUs.
 ## Usage

 - **Base Model**: [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
 - **Dataset**: `NorthernTribe-Research/comprehensive-healthbench-v2` (1.4M examples)
 - **Method**: QLoRA Fine-tuning via [Unsloth](https://github.com/unslothai/unsloth)
+- **SOTA Techniques**:
+    - **NEFTune (Noisy Embeddings Fine Tuning)**: Implemented to improve model generalization and robustness.
+    - **Teacher-Student Distillation**: System prompts strictly enforce the model to act as an "Expert Medical Teacher", engaging in deep Chain-of-Thought reasoning for every response.
 - **Infrastructure**: Trained on Nvidia A100/H100 GPUs.
 ## Usage