hoanganhpham commited on
Commit
5627fa3
·
verified ·
1 Parent(s): f8b9e91

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -6,14 +6,15 @@ tags: []
6
  # II-Medical-7B-Preview
7
 
8
  <div style="display: flex; justify-content: center;">
9
- <img src="https://cdn-uploads.huggingface.co/production/uploads/67c563afa34e1ad5a3533ccf/xBJE1uk9_FGPn2N1emMFR.png" width="800">
10
  </div>
11
 
12
  ## I. Model Overview
13
 
14
  II-Medical-7B-Preview is a medical reasoning model trained on a [comprehensive dataset](https://huggingface.co/datasets/Intelligent-Internet/II-Medical-Reasoning-SFT-V0) of medical knowledge. The model is designed to enhance AI capabilities in medical.
15
 
16
- ![Model Benchmark](model_benchmark.png)
 
17
  ## II. Training Methodology
18
 
19
  We collected and generated a comprehensive set of reasoning datasets for the medical domain and performed SFT fine-tuning on the **Qwen/Qwen2.5-7B-Instruct** model. Following this, we further optimized the SFT model by training DAPO on a hard-reasoning dataset to boost performance.
 
6
  # II-Medical-7B-Preview
7
 
8
  <div style="display: flex; justify-content: center;">
9
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/6389496ff7d3b0df092095ed/73Y-oDmehp0eJ2HWrfn3V.jpeg" width="800">
10
  </div>
11
 
12
  ## I. Model Overview
13
 
14
  II-Medical-7B-Preview is a medical reasoning model trained on a [comprehensive dataset](https://huggingface.co/datasets/Intelligent-Internet/II-Medical-Reasoning-SFT-V0) of medical knowledge. The model is designed to enhance AI capabilities in medical.
15
 
16
+ ![Model Benchmark](https://cdn-uploads.huggingface.co/production/uploads/6389496ff7d3b0df092095ed/oTGtjC-ngnIZw9BpVgAHv.png)
17
+
18
  ## II. Training Methodology
19
 
20
  We collected and generated a comprehensive set of reasoning datasets for the medical domain and performed SFT fine-tuning on the **Qwen/Qwen2.5-7B-Instruct** model. Following this, we further optimized the SFT model by training DAPO on a hard-reasoning dataset to boost performance.