tuenguyen committed on
Commit 3e6ac18 · verified · 1 Parent(s): 7bb86fc

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -10,7 +10,7 @@ tags: []
  II-Medical-7B-Preview is a medical reasoning model trained on a [comprehensive dataset](https://huggingface.co/datasets/Intelligent-Internet/II-Medical-Reasoning-SFT-V0) of medical knowledge. The model is designed to enhance AI capabilities in medical.
  ## II. Training Methodology
 
- We collected and generated a comprehensive set of reasoning datasets for the medical domain and performed SFT fine-tuning on the Qwen/Qwen2.5-7B-Instruct model. Following this, we further optimized the SFT model by training DAPO on a hard-reasoning dataset to boost performance.
+ We collected and generated a comprehensive set of reasoning datasets for the medical domain and performed SFT fine-tuning on the **Qwen/Qwen2.5-7B-Instruct** model. Following this, we further optimized the SFT model by training DAPO on a hard-reasoning dataset to boost performance.
 
  For SFT stage we using the hyperparameters:
 
@@ -46,7 +46,7 @@ Journal of Medicine, 4 Options and 5 Options splits from the MedBullets platfo
  | Med-reason | 61.67 | 71.87 | 77.4 | 64.1 | 50.51| 59.7 | 60.06 | 54.22 |22.87 |66.8 | 59.92 |
  | M1 | 62.54 | 75.81 | 75.80 | 65.86 | 53.08| 62.62 | 63.64 | 59.74 |19.59 |64.34 | 60.3 |
  | II-Medical-7B-Preview-Wo-RL | 69.13 | 84.05 | 77.5 | 73.49 | 55.12| **67.71** | 69.48 | 64.28 |19.51 |**70.64** | 65.1 |
- | II-Medical-7B-Preview-RL | **69.42** | **85.15** | 77.9 | **77.26** | **55.90**| **65.29** | **72.72** | **68.50** |**22.97** |68.66 | **66.4** |
+ | II-Medical-7B-Preview-RL | **69.42** | **85.15** | 77.9 | **77.26** | **55.90**| 65.29 | **72.72** | **68.50** |**22.97** |68.66 | **66.4** |
 
 
 