Update README.md
README.md
CHANGED
@@ -62,7 +62,7 @@ The training dataset comprises 555,000 samples from the following sources:
 - Medical-R1-Distill-Data-Chinese: 17,000 samples
 - UCSC-VLAA/m23k-tokenized: 23,487 samples
 
-### 2. Synthetic Medical QA Data with QwQ
+### 2. Synthetic Medical QA Data with QwQ (225,700 samples)
 Generated from established medical datasets:
 - MedMcQA (from openlifescienceai/medmcqa): 183,000 samples
 - MedQA: 10,000 samples