Commit 7c0779f (verified) by MedInjection-FR · 1 parent: d89a8a4

Update README.md

Files changed (1)
  1. README.md +18 -1
README.md CHANGED
@@ -6,5 +6,22 @@ colorTo: gray
  sdk: static
  pinned: false
  ---
+ # 🩹 MedInjection-FR

- Edit this `README.md` markdown file to author your organization card.
+ A **French biomedical instruction dataset and model suite** for studying how data provenance (**native, synthetic, translated**) impacts instruction-tuning of LLMs. [huggingface](https://huggingface.co/docs/hub/organizations-cards)
+
+ ## 📊 Dataset Stats
+
+ **Total size**: 571,436 instruction–response pairs
+
+ **Components**:
+ - Native: 77,247
+ - Synthetic: 76,506
+ - Translated: 417,674
+
+ **Tasks**:
+ - MCQU (single-answer)
+ - MCQ (multi-answer)
+ - OEQ (open-ended)
+
+ ***
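
The component counts added to the README can be tallied to see how the three provenance types are balanced. The sketch below is illustrative only: the variable names are hypothetical and the numbers are copied from the diff above, not read from the dataset itself. The three listed components sum to 571,427, which is 9 fewer than the stated total of 571,436; the commit does not explain the difference, so the shares are computed against the component sum.

```python
# Counts copied from the "Dataset Stats" section of the README diff.
# Names are illustrative; nothing here is loaded from the actual dataset.
components = {
    "Native": 77_247,
    "Synthetic": 76_506,
    "Translated": 417_674,
}
stated_total = 571_436  # "Total size" as written in the README

# Sum of the listed components and its gap to the stated total.
component_sum = sum(components.values())
gap = stated_total - component_sum

# Share of each provenance type, relative to the component sum.
shares = {name: count / component_sum for name, count in components.items()}

for name, count in components.items():
    print(f"{name}: {count:,} pairs ({shares[name]:.1%})")
print(f"Sum of components: {component_sum:,} (stated total: {stated_total:,}, gap: {gap})")
```

Translated pairs make up the large majority of the listed components, which is worth keeping in mind when comparing models tuned on each provenance type.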