MedInjection-FR committed 4 days ago

Commit 7c0779f (verified) · 1 parent: d89a8a4

Update README.md

Files changed (1): README.md

@@ -6,5 +6,22 @@ colorTo: gray
 sdk: static
 pinned: false
 ---
-Edit this `README.md` markdown file to author your organization card.

+# 🩹 MedInjection-FR
+A **French biomedical instruction dataset and model suite** for studying how data provenance (**native, synthetic, translated**) impacts instruction-tuning of LLMs. [huggingface](https://huggingface.co/docs/hub/organizations-cards)
+## 📊 Dataset Stats
+**Total size**: 571,436 instruction–response pairs
+**Components**:
+- Native: 77,247
+- Synthetic: 76,506
+- Translated: 417,674
+**Tasks**:
+- MCQU (single-answer)
+- MCQ (multi-answer)
+- OEQ (open-ended)
+***