File size: 516 Bytes
d89a8a4
 
 
 
 
 
 
 
7c0779f
d89a8a4
f90ff39
7c0779f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
---
title: README
emoji: 👁
colorFrom: purple
colorTo: gray
sdk: static
pinned: false
---
# 🩹 MedInjection-FR

A **French biomedical instruction dataset and model suite** for studying how data provenance (**native, synthetic, translated**) impacts instruction-tuning of LLMs.

## 📊 Dataset Stats

**Total size**: 571,436 instruction–response pairs

**Components**:
- Native: 77,247
- Synthetic: 76,506  
- Translated: 417,674

**Tasks**:
- MCQU (single-answer)
- MCQ (multi-answer)
- OEQ (open-ended)

***