Hebrew Recipe Modification NER
DictaBERT-large fine-tuned for recipe modification extraction from Hebrew YouTube comments. Trained with class weighting (P1) on silver labels from a 3-pass LLM teacher pipeline (v2).
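The exact weighting scheme behind P1 is not described here. As a hedged illustration only, one common choice is inverse-frequency weighting, where rare tags receive a larger loss weight than the dominant `O` tag. The function name `inverse_frequency_weights` and the toy tag sequence are hypothetical and not taken from the training code.

```python
from collections import Counter

def inverse_frequency_weights(tags):
    """Return {tag: weight} with weight = total / (num_classes * count)."""
    counts = Counter(tags)
    total = sum(counts.values())
    num_classes = len(counts)
    return {tag: total / (num_classes * n) for tag, n in counts.items()}

# Toy example (made up): "O" dominates, as it typically does in NER data.
toy_tags = ["O"] * 8 + ["B-SUBSTITUTION", "I-SUBSTITUTION"]
weights = inverse_frequency_weights(toy_tags)
```

Weights of this kind would typically be passed to a weighted cross-entropy loss, for example through the `weight` argument of `torch.nn.CrossEntropyLoss`.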
Labels
- B/I-SUBSTITUTION: ingredient substitution
- B/I-ADDITION: ingredient addition
- B/I-QUANTITY: quantity change
- B/I-TECHNIQUE: technique change
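For readers unfamiliar with BIO tagging, the sketch below expands the four entity types into a full tag set. It assumes the usual `O` tag for tokens outside any span and an arbitrary ordering; the checkpoint's real mapping lives in `model.config.id2label` and may differ.

```python
ENTITY_TYPES = ["SUBSTITUTION", "ADDITION", "QUANTITY", "TECHNIQUE"]

# "O" marks tokens outside any span; each type gets a B- (begin) and I- (inside) tag.
LABELS = ["O"] + [f"{prefix}-{t}" for t in ENTITY_TYPES for prefix in ("B", "I")]

# Assumed ordering for illustration; not read from the published checkpoint.
label2id = {label: i for i, label in enumerate(LABELS)}
id2label = {i: label for label, i in label2id.items()}
```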
Usage
from transformers import pipeline
pipe = pipeline("token-classification",
                model="DanielDDDS/hebrew-recipe-modification-ner",
                aggregation_strategy="simple")
# Example comment: "Butter can be replaced with coconut oil"
pipe("אפשר להחליף חמאה בשמן קוקוס")
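With `aggregation_strategy="simple"`, the pipeline returns a list of dicts with `entity_group`, `score`, `word`, `start`, and `end` keys. The sketch below groups such spans by modification type. The helper `group_by_type`, the `min_score` threshold of 0.5, and the mocked entities are all hypothetical; the mock stands in for real model output so the snippet runs without downloading the checkpoint.

```python
from collections import defaultdict

def group_by_type(entities, min_score=0.5):
    """Collect predicted span texts per modification type."""
    grouped = defaultdict(list)
    for ent in entities:
        # Skip low-confidence spans; the threshold is an arbitrary choice.
        if ent["score"] >= min_score:
            grouped[ent["entity_group"]].append(ent["word"])
    return dict(grouped)

# Mocked entities in the shape the pipeline returns; all values are made up.
mock_entities = [
    {"entity_group": "SUBSTITUTION", "score": 0.91, "word": "coconut oil", "start": 20, "end": 31},
    {"entity_group": "QUANTITY", "score": 0.32, "word": "half", "start": 40, "end": 44},
]
result = group_by_type(mock_entities)
```

In practice, `mock_entities` would be replaced by the return value of the `pipe(...)` call.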
Performance (corrected gold test set, n=496, 38 spans)
- Exact Entity F1: 25.5%
- Relaxed Entity F1: 62.6%
- Model: DictaBERT-large + linear head, class weights (P1)
- Outperforms the LLM teacher on relaxed entity F1 (62.6% vs. the teacher's 48.4%)
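The gap between exact and relaxed F1 comes from how a predicted span is judged correct. The matching rule used for the numbers above is not spelled out, so the sketch below makes an assumption: exact matching requires identical boundaries and type, while relaxed matching accepts any character overlap with a gold span of the same type. The function `entity_f1` and the toy spans are hypothetical.

```python
def _f1(precision, recall):
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

def _overlaps(a, b):
    # Spans are (start, end, type) tuples with an exclusive end offset.
    return a[2] == b[2] and a[0] < b[1] and b[0] < a[1]

def entity_f1(gold, pred, relaxed=False):
    """Span-level F1 under exact or overlap-based (relaxed) matching."""
    if not gold or not pred:
        return 0.0
    match = _overlaps if relaxed else (lambda a, b: a == b)
    pred_hits = sum(any(match(p, g) for g in gold) for p in pred)
    gold_hits = sum(any(match(g, p) for p in pred) for g in gold)
    return _f1(pred_hits / len(pred), gold_hits / len(gold))

# Toy spans (made up): the first prediction is one character short.
gold = [(0, 5, "SUBSTITUTION"), (10, 14, "QUANTITY")]
pred = [(0, 4, "SUBSTITUTION"), (10, 14, "QUANTITY")]
exact = entity_f1(gold, pred)
relaxed = entity_f1(gold, pred, relaxed=True)
```

In this toy case the boundary error costs one exact match but not a relaxed one, which is the kind of near-miss that separates the two scores.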