File size: 2,404 Bytes
d269178
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
609967a
d269178
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
---
license: apache-2.0
base_model:
- dicta-il/neodictabert
---
------
license: apache-2.0
language:
- he
base_model:
- dicta-il/neodictabert
tags:
- multi-label-classification
- hebrew
- fact-checking
- error-detection
pipeline_tag: text-classification
library_name: transformers
---

# Hebrew Multi-Label Error Type Classifier (Setup 2: Claim-Level)

## Model Description

Fine-tuned [dicta-il/neodictabert](https://huggingface.co/dicta-il/neodictabert) for multi-label classification of factual error types in Hebrew text at the claim/sentence level.

**Task:** Identify specific error types in corrupted Hebrew claims  
**Language:** Hebrew  
**Max Context:** 3,072 tokens  
**Granularity:** Sentence-level (individual claims)

## Input/Output

### Input
- **Premise:** Original correct claim (Hebrew sentence)
- **Hypothesis:** Summary claim (Hebrew sentence)

### Output
Multi-label classification with probability scores for each error type. Threshold: 0.5 (probabilities โ‰ฅ 0.5 indicate presence of that error type).

## Supported Error Types

The model detects both **cross-language** and **Hebrew-specific** error types:

### Cross-Language Errors
- `entity_person_swap` - Swapping person names
- `entity_location_swap` - Swapping place names
- `entity_organization_swap` - Swapping organization names
- `entity_date_swap` - Changing dates
- `number_swap` - Altering numerical values
- `measure_unit_swap` - Changing units of measurement
- `sentence_negation` - Adding/removing negation


### Hebrew-Specific Errors
- `hebrew_root_pattern_confusion` - Morphological root/binyan errors
- `morphological_connective_confusion` - Logical connector errors (ืฉื›ืŸ โ†” ื›ืš ืฉ)
- `verb_gender_swap` - Verb gender disagreement
- `noun_gender_swap` - Noun/pronoun gender errors
- `homographic_gender_errors` - Gender mismatches in homographic words (ืžื•ืจื”, ืžืจืฆื”)
- `specificity_shift_errors` - Definite article changes (ื”-)
- `construct_state_confusion` - Smichut (ืกืžื™ื›ื•ืช) errors
- `subject_verb_person_mismatch` - Person agreement errors
- `verb_tense_swap` - Tense inconsistencies
- `evidentiality_source_attribution_collapse` - Attribution removal
- `impersonal_to_personal_verb_errors` - Voice changes (passive โ†” active)
- `verb_template_agency_reversal` - Agency reversals (ื”ืœื‘ื™ืฉ โ†” ื”ืชืœื‘ืฉ)
- `directional_preposition_swap` - Directional errors (ืž-X ืœ-Y โ†” ืž-Y ืœ-X)