Initial upload: ELECTRA-base fine-tuned on LOGIC (zhijin splits)

8327c1f verified about 2 months ago

3.25 kB

license: apache-2.0
base_model: google/electra-base-discriminator
tags:
  - text-classification
  - fallacy-detection
  - logical-fallacy
language:
  - en
datasets:
  - logical-fallacy-detection
metrics:
  - accuracy
  - f1
pipeline_tag: text-classification

ELECTRA-base fine-tuned for logical fallacy classification

13-way classifier fine-tuned on the LOGIC dataset from Jin et al. 2022, Logical Fallacy Detection (arXiv:2202.13758).

Base model: google/electra-base-discriminator.

Labels

ad hominem, ad populum, appeal to emotion, circular reasoning, equivocation,
fallacy of credibility, fallacy of extension, fallacy of logic,
fallacy of relevance, false causality, false dilemma, faulty generalization,
intentional

Training

Data: LOGIC train split, 1849 examples / 13 classes (zhijin/zhijingjin splits; dev 300, test 300).
Hyperparams: 3 epochs, lr 5e-5, weight decay 0.01, warmup ratio 0.1, batch 8, max_len 128, seed 42, best checkpoint by val macro-F1.
Hardware: CPU (4 threads), ~23 min wall.

Evaluation

In-domain (LOGIC test, n=300)

metric	value
accuracy	0.643
macro-F1	0.552
weighted-F1	0.625

Comparable to the paper's plain-ELECTRA baseline (~0.533 F1 in Table 3).

Zero-shot transfer (LOGICCLIMATE, n=1312)

metric	value
accuracy	0.210
macro-F1	0.183

Sharp drop on out-of-domain transfer, in line with the paper's Table 4 findings (their best model drops from 0.588 to 0.272 F1).

Usage

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

repo = "heavyhelium/electra-base-logic-fallacy"
tok = AutoTokenizer.from_pretrained(repo)
model = AutoModelForSequenceClassification.from_pretrained(repo).eval()

text = "Everyone I know drives a Toyota, so Toyotas must be the best cars."
enc = tok(text, return_tensors="pt", truncation=True, max_length=128)
with torch.no_grad():
    pred_id = model(**enc).logits.argmax(-1).item()
print(model.config.id2label[pred_id])  # -> ad populum

Limitations

Poor cross-domain generalization. Drops ~0.37 macro-F1 from educational text (LOGIC) to climate-change news (LOGICCLIMATE). Do not trust predictions far from the training domain.
Data imbalance bias. Rare classes (equivocation, ~2% of training data) are under-predicted; equivocation test F1 is 0.00 in both in-domain and transfer settings.
Short-text bias. Training examples are mostly 1-2 sentence educational quiz items (median ~100 characters). Longer or structurally different text may degrade.
Single-label. Each input is forced into exactly one of 13 classes; real-world text often exhibits multiple fallacies or none.

Citation

@inproceedings{jin-etal-2022-logical,
    title = "Logical Fallacy Detection",
    author = "Jin, Zhijing and Lalwani, Abhinav and Vaidhya, Tejas and Shen, Xiaoyu and Ding, Yiwen and Lyu, Zhiheng and Sachan, Mrinmaya and Mihalcea, Rada and Sch{\"o}lkopf, Bernhard",
    booktitle = "Findings of EMNLP 2022",
    year = "2022",
    url = "https://arxiv.org/abs/2202.13758",
}