---
language:
- en
- es
- fr
- hi
- it
- bn
- gu
- ml
- te
license: mit
datasets:
- hallucination_dataset_100k
- SHROOM-CAP
- LibreEval
- FactCHD
pipeline_tag: text-classification
base_model:
- FacebookAI/xlm-roberta-large
---
# SVNIT-AGI at SHROOM-CAP 2025: Multilingual Hallucination Detection Model
## Model Description
The model is a fine-tuned XLM-RoBERTa-Large classifier for scientific hallucination detection across 9 languages, built with the Hugging Face Transformers library.
- **Developed by:** Harsh Rathva, Pruthwik Mishra, Shrikant Malviya
- **Funded by:** Sardar Vallabhbhai National Institute of Technology, Surat
- **License:** MIT
- **Finetuned from model:** `xlm-roberta-large`
- **Competition:** SHROOM-CAP 2025 Shared Task (2nd place in Gujarati)
## Model Sources
- **Repository:** [https://github.com/ezylopx5/SHROOM-CAP2025](https://github.com/ezylopx5/SHROOM-CAP2025)
- **Paper:** https://arxiv.org/abs/2511.18301
## Uses
The model can be directly used for detecting hallucinations in scientific text across 9 languages:
- **Training Languages:** English (en), Spanish (es), French (fr), Hindi (hi), Italian (it)
- **Zero-shot Languages:** Bengali (bn), Gujarati (gu), Malayalam (ml), Telugu (te)
## How to Get Started with the Model
```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
# Load model and tokenizer
model_name = "Haxxsh/XLMRHallucinationDetectorSHROOMCAP"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)
def detect_hallucination(text):
    """Detect if text contains scientific hallucinations."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=256)
    with torch.no_grad():
        outputs = model(**inputs)
    predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
    label = "HALLUCINATED" if predictions[0][1] > 0.5 else "CORRECT"
    confidence = predictions[0][1].item() if label == "HALLUCINATED" else predictions[0][0].item()
    return {"label": label, "confidence": confidence}

# Example usage
test_texts = [
    "The protein folding mechanism involves quantum tunneling effects at room temperature.",
    "Water boils at 100°C at standard atmospheric pressure.",
    "Einstein discovered the theory of relativity in 1905 with his paper on special relativity.",
]

for text in test_texts:
    result = detect_hallucination(text)
    print(f"Text: {text}")
    print(f"Prediction: {result['label']} (confidence: {result['confidence']:.4f})\n")
```
## Label Mapping
- `0`: CORRECT (factually accurate scientific text)
- `1`: HALLUCINATED (contains factual errors or fabrications)
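The same mapping can also be applied through the `pipeline` API. A minimal sketch, assuming the checkpoint's config exposes the generic `LABEL_0`/`LABEL_1` names rather than the mapping above:

```python
from transformers import pipeline

# Text-classification pipeline over the same checkpoint as in the quick-start example.
clf = pipeline("text-classification", model="Haxxsh/XLMRHallucinationDetectorSHROOMCAP")

# Assumption: if id2label is not set in the config, labels come back as LABEL_0/LABEL_1.
label_map = {"LABEL_0": "CORRECT", "LABEL_1": "HALLUCINATED"}

out = clf("Water boils at 100°C at standard atmospheric pressure.")[0]
print(label_map.get(out["label"], out["label"]), round(out["score"], 4))
```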
## Downstream Use
Can be integrated into:
- Scientific writing assistants
- LLM output verification systems (see the sketch after this list)
- Academic paper review tools
- Multilingual fact-checking pipelines
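As a rough illustration of the verification use case, the `detect_hallucination` helper from the quick-start example can gate LLM outputs before they reach users. This is a minimal sketch, not part of the released code; the threshold is an assumption to tune on your own data:

```python
def filter_llm_outputs(candidates, threshold=0.5):
    """Keep only candidate sentences the detector does not flag as hallucinated.

    Assumes `detect_hallucination` from the quick-start snippet is in scope;
    the 0.5 threshold is illustrative, not a tuned value.
    """
    kept = []
    for sentence in candidates:
        result = detect_hallucination(sentence)
        if not (result["label"] == "HALLUCINATED" and result["confidence"] >= threshold):
            kept.append(sentence)
    return kept
```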
## Out-of-Scope Use
- The model is trained specifically on scientific-domain text
- It may not perform well on general-domain hallucinations
- It is limited to the 9 languages listed above
## Limitations
- Performance varies across languages (best in Gujarati, competitive in others)
- Trained primarily on scientific text, may not generalize to other domains
- Requires domain adaptation for highly specialized scientific fields
## Training Details
### Training Data
- **Total Samples:** 124,821 balanced samples (50% correct, 50% hallucinated); see the balancing sketch after this list
- **Sources:** Unified dataset from SHROOM-CAP, hallucination_dataset_100k, LibreEval, FactCHD
- **Languages:** 9 languages with cross-lingual transfer
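The preprocessing script is not included in this card; the sketch below only illustrates how a pooled corpus could be balanced to the 50/50 label split described above. The file name and the `text`/`label` column names are assumptions:

```python
import pandas as pd

# Hypothetical pooled corpus with `text` and `label` (0 = CORRECT, 1 = HALLUCINATED) columns.
df = pd.read_csv("unified_hallucination_corpus.csv")

# Downsample the majority class so both labels contribute equally, then shuffle.
n_per_class = df["label"].value_counts().min()
balanced = (
    df.groupby("label", group_keys=False)
      .apply(lambda g: g.sample(n=n_per_class, random_state=42))
      .sample(frac=1.0, random_state=42)
      .reset_index(drop=True)
)
print(balanced["label"].value_counts())
```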
### Training Procedure
- **Base Model:** XLM-RoBERTa-Large (560M parameters)
- **Training Regime:** Full fine-tuning (not LoRA/PEFT)
- **Training Batch Size:** 32 effective (per-device batch size 16 with 2 gradient accumulation steps)
- **Learning Rate:** 2e-5
- **Weight Decay:** 0.01
- **Epochs:** 3
- **Sequence Length:** 256 tokens
### Training Hyperparameters
```json
{
"per_device_train_batch_size": 16,
"gradient_accumulation_steps": 2,
"learning_rate": 2e-5,
"num_train_epochs": 3,
"max_seq_length": 256,
"warmup_ratio": 0.1,
"weight_decay": 0.01
}
```
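The exact training script lives in the linked repository; as a rough sketch, the configuration above maps onto Hugging Face `TrainingArguments` roughly as follows (the output path and dataset objects are placeholders, and `max_seq_length` is applied at tokenization rather than here):

```python
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("FacebookAI/xlm-roberta-large")
model = AutoModelForSequenceClassification.from_pretrained(
    "FacebookAI/xlm-roberta-large", num_labels=2
)

args = TrainingArguments(
    output_dir="xlmr-shroom-cap",   # placeholder path
    per_device_train_batch_size=16,
    gradient_accumulation_steps=2,  # effective batch size 32
    learning_rate=2e-5,
    num_train_epochs=3,
    warmup_ratio=0.1,
    weight_decay=0.01,
)

# `train_ds` / `eval_ds` stand in for the tokenized corpus (max_length=256):
# trainer = Trainer(model=model, args=args, train_dataset=train_ds, eval_dataset=eval_ds)
# trainer.train()
```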
## Evaluation
### Competition Results (SHROOM-CAP 2025)
| Language | Rank | Factuality F1 | Fluency F1 |
|----------|------|---------------|------------|
| Gujarati (gu) | 🥈 2nd | 0.5107 | 0.1579 |
| Bengali (bn) | 4th | 0.4449 | 0.2542 |
| Hindi (hi) | 4th | 0.4906 | 0.4353 |
| Spanish (es) | 5th | 0.4938 | 0.4607 |
| French (fr) | 5th | 0.4771 | 0.2899 |
| Telugu (te) | 5th | 0.4738 | 0.1474 |
| Malayalam (ml) | 5th | 0.4704 | 0.3593 |
| English (en) | 6th | 0.4246 | 0.4495 |
| Italian (it) | 5th | 0.3149 | 0.4582 |
### Metrics
- **Primary:** Macro F1 Score (see the scoring sketch after this list)
- **Validation Performance:** 0.8510 F1
- **Competition Performance:** ~0.40-0.51 factuality F1; the drop from validation is attributed to distribution shift between the training data and the official test set
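For reference, the macro F1 metric used above can be computed with scikit-learn. A minimal sketch with toy labels (the arrays are placeholders, not competition data):

```python
from sklearn.metrics import f1_score

# Toy gold labels and predictions (0 = CORRECT, 1 = HALLUCINATED); replace with real outputs.
y_true = [0, 1, 1, 0, 1, 0]
y_pred = [0, 1, 0, 0, 1, 1]

print(f"Macro F1: {f1_score(y_true, y_pred, average='macro'):.4f}")
```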
## Compute Infrastructure
- **Hardware:** NVIDIA H200 GPU (141GB VRAM)
- **Training Time:** 1 hour 14 minutes
- **Framework:** PyTorch, HuggingFace Transformers
## Model Size
- **Parameters:** 560M
- **File format:** SafeTensors
- **Tensor type:** F32
## Acknowledgements
- SHROOM-CAP 2025 Organizers for the shared task
- Lightning AI for H200 GPU infrastructure
- HuggingFace for the XLM-RoBERTa-Large model
- All dataset contributors
---