---
# For God so loved the world that he gave his only begotten Son,
# that whoever believes in him should not perish but have eternal life. - John 3:16
language: en
license: mit
tags:
  - bible
  - chirho
  - intertextual
  - cross-reference
  - classification
  - roberta
  - bible-ml
datasets:
  - LoveJesus/intertextual-dataset-chirho
base_model: roberta-base
metrics:
  - f1
pipeline_tag: text-classification
---

# Intertextual Classifier (Chirho)

**RoBERTa-base fine-tuned for classifying biblical cross-reference connection types.**

> "For God so loved the world that he gave his only begotten Son, that whoever believes in him should not perish but have eternal life." - John 3:16

## Model Description

Given two Bible passages that are cross-referenced, this model classifies the type of intertextual connection between them into one of 7 categories:

| Label | Description |
|-------|-------------|
| `thematic_parallel` | Passages share the same theme or topic |
| `direct_quote` | One passage directly quotes another |
| `prophetic_fulfillment` | OT prophecy fulfilled in NT |
| `typological` | OT type foreshadowing NT antitype |
| `contrast` | Passages present contrasting ideas |
| `historical_narrative` | Shared historical events or figures |
| `theological_expansion` | Later passage expands on earlier theology |

## Training Details

- **Base model**: `roberta-base` (125M params)
- **Training data**: 19,164 balanced examples (Grok-labeled from TSK cross-references)
- **Class balancing**: WeightedTrainer with inverse-frequency CrossEntropyLoss + majority class capping
- **Epochs**: 8
- **Best epoch**: 8 (by eval loss)

## Metrics (v2 - Retrained Feb 2026)

| Metric | Value |
|--------|-------|
| **Macro F1** | **0.761** |
| Micro F1 | 0.853 |
| Precision | 0.665 |
| Recall | 0.939 |
| Eval Loss | 0.501 |

### Improvement over v1

| Metric | v1 (Original) | v2 (Retrained) | Change |
|--------|---------------|----------------|--------|
| Macro F1 | 0.42 | **0.761** | +81% |
| Micro F1 | 0.72 | **0.853** | +18% |

**Root cause of v1 weakness**: 76% class imbalance (thematic_parallel dominated). Fixed with:
1. Balanced dataset (cap majority class, keep all minority examples)
2. WeightedTrainer with inverse-frequency class weights

## Usage

```python
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="LoveJesus/intertextual-classifier-chirho",
    top_k=None,
)

text = "[CLS] Genesis 3:15 And I will put enmity between thee and the woman, and between thy seed and her seed; it shall bruise thy head, and thou shalt bruise his heel. [SEP] Galatians 4:4 But when the fulness of the time was come, God sent forth his Son, made of a woman, made under the law [SEP]"

result = classifier(text)
print(result)
# [{'label': 'prophetic_fulfillment', 'score': 0.95}, ...]
```

## Part of Bible ML Pipeline

This model is part of the [Intertextual Reference Network](https://huggingface.co/spaces/LoveJesus/intertextual-reference-network-chirho) pipeline:

1. **Embedder** ([LoveJesus/intertextual-embedder-chirho](https://huggingface.co/LoveJesus/intertextual-embedder-chirho)): Finds similar passages
2. **Classifier** (this model): Classifies the connection type

Dataset: [LoveJesus/intertextual-dataset-chirho](https://huggingface.co/datasets/LoveJesus/intertextual-dataset-chirho)