|
|
--- |
|
|
|
|
|
|
|
|
language: en |
|
|
license: mit |
|
|
tags: |
|
|
- bible |
|
|
- chirho |
|
|
- intertextual |
|
|
- cross-reference |
|
|
- classification |
|
|
- roberta |
|
|
- bible-ml |
|
|
datasets: |
|
|
- LoveJesus/intertextual-dataset-chirho |
|
|
base_model: roberta-base |
|
|
metrics: |
|
|
- f1 |
|
|
pipeline_tag: text-classification |
|
|
--- |
|
|
|
|
|
# Intertextual Classifier (Chirho) |
|
|
|
|
|
**RoBERTa-base fine-tuned for classifying biblical cross-reference connection types.** |
|
|
|
|
|
> "For God so loved the world that he gave his only begotten Son, that whoever believes in him should not perish but have eternal life." - John 3:16 |
|
|
|
|
|
## Model Description |
|
|
|
|
|
Given two Bible passages that are cross-referenced, this model classifies the type of intertextual connection between them into one of 7 categories: |
|
|
|
|
|
| Label | Description | |
|
|
|-------|-------------| |
|
|
| `thematic_parallel` | Passages share the same theme or topic | |
|
|
| `direct_quote` | One passage directly quotes another | |
|
|
| `prophetic_fulfillment` | OT prophecy fulfilled in NT | |
|
|
| `typological` | OT type foreshadowing NT antitype | |
|
|
| `contrast` | Passages present contrasting ideas | |
|
|
| `historical_narrative` | Shared historical events or figures | |
|
|
| `theological_expansion` | Later passage expands on earlier theology | |
|
|
|
|
|
## Training Details |
|
|
|
|
|
- **Base model**: `roberta-base` (125M params) |
|
|
- **Training data**: 19,164 balanced examples (Grok-labeled from TSK cross-references) |
|
|
- **Class balancing**: WeightedTrainer with inverse-frequency CrossEntropyLoss + majority class capping |
|
|
- **Epochs**: 8 |
|
|
- **Best epoch**: 8 (by eval loss) |
|
|
|
|
|
## Metrics (v2 - Retrained Feb 2026) |
|
|
|
|
|
| Metric | Value | |
|
|
|--------|-------| |
|
|
| **Macro F1** | **0.761** | |
|
|
| Micro F1 | 0.853 | |
|
|
| Precision | 0.665 | |
|
|
| Recall | 0.939 | |
|
|
| Eval Loss | 0.501 | |
|
|
|
|
|
### Improvement over v1 |
|
|
|
|
|
| Metric | v1 (Original) | v2 (Retrained) | Change | |
|
|
|--------|---------------|----------------|--------| |
|
|
| Macro F1 | 0.42 | **0.761** | +81% | |
|
|
| Micro F1 | 0.72 | **0.853** | +18% | |
|
|
|
|
|
**Root cause of v1 weakness**: 76% class imbalance (thematic_parallel dominated). Fixed with: |
|
|
1. Balanced dataset (cap majority class, keep all minority examples) |
|
|
2. WeightedTrainer with inverse-frequency class weights |
|
|
|
|
|
## Usage |
|
|
|
|
|
```python |
|
|
from transformers import pipeline |
|
|
|
|
|
classifier = pipeline( |
|
|
"text-classification", |
|
|
model="LoveJesus/intertextual-classifier-chirho", |
|
|
top_k=None, |
|
|
) |
|
|
|
|
|
text = "[CLS] Genesis 3:15 And I will put enmity between thee and the woman, and between thy seed and her seed; it shall bruise thy head, and thou shalt bruise his heel. [SEP] Galatians 4:4 But when the fulness of the time was come, God sent forth his Son, made of a woman, made under the law [SEP]" |
|
|
|
|
|
result = classifier(text) |
|
|
print(result) |
|
|
# [{'label': 'prophetic_fulfillment', 'score': 0.95}, ...] |
|
|
``` |
|
|
|
|
|
## Part of Bible ML Pipeline |
|
|
|
|
|
This model is part of the [Intertextual Reference Network](https://huggingface.co/spaces/LoveJesus/intertextual-reference-network-chirho) pipeline: |
|
|
|
|
|
1. **Embedder** ([LoveJesus/intertextual-embedder-chirho](https://huggingface.co/LoveJesus/intertextual-embedder-chirho)): Finds similar passages |
|
|
2. **Classifier** (this model): Classifies the connection type |
|
|
|
|
|
Dataset: [LoveJesus/intertextual-dataset-chirho](https://huggingface.co/datasets/LoveJesus/intertextual-dataset-chirho) |
|
|
|