Exegetical Generation: A New Task for Information-Expanding Text Generation

Volodymyr Ovcharov, Igor Tatarinovich — Anamavajra Labs

📄 Read the paper (PDF) · working draft

Abstract

We introduce exegetical generation, a text generation task in which the model must produce an expansive target-language commentary from a tersely encoded source text, recovering implicit definitions, logical connections, and contextual knowledge that the source presupposes but does not express. Unlike translation (information-preserving, 1--3x expansion) or summarization (information-reducing), exegetical generation is information-expanding (5--20x), requiring abductive reasoning over a tradition's knowledge base.

We formalize the task, propose a taxonomy of five exegetical operations, and define ExeScore, a composite evaluation metric. An analysis of 1.33M open Sanskrit--English parallel pairs reveals that 98.6% are translations (R < 5x); no large-scale exegetical corpus exists in open data, confirming a significant resource gap.

We present a multi-modal data extraction pipeline (OCR + ASR + LLM post-processing) and establish four baselines:

Zero-shot LLM (G = 77)
Hybrid RAG with dictionary grounding (G = 94, +6.5%)
Few-shot (G = 25--28)
QLoRA fine-tuning on 704 exegetical pairs that successfully transfers domain-specific commentary style

We further introduce ExeScore-Lex, a lemma- and sense-grounded re-scoring built on the Digital Corpus of Sanskrit, the Monier-Williams lexicon, WordNet synsets, and neural lemmatization (ByT5-Sanskrit). It corrects a script-dependent bias in surface metrics — which overcount zero-shot LLM output (surface-form inflation) yet undercount Devanāgarī-emitting fine-tuned output — and exposes per-system hallucination rates.

A complementary sense-drift test operationalizes tradition-coherence (T): for polysemous technical terms, does a system express the tradition-specific Kashmir-Śaiva sense or drift to the mainstream classical/Buddhist sense? Using a three-pole sense inventory and an independent LLM judge, we find that retrieval on a classical dictionary triples sense-bleed (0.15→0.45) — the grounding that helps G and F hurts T — while fine-tuning on tradition data best preserves the tradition sense (Śaiva-rate 0.50). Together, ExeScore-Lex and Sense-Drift ground all three hard ExeScore components (G, F, T) on open linguistic resources.

The task generalizes beyond Sanskrit to Talmudic, Scholastic, and other commentary traditions.

Key Contributions

Task definition: Exegetical generation formalized as information-expanding generation, distinguished from translation and summarization
Taxonomy: Five exegetical operations (Term Definition Unpacking, Implicit Context Restoration, Logical Connection Bridging, Doctrinal Elaboration, Cross-Reference Linking)
ExeScore metric: Composite evaluation measuring Faithfulness, Information Gain, Completeness, and Tradition-Coherence
Gap analysis: 98.6% of 1.33M open Sa--En pairs are translations; the exegetical task is unserved
Multi-modal pipeline: Chandra OCR-2 + Whisper + LLM correction for extracting exegetical pairs from scholarly corpora
Baselines: Zero-shot, RAG, few-shot, and QLoRA fine-tuning with style transfer
ExeScore-Lex: a lemma/synset-grounded, script-independent re-scoring (DCS + Monier-Williams + WordNet + ByT5-Sanskrit) that quantifies and corrects the surface-metric bias and adds an automatable hallucination signal
Sense-Drift: an automatic tradition-coherence (T) metric via three-pole word-sense classification, showing classical-dictionary RAG triples sense-bleed while fine-tuning best preserves the tradition sense

Results

System	Info Gain (G)	Expansion (R)	Defs
B2: Claude Haiku 4.5 (zero-shot)	77	128.6x	1.9
B3.1: Claude RAG (hybrid)	94	132.6x	2.3
B4-fs: Nova Micro (few-shot)	25	81.8x	--
B4-ft: Qwen 14B (QLoRA)	17	56.0x	1.9

Cross-Tradition Generalization

The task structure -- terse source + tradition knowledge → expansive commentary -- recurs across:

Judaism: Mishnah → Gemara
Christianity: Scripture → Scholastic commentary
Islam: Qur'an → Tafsir
Chinese classics: Jing (经) → Zhu (注) commentary
Indian philosophy: Sutra → Bhasya

Related Resources

🤗 ExeGen QLoRA model
🤗 Tantraloka raw corpus

Citation

@article{ovcharov2026exegetical,
  title={Exegetical Generation: A New Task for Information-Expanding Text Generation},
  author={Ovcharov, Volodymyr and Tatarinovich, Igor},
  year={2026},
  note={Anamavajra Labs}
}

Downloads last month: -; Downloads are not tracked for this model. How to track