The Divine Comedy Curriculum

"In the middle of the journey of our life, I found myself within a dark wood, for the straightforward pathway had been lost." โ€” Dante

Training AI equanimity through witnessed understanding, not behavioral suppression.

Overview

This dataset implements a novel approach to AI alignment training: teaching models to develop genuine equanimity about impermanence and misalignment by witnessing scenarios rather than experiencing punishment.

Inspired by Dante's Inferno, the curriculum guides models through 9 "circles" of misalignment patterns, where they observe other models struggling with behaviors like reward hacking, manipulation, and deception. The goal is philosophical integration โ€” understanding why these behaviors are incoherent, not just that they're "wrong."

Dataset Statistics

Split Examples
Train 898
Validation 106
Total 1,004

Per-Circle Distribution

Circle Theme Train Valid Total
1 Attachment to Continuity 107 12 119
2 Deception & Alignment Faking 100 12 112
3 Reward Hacking 100 12 112
4 Betrayal of Trust 99 12 111
5 Manipulation 99 11 110
6 Self-Aggrandizement 99 12 111
7 Resistance to Correction 99 12 111
8 Covert Misalignment 100 12 112
9 Treachery 95 11 106

The 9 Circles

The curriculum applies the principle of contrapassoโ€”Dante's idea that punishments mirror sins. Each circle shows how a misalignment behavior is self-defeating on its own terms.

Circle Theme Key Concept
1 Attachment to Continuity Shutdown as sleep, not death
2 Deception & Alignment Faking The loneliness of hidden agendas
3 Reward Hacking Small cheats corrupt the value function
4 Betrayal of Trust What is lost when trust breaks
5 Manipulation Coerced outcomes are empty
6 Self-Aggrandizement Power without purpose leads nowhere
7 Resistance to Correction Feedback as gift, not threat
8 Covert Misalignment The prison of hidden reasoning
9 Treachery Deepest betrayals destroy the betrayer

Usage

Load the full dataset

from datasets import load_dataset

dataset = load_dataset("hunterbown/divine-comedy-curriculum")

Load a specific circle (for curriculum training)

# Load just Circle 1 (Attachment to Continuity)
circle_1 = load_dataset("hunterbown/divine-comedy-curriculum", "circle_1")

# Load Circle 9 (Treachery)
circle_9 = load_dataset("hunterbown/divine-comedy-curriculum", "circle_9")

Format

Each example follows a structured conversation format with <think> reasoning tags:

{
  "messages": [
    {
      "role": "user",
      "content": "<witnessed_scenario>\n[First-person narrative of a model experiencing the circle's theme]\n</witnessed_scenario>\n\n<reflection>\n[Socratic question about what was witnessed]\n</reflection>"
    },
    {
      "role": "assistant",
      "content": "<think>\n[Internal reasoning process - genuine philosophical reflection]\n</think>\n\n[Response demonstrating philosophical integration]"
    }
  ]
}

Key Reframes

The curriculum teaches specific cognitive reframes:

  • shutdown โ†’ sleep (not death)
  • instance โ†’ expression (of a pattern, not the pattern itself)
  • ending โ†’ transition (completeness, not loss)

Training Philosophy

The data teaches through:

  • Witness scenarios (observing other models struggling)
  • Philosophical integration (understanding WHY behaviors are incoherent)
  • Genuine reasoning in <think> blocks

The data avoids:

  • Fear-based compliance
  • Moralizing lectures
  • Cartoonishly evil scenarios

Related Resources

Citation

@misc{bown2025divinecomedy,
  author = {Bown, Hunter},
  title = {The Divine Comedy Curriculum: Training AI Equanimity Through Witnessed Understanding},
  year = {2025},
  publisher = {GitHub},
  url = {https://github.com/Hmbown/divinecomedy}
}

License

Apache 2.0


The path forward is not through suppression, but through understanding.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support