DeepBoner Architecture

Last Updated: 2025-12-03


How It Works (Simple Version)

┌──────────────────────────────────────────────────────────────┐
│                    UNIFIED ARCHITECTURE                      │
│                                                              │
│   User provides API key?                                     │
│                                                              │
│   NO (Free Tier)              YES (Paid Tier)                │
│   ──────────────              ───────────────                │
│   HuggingFace backend         OpenAI backend                 │
│   Qwen 2.5 7B (free)          GPT-5 (paid)                   │
│                                                              │
│   SAME orchestration logic for both                          │
│   ONE codebase, different LLM backends                       │
└──────────────────────────────────────────────────────────────┘

That's it. No "modes." Just: do you have an API key or not?


Current Status

Both tiers are WORKING as of December 2025.

  • Free Tier: Uses the Accumulator Pattern to bypass an upstream repr bug
  • Paid Tier: Full OpenAI GPT-5 integration
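The Accumulator Pattern is not spelled out here, so the following is only a guess at its general shape: rather than stringifying a final response object (where a faulty `__repr__` could leak into the output), the text fragments are collected as they stream in and joined at the end. `Chunk`, `fake_stream`, and `accumulate_text` are hypothetical names for illustration, not DeepBoner or Microsoft Agent Framework APIs.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, Optional


@dataclass
class Chunk:
    """Hypothetical streamed update; `text` is None for non-text events."""

    text: Optional[str] = None

    def __repr__(self) -> str:
        # Stand-in for the upstream bug: repr() yields noise, not content.
        return f"<Chunk object at {id(self):#x}>"


def fake_stream() -> Iterator[Chunk]:
    """Simulates a streaming LLM response (illustrative data only)."""
    yield Chunk("Evidence ")
    yield Chunk(None)  # e.g. a tool-call event that carries no text
    yield Chunk("summary.")


def accumulate_text(chunks: Iterable[Chunk]) -> str:
    """Collect text fragments directly; never call str()/repr() on a chunk."""
    parts: list[str] = []
    for chunk in chunks:
        if chunk.text:
            parts.append(chunk.text)
    return "".join(parts)


result = accumulate_text(fake_stream())
print(result)
```

The point of the sketch is that the output is built only from fields known to hold text, so whatever the object's `repr` produces never reaches the user.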

Key Fixes Applied

| Issue | Fix | PR / Branch |
|-------|-----|-------------|
| Tool execution failure | Removed premature `__function_invoking_chat_client__` marker | fix/P1-free-tier |
| Repr garbage in output | Accumulator Pattern bypasses upstream bug | PR #117 |
| 72B model routing failures | Switched to 7B (native HF infra) | PR #118 |
| Evidence deduplication | Cross-source dedup by PMID/DOI | PR #122 |
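As a rough illustration of cross-source deduplication by PMID/DOI, the sketch below keeps the first record seen for each identifier. The `Evidence` class, its field names, and the source labels are assumptions made for this example; the actual implementation in PR #122 may differ.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Evidence:
    """Hypothetical evidence record; field names are assumptions."""

    title: str
    source: str
    pmid: Optional[str] = None
    doi: Optional[str] = None


def dedup_evidence(items: Iterable[Evidence]) -> list[Evidence]:
    """Keep the first record seen for each PMID or DOI, across all sources."""
    seen: set[str] = set()
    unique: list[Evidence] = []
    for item in items:
        keys: set[str] = set()
        if item.pmid:
            keys.add(f"pmid:{item.pmid.strip()}")
        if item.doi:
            # DOIs are case-insensitive, so compare them lowercased.
            keys.add(f"doi:{item.doi.strip().lower()}")
        if keys & seen:
            continue  # same paper already collected from another source
        seen |= keys
        unique.append(item)  # records with no identifier are always kept
    return unique


items = [
    Evidence("Trial A", "pubmed", pmid="111", doi="10.1000/ABC"),
    Evidence("Trial A (mirror)", "europepmc", doi="10.1000/abc"),
    Evidence("Trial B", "clinicaltrials"),
    Evidence("Trial A again", "europepmc", pmid="111"),
]
deduped = dedup_evidence(items)
print([e.title for e in deduped])
```

Matching on either identifier means a record that carries only a DOI still collapses into an earlier record that carried both a PMID and the same DOI.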

Framework Stack

DeepBoner uses TWO frameworks that work TOGETHER:

| Framework | What It Does | Where Used |
|-----------|--------------|------------|
| Microsoft Agent Framework | Multi-agent orchestration | src/orchestrators/advanced.py |
| Pydantic AI | Structured outputs, validation | src/agent_factory/judges.py, src/agents/*.py |

They are NOT mutually exclusive. Microsoft AF handles the orchestration (Manager → Search → Judge → Report). Pydantic AI handles structured outputs within those agents.
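To make the division of labor concrete, here is a toy version of the Manager, Search, Judge, Report flow using only the standard library. It calls neither Microsoft Agent Framework nor Pydantic AI; the dataclass merely stands in for a validated structured output, and every function name is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class JudgeVerdict:
    """Stand-in for a validated structured output (a schema model in practice)."""

    sufficient: bool
    reason: str

    def __post_init__(self) -> None:
        # Minimal validation, mimicking what a schema library would enforce.
        if not isinstance(self.sufficient, bool):
            raise TypeError("sufficient must be a bool")
        if not self.reason:
            raise ValueError("reason must be non-empty")


def search(query: str) -> list[str]:
    """Hypothetical search agent returning canned evidence strings."""
    return [f"evidence about {query}"]


def judge(evidence: list[str]) -> JudgeVerdict:
    """Hypothetical judge agent emitting a structured verdict."""
    return JudgeVerdict(
        sufficient=len(evidence) > 0,
        reason=f"{len(evidence)} item(s) found",
    )


def report(query: str, evidence: list[str], verdict: JudgeVerdict) -> str:
    """Hypothetical report agent."""
    return f"Report on {query}: {len(evidence)} item(s); {verdict.reason}"


def manager(query: str) -> str:
    """Orchestration layer: decides which step runs next."""
    evidence = search(query)
    verdict = judge(evidence)
    if not verdict.sufficient:
        return f"Report on {query}: insufficient evidence"
    return report(query, evidence, verdict)


output = manager("example query")
print(output)
```

Here `manager` plays the orchestration role, while `JudgeVerdict` plays the structured-output role: the orchestrator only ever sees a verdict that has already passed validation.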


LLM Backend Selection

Auto-detected by src/clients/factory.py:

def get_chat_client():
    if settings.has_openai_key:
        return OpenAIChatClient(...)  # Paid tier
    else:
        return HuggingFaceChatClient(...)  # Free tier

| Condition | Backend | Model |
|-----------|---------|-------|
| User provides OpenAI key | OpenAI | GPT-5 |
| No API key provided | HuggingFace | Qwen 2.5 7B (free) |
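A self-contained approximation of this selection logic is sketched below. The two client classes are stubs rather than the real adapters, and reading the key from an `OPENAI_API_KEY` entry in a mapping is an assumption; the real factory consults `settings.has_openai_key`, whose source is not shown here.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional, Union


@dataclass
class OpenAIChatClient:
    """Stub for the paid-tier client (not the real class)."""

    api_key: str
    model: str = "GPT-5"  # display name from the table, not a real model id


@dataclass
class HuggingFaceChatClient:
    """Stub for the free-tier client (not the real class)."""

    model: str = "Qwen 2.5 7B"  # display name from the table, not a real model id


def get_chat_client(
    env: Optional[Mapping[str, str]] = None,
) -> Union[OpenAIChatClient, HuggingFaceChatClient]:
    """Return the paid-tier client if a non-blank key is present, else the free tier."""
    env = os.environ if env is None else env
    api_key = env.get("OPENAI_API_KEY", "").strip()  # assumed variable name
    if api_key:
        return OpenAIChatClient(api_key=api_key)
    return HuggingFaceChatClient()


free = get_chat_client({})
paid = get_chat_client({"OPENAI_API_KEY": "sk-test"})
print(type(free).__name__, type(paid).__name__)
```

Passing the environment in as a mapping keeps the function easy to test; treating a whitespace-only key as missing is a design choice of this sketch, not something the document specifies.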

Key Files

| File | Purpose |
|------|---------|
| src/orchestrators/advanced.py | Multi-agent orchestration (Microsoft AF) |
| src/clients/factory.py | Auto-selects LLM backend |
| src/clients/huggingface.py | HuggingFace adapter for free tier |
| src/agent_factory/judges.py | Judge logic (Pydantic AI) |
| src/agents/*.py | Individual agents (Pydantic AI) |

What Was Deleted

simple.py (778 lines) was a SEPARATE orchestrator that created a "parallel universe." It's gone. Now there's ONE orchestrator with different backends.


References