DeepBoner / docs /brainstorming /BRAINSTORM_EMBEDDINGS_META.md
VibecoderMcSwaggins's picture
docs: organize root files and add P1 synthesis fallback bug report
1d32642
|
raw
history blame
3.07 kB

Embeddings Brainstorm - Conclusions

Date: November 2025 Status: CLOSED - Conclusions reached, no action needed


The Question

Should DeepBoner implement:

  1. Internal codebase embeddings/ingestion pipeline?
  2. mGREP for internal tool selection?
  3. Self-knowledge components for agents?

The Answer: NO

After research and first-principles analysis, the conclusion is clear:

Why Not Internal Embeddings/Ingestion

DeepBoner's Core Task:
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  User Query: "Evidence for testosterone in HSDD?"       β”‚
β”‚                         ↓                               β”‚
β”‚  1. Search PubMed, ClinicalTrials, Europe PMC          β”‚
β”‚  2. Judge: Is evidence sufficient?                      β”‚
β”‚  3. Synthesize: Generate report                         β”‚
β”‚                         ↓                               β”‚
β”‚  Output: Research report with citations                 β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Does ANY step require self-knowledge of codebase? NO.

Why Not mGREP for Tool Selection

Approach Complexity Accuracy
Embeddings + mGREP for tool selection High Medium (semantic similarity β‰  correct tool)
Direct prompting with tool descriptions Low High (LLM reasons about applicability)

No real agent system uses embeddings for tool selection. All major frameworks (LangChain, OpenAI, Anthropic, Magentic) use prompt-based tool selection because:

  1. LLMs are already doing semantic matching internally
  2. Tool count is small (5-20) - fits easily in context
  3. Prompts allow reasoning, not just similarity

What We Already Have

DeepBoner already uses embeddings for the right thing: research evidence retrieval.

  • src/services/embeddings.py - ChromaDB + sentence-transformers
  • src/services/llamaindex_rag.py - OpenAI embeddings for premium tier

The Real Priority

Instead of internal embeddings/mGREP, focus on:

  1. Deduplication across PubMed/Europe PMC/OpenAlex
  2. Outcome measures from ClinicalTrials.gov
  3. Citation graph traversal via OpenAlex

See: TOOL_ANALYSIS_CRITICAL.md for detailed improvement roadmap.


Research Sources


This document is closed. The conclusion is: don't implement internal embeddings/mGREP for this use case.