# AdapterRouter Integration Guide: Memory-Weighted Routing

## Overview

This guide shows how to integrate Phase 2's MemoryWeighting into the actual AdapterRouter to enable adaptive adapter selection based on historical performance.

**Current State**: MemoryWeighting is built and wired into ForgeEngine, but not yet connected to AdapterRouter. This document bridges that gap.

---

## Architecture: Where MemoryWeighting Fits

```
Query
  ↓
AdapterRouter.route()
  ├─ [Current] Keyword matching → base_result = RouteResult(primary, secondary, confidence)
  └─ [Phase 2] Memory-weighted boost → boosted_confidence = base_confidence * (1 + weight_modifier)
  ↓
ForgeEngine.forge_with_debate(primary=primary_adapter, secondary=secondary_adapters)
  ↓
Agents generate analyses → Conflicts detected → Stored in memory
  ↓
Next Query: Adapters with high historical coherence get up to +50% confidence boost
```
---

## Integration Steps

### Step 1: Wire MemoryWeighting into AdapterRouter.__init__()

**File**: `inference/adapter_router.py` (lines ~50-80)

**Current Code**:
```python
class AdapterRouter:
    def __init__(self, adapter_registry):
        self.adapter_registry = adapter_registry
        self.keyword_index = {}
        # ... initialize other components ...
```

**Phase 2 Enhancement**:
```python
from reasoning_forge.memory_weighting import MemoryWeighting

class AdapterRouter:
    def __init__(self, adapter_registry, memory_weighting=None):
        self.adapter_registry = adapter_registry
        self.keyword_index = {}
        self.memory_weighting = memory_weighting  # NEW: optional memory weighting
        # ... initialize other components ...
```

**Usage**:
```python
# In codette_session.py or app initialization:
from reasoning_forge.living_memory import LivingMemoryKernel
from reasoning_forge.memory_weighting import MemoryWeighting
from inference.adapter_router import AdapterRouter

memory = LivingMemoryKernel(max_memories=100)
weighting = MemoryWeighting(memory)
router = AdapterRouter(adapter_registry, memory_weighting=weighting)
```
---

### Step 2: Modify AdapterRouter.route() for Memory-Weighted Boost

**File**: `inference/adapter_router.py` (lines ~200-250)

**Current Code**:
```python
def route(self, query: str) -> RouteResult:
    """Route query to appropriate adapters."""
    # Keyword matching
    scores = self._route_keyword(query)
    # ... pick best_adapter, top_secondary, max_score from scores ...
    return RouteResult(
        primary=best_adapter,
        secondary=top_secondary,
        confidence=max_score
    )
```
**Phase 2 Enhancement - SOFT BOOST**:
```python
import os  # module-level import, used by the DEBUG_ADAPTER_ROUTING check below

def route(self, query: str, use_memory_boost: bool = True) -> RouteResult:
    """Route query to appropriate adapters with optional memory weighting.

    Args:
        query: User query text
        use_memory_boost: If True, boost confidence based on historical performance

    Returns:
        RouteResult with primary, secondary adapters and confidence
    """
    # Step 1: Keyword-based routing (existing logic)
    base_result = self._route_keyword(query)

    # Step 2: Apply memory-weighted boost (Phase 2)
    if use_memory_boost and self.memory_weighting:
        base_conf = base_result.confidence  # capture before overwriting
        boosted_conf = self.memory_weighting.get_boosted_confidence(
            base_result.primary,
            base_conf
        )
        base_result.confidence = boosted_conf

        # Optional: Explain the boost for debugging
        if os.environ.get("DEBUG_ADAPTER_ROUTING"):
            explanation = self.memory_weighting.explain_weight(base_result.primary)
            print(f"[ROUTING] {base_result.primary}: "
                  f"base={base_conf:.2f}, "
                  f"boosted={boosted_conf:.2f}, "
                  f"weight={explanation['final_weight']:.2f}")

    return base_result
```
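The guide doesn't show `get_boosted_confidence()` itself; as a point of reference, here is a minimal sketch of the soft boost it plausibly applies, assuming weights live in [0, 2] with 1.0 neutral and `max_boost = 0.5`, both taken from the Configuration section of this guide. The helper name `soft_boost` is hypothetical, not an existing API.

```python
def soft_boost(base_confidence: float, weight: float, max_boost: float = 0.5) -> float:
    """Sketch of a memory-weighted soft boost (hypothetical helper).

    Maps an adapter weight in [0, 2] (1.0 = neutral) onto a confidence
    multiplier in [1 - max_boost, 1 + max_boost], then clamps to [0, 1].
    """
    modifier = (weight - 1.0) * max_boost         # -0.5 .. +0.5 for weight in [0, 2]
    boosted = base_confidence * (1.0 + modifier)  # matches the architecture diagram
    return max(0.0, min(1.0, boosted))

# A high-coherence adapter (weight 1.8) gets boosted; a neutral one is unchanged.
print(soft_boost(0.6, 1.8))  # ≈ 0.84
print(soft_boost(0.6, 1.0))  # 0.6
```

Whatever the real formula, the key property to preserve is that a neutral weight of 1.0 leaves the keyword confidence untouched.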
**Advanced Option - STRICT MEMORY-ONLY** (optional, higher risk):
```python
def route(self, query: str, strategy: str = "keyword") -> RouteResult:
    """Route query with pluggable strategy.

    Args:
        query: User query text
        strategy: "keyword" (default), "memory_weighted", or "memory_only"

    Returns:
        RouteResult with primary, secondary adapters and confidence
    """
    if strategy == "memory_only" and self.memory_weighting:
        # Pure learning approach: ignore keywords
        weights = self.memory_weighting.compute_weights()
        if weights:
            primary = max(weights.keys(), key=lambda a: weights[a])
            return RouteResult(
                primary=primary,
                secondary=[],  # No secondary adapters in memory-only mode
                confidence=weights[primary] / 2.0  # Weights span [0, 2]; normalize to [0, 1]
            )
        else:
            # Fall back to keyword routing if no memory yet
            return self._route_keyword(query)
    elif strategy == "memory_weighted":
        # Soft boost approach: keyword routing + memory confidence boost
        base_result = self._route_keyword(query)
        if self.memory_weighting:
            boosted_conf = self.memory_weighting.get_boosted_confidence(
                base_result.primary,
                base_result.confidence
            )
            base_result.confidence = boosted_conf
        return base_result
    else:  # strategy == "keyword"
        # Pure keyword routing (existing behavior)
        return self._route_keyword(query)
```
---

### Step 3: Pass MemoryWeighting Through Session/App

**File**: `inference/codette_session.py` (lines ~50-100)

**Current Code**:
```python
class CodetteSession:
    def __init__(self):
        self.memory_kernel = LivingMemoryKernel(max_memories=100)
        self.router = AdapterRouter(adapter_registry)
        self.forge = ForgeEngine()
```

**Phase 2 Enhancement**:
```python
from reasoning_forge.memory_weighting import MemoryWeighting

class CodetteSession:
    def __init__(self):
        self.memory_kernel = LivingMemoryKernel(max_memories=100)
        # NEW: Initialize memory weighting
        self.memory_weighting = MemoryWeighting(self.memory_kernel)
        # Wire into router
        self.router = AdapterRouter(
            adapter_registry,
            memory_weighting=self.memory_weighting
        )
        # Wire into forge (Phase 2)
        self.forge = ForgeEngine(
            living_memory=self.memory_kernel,
            enable_memory_weighting=True
        )

    def on_submit(self, query: str):
        """Process user query with memory-weighted routing."""
        # Route using memory weights (passing route_result into the
        # forge's adapter loading is wiring not shown here)
        route_result = self.router.route(query, use_memory_boost=True)
        # Run forge with memory enabled
        result = self.forge.forge_with_debate(query)
        # Conflicts automatically stored in memory
        response = result["metadata"]["synthesized"]
        return response
```
---

## Testing the Integration

### Unit Test: Memory Weighting + Router

```python
def test_memory_weighted_routing():
    """Test that memory weights modulate router confidence."""
    from reasoning_forge.living_memory import LivingMemoryKernel, MemoryCocoon
    from reasoning_forge.memory_weighting import MemoryWeighting
    from inference.adapter_router import AdapterRouter

    # Setup
    memory = LivingMemoryKernel()

    # Seed memory with Newton performance (high coherence)
    newton_cocoon = MemoryCocoon(
        title="Newton analysis",
        content="Analytical approach",
        adapter_used="newton",
        coherence=0.9,
        emotional_tag="neutral",
    )
    memory.store(newton_cocoon)

    # Create weighting + router
    weighting = MemoryWeighting(memory)
    router = AdapterRouter(adapter_registry, memory_weighting=weighting)

    # Test
    query = "Analyze this algorithm"
    result = router.route(query, use_memory_boost=True)

    # If Newton scored high before, its confidence should be boosted
    assert result.confidence > 0.5  # Baseline
    print(f"✓ Routing test passed: {result.primary} @ {result.confidence:.2f}")
```
### E2E Test: Full Loop

```python
def test_memory_learning_loop():
    """Test that conflicts → memory → weights → better future routing."""
    from reasoning_forge.forge_engine import ForgeEngine
    from reasoning_forge.living_memory import LivingMemoryKernel
    from reasoning_forge.memory_weighting import MemoryWeighting
    from inference.adapter_router import AdapterRouter

    # Run 1: Initial debate (no memory history)
    memory = LivingMemoryKernel()
    forge = ForgeEngine(living_memory=memory, enable_memory_weighting=True)
    result1 = forge.forge_with_debate("Compare speed vs clarity", debate_rounds=1)
    conflicts1 = result1["metadata"]["conflicts_round_0_count"]
    print(f"Run 1: {conflicts1} conflicts detected, stored in memory")

    # Run 2: Same query with memory history
    # Adapters that resolved conflicts should get boosted
    weighting = MemoryWeighting(memory)  # Now has history
    weights = weighting.get_all_weights()
    print("\nAdapter weights after learning:")
    for adapter, w_dict in weights.items():
        print(f"  {adapter}: weight={w_dict['weight']:.3f}, coherence={w_dict['coherence']:.3f}")

    # Router should now boost high-performing adapters
    router = AdapterRouter(adapter_registry, memory_weighting=weighting)
    route_result = router.route("Compare speed vs clarity", use_memory_boost=True)
    print(f"\nRouting decision: {route_result.primary} @ {route_result.confidence:.2f}")

    # Run debate again (should use boosted adapters)
    result2 = forge.forge_with_debate("Compare speed vs clarity", debate_rounds=1)
    conflicts2 = result2["metadata"]["conflicts_round_0_count"]

    # Measure improvement
    improvement = (conflicts1 - conflicts2) / max(conflicts1, 1)
    print(f"Run 2: {conflicts2} conflicts (improvement: {improvement:.1%})")
```
---

## Configuration: Tuning Parameters

**Memory Weighting Parameters** (in `MemoryWeighting`):
```python
# Update frequency (hours)
update_interval_hours = 1.0  # Recompute weights every hour

# Weight formula contributions
base_coherence_weight = 0.5    # Contribution from mean coherence
conflict_success_weight = 0.3  # Contribution from conflict resolution
recency_weight = 0.2           # Contribution from recency decay

# Recency decay half-life (hours)
recency_half_life_hours = 168  # 7 days

# Boost modulation
max_boost = 0.5  # ±50% confidence modification
```
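Read together, the coefficients above suggest a weight computation along these lines. This is a sketch under assumptions, not the actual MemoryWeighting implementation: the function name, the exponential recency decay, and the final ×2 scaling onto [0, 2] are all inferred from the parameters and the weight range quoted elsewhere in this guide.

```python
def compute_weight(mean_coherence: float,
                   conflict_resolution_rate: float,
                   hours_since_last_use: float,
                   half_life_hours: float = 168.0) -> float:
    """Hypothetical combination of the three tuned contributions into a
    weight in [0, 2], where 1.0 is neutral; both score inputs are in [0, 1]."""
    # Exponential decay: 1.0 for a just-used adapter, 0.5 after one half-life (7 days)
    recency = 0.5 ** (hours_since_last_use / half_life_hours)
    score = (0.5 * mean_coherence              # base_coherence_weight
             + 0.3 * conflict_resolution_rate  # conflict_success_weight
             + 0.2 * recency)                  # recency_weight
    return 2.0 * score  # scale the [0, 1] score onto the [0, 2] weight range

# Strong coherence, decent conflict record, used just now:
w = compute_weight(0.9, 0.7, hours_since_last_use=0.0)  # ≈ 1.72
```

Under this shape of formula, raising `recency_weight` makes the router forget faster, while raising `base_coherence_weight` makes it more conservative about proven adapters.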
**Router Integration Options**:
```python
# Memory boost enabled/disabled
router.route(query, use_memory_boost=True)   # Default: enabled
router.route(query, use_memory_boost=False)  # Keyword-only

# Strategy selection (advanced)
router.route(query, strategy="keyword")          # Pure keyword
router.route(query, strategy="memory_weighted")  # Soft boost (recommended)
router.route(query, strategy="memory_only")      # Pure learning (risky)
```
---

## Production Deployment Checklist

- [ ] Wire MemoryWeighting into AdapterRouter.__init__()
- [ ] Modify route() method with use_memory_boost parameter
- [ ] Update CodetteSession to initialize memory_weighting
- [ ] Pass memory_weighting through all routing calls
- [ ] Update app.py/Gradio interface to pass memory context
- [ ] Add unit test for memory-weighted routing
- [ ] Add E2E test for full learning loop
- [ ] Monitor: log adapter weights after each debate cycle
- [ ] Tune: adjust weight formula coefficients based on results
- [ ] Document: user-facing explanation of why adapters were selected
---

## Monitoring & Debugging

### Enable Debug Logging

```python
import os
import logging

# In app initialization:
if os.environ.get("DEBUG_ADAPTER_ROUTING"):
    logging.basicConfig(level=logging.DEBUG)
    # This will print weight explanations on each route call
```

### Query Adapter Weight History

```python
from reasoning_forge.memory_weighting import MemoryWeighting

# Get snapshot of adapter weights
weights = memory_weighting.get_all_weights()
for adapter, w_dict in weights.items():
    print(f"{adapter}: weight={w_dict['weight']:.3f}")

# Explain a specific adapter's weight
explanation = memory_weighting.explain_weight("newton")
print(explanation["explanation"])
# Output: "Adapter 'newton' has been used 15 times with 0.8 avg coherence,
#          73% conflict resolution rate, and 0.95 recency score.
#          Final weight: 1.45 (range [0, 2.0])"
```
### Memory State

```python
# Check memory cocoon counts per adapter
for cocoon in memory.memories:
    if cocoon.emotional_tag == "tension":
        print(f"Conflict: {cocoon.adapter_used}, coherence={cocoon.coherence}")

# Get emotional profile
profile = memory.emotional_profile()
print(f"Memory profile: {profile}")  # e.g. {'tension': 25, 'neutral': 10, ...}
```
---

## Known Limitations & Future Work

1. **Adapter Naming**: Currently stores agent pairs (e.g., "Newton,Quantum"). For pure adapter routing, these need to be mapped to actual adapter names.
2. **Cold Start**: New adapters have neutral weights (1.0) until they accumulate history (~10-15 uses).
3. **Strict Mode Risk**: Memory-only routing (no keywords) can ignore important query context. Test thoroughly before production.
4. **Memory Pruning**: Automatic pruning at 100 memories may lose old patterns. Consider keeping high-importance conflicts longer.
5. **Next Phase**: Multi-round conflict resolution tracking would enable learning across multiple debate cycles, not just a single round.
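One way to act on limitation 4 is importance-aware pruning instead of plain FIFO eviction. The sketch below is hypothetical: the `Cocoon` stand-in only mirrors the MemoryCocoon fields used in this guide's examples, and `prune_importance_aware` is not an existing API.

```python
from dataclasses import dataclass

@dataclass
class Cocoon:
    """Stand-in for MemoryCocoon with the fields this guide's examples use."""
    adapter_used: str
    coherence: float
    emotional_tag: str

def prune_importance_aware(memories: list, max_memories: int = 100) -> list:
    """When over capacity, keep the most important cocoons: 'tension'
    (conflict) memories outrank others, then higher coherence wins, so
    high-value conflict history survives longer than under FIFO pruning."""
    if len(memories) <= max_memories:
        return memories
    def importance(c: Cocoon) -> tuple:
        return (c.emotional_tag == "tension", c.coherence)
    return sorted(memories, key=importance, reverse=True)[:max_memories]

# Even a low-coherence conflict memory outlives a high-coherence neutral one:
kept = prune_importance_aware(
    [Cocoon("newton", 0.9, "neutral"),
     Cocoon("quantum", 0.3, "tension"),
     Cocoon("davinci", 0.5, "neutral")],
    max_memories=2,
)
```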
---

## Summary

**To Enable Memory-Weighted Routing**:

1. Add a `memory_weighting` parameter to AdapterRouter.__init__()
2. Modify route() to apply the `get_boosted_confidence()` soft boost
3. Wire it through CodetteSession / app initialization
4. Test with the unit + E2E test suite
5. Monitor weights and tune the formula if needed

**Recommended Approach**: Start with the soft boost (it preserves keyword intelligence), then migrate to memory-only routing if results justify it.

**Expected Outcome**: Better adapter selection over time, converging on adapters that historically resolved more conflicts.