Upload folder using huggingface_hub
Browse files- REVIEW_v4.5.0-beta.md +442 -0
- ROADMAP.md +152 -0
- config.yaml +6 -6
- data/subconscious_audit.jsonl +0 -0
- docs/PATTERN_LEARNER_SPEC.md +553 -0
- mnemocore_verify.py +6 -6
- src/mnemocore/api/main.py +32 -1
- src/mnemocore/core/binary_hdv.py +2 -3
- src/mnemocore/core/hnsw_index.py +2 -7
- src/mnemocore/core/qdrant_store.py +47 -41
- src/mnemocore/core/subconscious_ai.py +19 -15
- src/mnemocore/core/tier_manager.py +6 -2
- sync_qdrant.py +50 -0
- test_qdrant_scores.py +61 -0
REVIEW_v4.5.0-beta.md
ADDED
|
@@ -0,0 +1,442 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# MnemoCore v4.5.0-beta — Code Review
|
| 2 |
+
**Reviewer:** Omega (GLM-5)
|
| 3 |
+
**Datum:** 2026-02-20 07:45 CET
|
| 4 |
+
**Scope:** Full kodbas, fokus på query/store-flödet
|
| 5 |
+
|
| 6 |
+
---
|
| 7 |
+
|
| 8 |
+
## 🚨 KRITISKA PROBLEMER (Blockers)
|
| 9 |
+
|
| 10 |
+
### 1. **Query Returnerar 0 Resultat** 🔴 BLOCKER
|
| 11 |
+
**Symptom:** `POST /query` returnerar tom lista även efter framgångsrik `POST /store`
|
| 12 |
+
|
| 13 |
+
**Root Cause Analysis:**
|
| 14 |
+
|
| 15 |
+
#### 1.1 HNSW Index Manager — Position Mapping Bug
|
| 16 |
+
**Fil:** `hnsw_index.py:221-236`
|
| 17 |
+
|
| 18 |
+
```python
|
| 19 |
+
def _position_to_node_id(self, position: int) -> Optional[str]:
|
| 20 |
+
"""Map HNSW sequential position back to node_id."""
|
| 21 |
+
if not hasattr(self, "_position_map"):
|
| 22 |
+
object.__setattr__(self, "_position_map", {})
|
| 23 |
+
pm: Dict[int, str] = self._position_map
|
| 24 |
+
|
| 25 |
+
# Rebuild position map if needed (after index rebuild)
|
| 26 |
+
if len(pm) < len(self._id_map):
|
| 27 |
+
pm.clear()
|
| 28 |
+
for pos, (fid, nid) in enumerate(
|
| 29 |
+
sorted(self._id_map.items(), key=lambda x: x[0])
|
| 30 |
+
):
|
| 31 |
+
pm[pos] = nid
|
| 32 |
+
|
| 33 |
+
return pm.get(position)
|
| 34 |
+
```
|
| 35 |
+
|
| 36 |
+
**PROBLEM:** Position map bygger på `sorted(_id_map.items(), key=lambda x: x[0])` vilket sorterar efter FAISS ID (int), **inte** efter insättningsordning. HNSW returnerar positioner baserat på insättningsordning, men mappningen är inkonsekvent.
|
| 37 |
+
|
| 38 |
+
**Fix:**
|
| 39 |
+
```python
|
| 40 |
+
# Behåll insättningsordning separat
|
| 41 |
+
def add(self, node_id: str, hdv_data: np.ndarray) -> None:
|
| 42 |
+
# ... existing code ...
|
| 43 |
+
self._insertion_order.append(node_id) # NY
|
| 44 |
+
|
| 45 |
+
def _position_to_node_id(self, position: int) -> Optional[str]:
|
| 46 |
+
if position < len(self._insertion_order):
|
| 47 |
+
return self._insertion_order[position]
|
| 48 |
+
return None
|
| 49 |
+
```
|
| 50 |
+
|
| 51 |
+
---
|
| 52 |
+
|
| 53 |
+
#### 1.2 TextEncoder — Token Normalization Inkonsekvens
|
| 54 |
+
**Fil:** `binary_hdv.py:339-342`
|
| 55 |
+
|
| 56 |
+
```python
|
| 57 |
+
def encode(self, text: str) -> BinaryHDV:
|
| 58 |
+
tokens = text.lower().split() # <-- BARA whitespace split
|
| 59 |
+
if not tokens:
|
| 60 |
+
return BinaryHDV.random(self.dimension)
|
| 61 |
+
```
|
| 62 |
+
|
| 63 |
+
**PROBLEM:** Query-text vs lagrad text kan ha olika tokenisering:
|
| 64 |
+
- `"Hello World"` → tokens: `["hello", "world"]`
|
| 65 |
+
- `"Hello, World!"` → tokens: `["hello,", "world!"]` ← olika token!
|
| 66 |
+
|
| 67 |
+
**Fix:**
|
| 68 |
+
```python
|
| 69 |
+
import re
|
| 70 |
+
|
| 71 |
+
def encode(self, text: str) -> BinaryHDV:
|
| 72 |
+
# Konsekvent tokenisering
|
| 73 |
+
tokens = re.findall(r'\b\w+\b', text.lower())
|
| 74 |
+
if not tokens:
|
| 75 |
+
return BinaryHDV.random(self.dimension)
|
| 76 |
+
```
|
| 77 |
+
|
| 78 |
+
---
|
| 79 |
+
|
| 80 |
+
#### 1.3 HNSW Upgrade Threshold Race Condition
|
| 81 |
+
**Fil:** `hnsw_index.py:87-117`
|
| 82 |
+
|
| 83 |
+
```python
|
| 84 |
+
def _maybe_upgrade_to_hnsw(self) -> None:
|
| 85 |
+
if len(self._id_map) < FLAT_THRESHOLD: # 256
|
| 86 |
+
return
|
| 87 |
+
|
| 88 |
+
# ... existing code ...
|
| 89 |
+
existing: List[Tuple[int, np.ndarray]] = []
|
| 90 |
+
for fid, node_id in self._id_map.items():
|
| 91 |
+
if node_id in self._vector_cache:
|
| 92 |
+
existing.append((fid, self._vector_cache[node_id]))
|
| 93 |
+
```
|
| 94 |
+
|
| 95 |
+
**PROBLEM:** `_vector_cache` används bara vid HNSW-upgrade, men vid normal flat-index-användning cachas inte vektorer. Vid upgrade saknas data.
|
| 96 |
+
|
| 97 |
+
**Fix:** Alltid cacha vektorer:
|
| 98 |
+
```python
|
| 99 |
+
def add(self, node_id: str, hdv_data: np.ndarray) -> None:
|
| 100 |
+
# ... existing code ...
|
| 101 |
+
self._vector_cache[node_id] = hdv_data.copy() # ALLTID, inte bara HNSW
|
| 102 |
+
```
|
| 103 |
+
|
| 104 |
+
---
|
| 105 |
+
|
| 106 |
+
### 2. **Qdrant Vector Unpacking Mismatch** 🔴 HIGH
|
| 107 |
+
**Fil:** `tier_manager.py:387-392` + `qdrant_store.py`
|
| 108 |
+
|
| 109 |
+
```python
|
| 110 |
+
# Vid save till Qdrant (tier_manager.py):
|
| 111 |
+
bits = np.unpackbits(node.hdv.data)
|
| 112 |
+
vector = bits.astype(float).tolist() # 16,384 floats
|
| 113 |
+
|
| 114 |
+
# Vid search från Qdrant (qdrant_store.py):
|
| 115 |
+
arr = np.array(vec_data) > 0.5
|
| 116 |
+
packed = np.packbits(arr.astype(np.uint8))
|
| 117 |
+
```
|
| 118 |
+
|
| 119 |
+
**PROBLEM:** Qdrant använder COSINE distance för HOT och MANHATTAN för WARM, men BinaryHDV använder HAMMING distance. Similarity scores kan vara inkompatibla.
|
| 120 |
+
|
| 121 |
+
**Konfiguration (`config.yaml`):**
|
| 122 |
+
```yaml
|
| 123 |
+
qdrant:
|
| 124 |
+
collection_hot:
|
| 125 |
+
distance: COSINE # ← Fel för binary vectors!
|
| 126 |
+
collection_warm:
|
| 127 |
+
distance: MANHATTAN # ← Också suboptimalt
|
| 128 |
+
```
|
| 129 |
+
|
| 130 |
+
**Fix:** Använd `Distance.DOT` för binary vectors med normaliserad similarity.
|
| 131 |
+
|
| 132 |
+
---
|
| 133 |
+
|
| 134 |
+
### 3. **FAISS Binary HNSW — Inte Fullt Implementerat** 🔴 HIGH
|
| 135 |
+
**Fil:** `hnsw_index.py:59-66`
|
| 136 |
+
|
| 137 |
+
```python
|
| 138 |
+
def _build_hnsw_index(self, existing_nodes: Optional[List[Tuple[int, np.ndarray]]] = None) -> None:
|
| 139 |
+
hnsw = faiss.IndexBinaryHNSW(self.dimension, self.m)
|
| 140 |
+
hnsw.hnsw.efConstruction = self.ef_construction
|
| 141 |
+
hnsw.hnsw.efSearch = self.ef_search
|
| 142 |
+
```
|
| 143 |
+
|
| 144 |
+
**PROBLEM:** `IndexBinaryHNSW` saknar `IndexIDMap`-stöd. Koden försöker hantera detta med `_position_map`, men detta är skört vid:
|
| 145 |
+
- Delete + re-add
|
| 146 |
+
- Concurrent access
|
| 147 |
+
- Index rebuilds
|
| 148 |
+
|
| 149 |
+
**Risk:** Position mapping kan bli desynkroniserad → query returnerar fel IDs eller inga resultat.
|
| 150 |
+
|
| 151 |
+
---
|
| 152 |
+
|
| 153 |
+
## ⚠️ HÖGA RISKER (High Priority)
|
| 154 |
+
|
| 155 |
+
### 4. **Demotion Race Condition** 🟠
|
| 156 |
+
**Fil:** `tier_manager.py:175-220`
|
| 157 |
+
|
| 158 |
+
```python
|
| 159 |
+
async def get_memory(self, node_id: str) -> Optional[MemoryNode]:
|
| 160 |
+
demote_candidate = None
|
| 161 |
+
result_node = None
|
| 162 |
+
|
| 163 |
+
async with self.lock:
|
| 164 |
+
if node_id in self.hot:
|
| 165 |
+
node = self.hot[node_id]
|
| 166 |
+
node.access()
|
| 167 |
+
|
| 168 |
+
if self._should_demote(node):
|
| 169 |
+
node.tier = "warm" # Markerar som warm
|
| 170 |
+
demote_candidate = node
|
| 171 |
+
|
| 172 |
+
result_node = node
|
| 173 |
+
|
| 174 |
+
# I/O OUTSIDE LOCK — gap där annan tråd kan försöka access
|
| 175 |
+
if demote_candidate:
|
| 176 |
+
await self._save_to_warm(demote_candidate) # Kan misslyckas
|
| 177 |
+
|
| 178 |
+
async with self.lock:
|
| 179 |
+
if demote_candidate.id in self.hot:
|
| 180 |
+
del self.hot[demote_candidate.id] # Nu borta
|
| 181 |
+
```
|
| 182 |
+
|
| 183 |
+
**PROBLEM:** Tidsfönster mellan "mark as warm" och "delete from hot" där:
|
| 184 |
+
- `get_memory()` kan returnera samma nod två gånger
|
| 185 |
+
- Query kan missa noden under övergången
|
| 186 |
+
|
| 187 |
+
---
|
| 188 |
+
|
| 189 |
+
### 5. **Subconscious AI — Infinite Loop Risk** 🟠
|
| 190 |
+
**Fil:** `subconscious_ai.py` (inte granskad fullt, men config visar risk)
|
| 191 |
+
|
| 192 |
+
```yaml
|
| 193 |
+
subconscious_ai:
|
| 194 |
+
enabled: false # BETA - bra att den är avstängd
|
| 195 |
+
pulse_interval_seconds: 120
|
| 196 |
+
rate_limit_per_hour: 50
|
| 197 |
+
max_memories_per_cycle: 10
|
| 198 |
+
```
|
| 199 |
+
|
| 200 |
+
**Risk:** Om `micro_self_improvement_enabled: true` kan systemet gå in i självförbättringsspiraler.
|
| 201 |
+
|
| 202 |
+
---
|
| 203 |
+
|
| 204 |
+
### 6. **Memory Leak i _vector_cache** 🟠
|
| 205 |
+
**Fil:** `hnsw_index.py:107`
|
| 206 |
+
|
| 207 |
+
```python
|
| 208 |
+
@property
|
| 209 |
+
def _vector_cache(self) -> Dict[str, np.ndarray]:
|
| 210 |
+
if not hasattr(self, "_vcache"):
|
| 211 |
+
object.__setattr__(self, "_vcache", {})
|
| 212 |
+
return self._vcache
|
| 213 |
+
```
|
| 214 |
+
|
| 215 |
+
**PROBLEM:** `_vector_cache` växer obegränsat. Ingen cleanup vid delete eller consolidation.
|
| 216 |
+
|
| 217 |
+
**Fix:**
|
| 218 |
+
```python
|
| 219 |
+
def remove(self, node_id: str) -> None:
|
| 220 |
+
# ... existing code ...
|
| 221 |
+
self._vector_cache.pop(node_id, None) # Finns redan, men verifiera
|
| 222 |
+
```
|
| 223 |
+
|
| 224 |
+
---
|
| 225 |
+
|
| 226 |
+
## 📊 PRESTANDA & SKALBARHET
|
| 227 |
+
|
| 228 |
+
### 7. **O(N) Linear Search Fallback** 🟡
|
| 229 |
+
**Fil:** `tier_manager.py:902+`
|
| 230 |
+
|
| 231 |
+
När HNSW inte är tillgängligt (FAISS ej installerat), faller systemet tillbaka till:
|
| 232 |
+
|
| 233 |
+
```python
|
| 234 |
+
def _linear_search_hot(self, query_vec: BinaryHDV, top_k: int) -> List[Tuple[str, float]]:
|
| 235 |
+
# Inte visad i filen, men nämnd som fallback
|
| 236 |
+
```
|
| 237 |
+
|
| 238 |
+
**Prestandapåverkan:**
|
| 239 |
+
- 2,000 memories (HOT max): ~4ms
|
| 240 |
+
- 10,000 memories: ~20ms
|
| 241 |
+
- 100,000 memories: ~200ms ← Ej acceptabelt för real-time query
|
| 242 |
+
|
| 243 |
+
---
|
| 244 |
+
|
| 245 |
+
### 8. **Qdrant Batch Operations Saknas** 🟡
|
| 246 |
+
**Fil:** `qdrant_store.py`
|
| 247 |
+
|
| 248 |
+
```python
|
| 249 |
+
async def upsert(self, collection: str, points: List[models.PointStruct]):
|
| 250 |
+
await qdrant_breaker.call(
|
| 251 |
+
self.client.upsert, collection_name=collection, points=points
|
| 252 |
+
)
|
| 253 |
+
```
|
| 254 |
+
|
| 255 |
+
**PROBLEM:** Consolidation (`consolidate_warm_to_cold`) gör en-at-a-time deletes istället för batch:
|
| 256 |
+
|
| 257 |
+
```python
|
| 258 |
+
# tier_manager.py:750
|
| 259 |
+
if ids_to_delete:
|
| 260 |
+
await self.qdrant.delete(collection, ids_to_delete) # Bra!
|
| 261 |
+
```
|
| 262 |
+
|
| 263 |
+
Men `list_warm()` och `search()` saknar pagination-optimering.
|
| 264 |
+
|
| 265 |
+
---
|
| 266 |
+
|
| 267 |
+
## 🏗️ ARKITEKTUR & DESIGN
|
| 268 |
+
|
| 269 |
+
### 9. **Dependency Injection — Halvvägs** 🟡
|
| 270 |
+
**Status:** Singletons borttagna, men inte fullt DI
|
| 271 |
+
|
| 272 |
+
**Gott:**
|
| 273 |
+
- `HAIMEngine(config=..., tier_manager=...)` stöder injection
|
| 274 |
+
- `Container` pattern i `container.py`
|
| 275 |
+
|
| 276 |
+
**Dåligt:**
|
| 277 |
+
- `get_config()` är fortfarande global
|
| 278 |
+
- `BinaryHDV.random()` använder global `np.random`
|
| 279 |
+
|
| 280 |
+
**Rekommendation:**
|
| 281 |
+
```python
|
| 282 |
+
class BinaryHDV:
|
| 283 |
+
def __init__(self, data: np.ndarray, dimension: int, rng: Optional[np.random.Generator] = None):
|
| 284 |
+
self._rng = rng or np.random.default_rng()
|
| 285 |
+
```
|
| 286 |
+
|
| 287 |
+
---
|
| 288 |
+
|
| 289 |
+
### 10. **Error Handling — Inkonsekvent** 🟡
|
| 290 |
+
**Filer:** Spridda
|
| 291 |
+
|
| 292 |
+
Vissa funktioner returnerar `None`:
|
| 293 |
+
```python
|
| 294 |
+
async def get_memory(self, node_id: str) -> Optional[MemoryNode]:
|
| 295 |
+
# Returnerar None om ej hittad
|
| 296 |
+
```
|
| 297 |
+
|
| 298 |
+
Andra kastar exceptions:
|
| 299 |
+
```python
|
| 300 |
+
async def delete_memory(self, node_id: str):
|
| 301 |
+
if not node:
|
| 302 |
+
raise MemoryNotFoundError(node_id)
|
| 303 |
+
```
|
| 304 |
+
|
| 305 |
+
**Rekommendation:** Konsekvent mönster:
|
| 306 |
+
- `get_*` → return `Optional[T]` (None = not found)
|
| 307 |
+
- `*_or_raise` → raise exception
|
| 308 |
+
- `delete_*` → return `bool` (deleted or not)
|
| 309 |
+
|
| 310 |
+
---
|
| 311 |
+
|
| 312 |
+
## 🔒 SÄKERHET & ROBUSTHET
|
| 313 |
+
|
| 314 |
+
### 11. **API Key i Env Var — Bra** ✅
|
| 315 |
+
**Fil:** `api/main.py:81`
|
| 316 |
+
|
| 317 |
+
```python
|
| 318 |
+
security = config.security if config else None
|
| 319 |
+
expected_key = (security.api_key if security else None) or os.getenv("HAIM_API_KEY", "")
|
| 320 |
+
```
|
| 321 |
+
|
| 322 |
+
**Gott:** API key måste sättas explicit, fallback till env var.
|
| 323 |
+
|
| 324 |
+
---
|
| 325 |
+
|
| 326 |
+
### 12. **Rate Limiting — Implementerat** ✅
|
| 327 |
+
**Fil:** `api/middleware.py`
|
| 328 |
+
|
| 329 |
+
```python
|
| 330 |
+
class QueryRateLimiter(RateLimiter):
|
| 331 |
+
def __init__(self):
|
| 332 |
+
super().__init__(requests=500, window_seconds=60) # 500/min
|
| 333 |
+
```
|
| 334 |
+
|
| 335 |
+
**Gott:** Separate limits för store/query/concept/analogy.
|
| 336 |
+
|
| 337 |
+
---
|
| 338 |
+
|
| 339 |
+
### 13. **Input Validation — Svag** 🟡
|
| 340 |
+
**Fil:** `api/models.py`
|
| 341 |
+
|
| 342 |
+
```python
|
| 343 |
+
class StoreRequest(BaseModel):
|
| 344 |
+
content: str = Field(..., min_length=1, max_length=100000)
|
| 345 |
+
metadata: Optional[Dict[str, Any]] = None
|
| 346 |
+
```
|
| 347 |
+
|
| 348 |
+
**PROBLEM:** Ingen validering av `metadata`-innehåll. Kan innehålla:
|
| 349 |
+
- Ogiltiga UTF-8 characters
|
| 350 |
+
- Recursive structures
|
| 351 |
+
- Sensitive data leaks
|
| 352 |
+
|
| 353 |
+
**Fix:**
|
| 354 |
+
```python
|
| 355 |
+
from pydantic import field_validator
|
| 356 |
+
|
| 357 |
+
class StoreRequest(BaseModel):
|
| 358 |
+
@field_validator('metadata')
|
| 359 |
+
@classmethod
|
| 360 |
+
def validate_metadata(cls, v):
|
| 361 |
+
if v and len(str(v)) > 10000: # Max 10KB metadata
|
| 362 |
+
raise ValueError('Metadata too large')
|
| 363 |
+
return v
|
| 364 |
+
```
|
| 365 |
+
|
| 366 |
+
---
|
| 367 |
+
|
| 368 |
+
## 📝 KODKVALITET
|
| 369 |
+
|
| 370 |
+
### 14. **Test Coverage — 39 Passing** ✅
|
| 371 |
+
**Fil:** `test_regression_output.txt`
|
| 372 |
+
|
| 373 |
+
```
|
| 374 |
+
39 passed, 5 warnings in 3.47s
|
| 375 |
+
```
|
| 376 |
+
|
| 377 |
+
**Gott:** Alla tester passerar. Men:
|
| 378 |
+
- Inga tester för HNSW upgrade path
|
| 379 |
+
- Inga tester för concurrent access
|
| 380 |
+
- Inga tester för Qdrant integration (kräver live Qdrant)
|
| 381 |
+
|
| 382 |
+
---
|
| 383 |
+
|
| 384 |
+
### 15. **Documentation — Komplett** ✅
|
| 385 |
+
**Filer:** `README.md` (43KB), `CHANGELOG.md`, inline docs
|
| 386 |
+
|
| 387 |
+
**Gott:** Dokumentation är omfattande och uppdaterad.
|
| 388 |
+
|
| 389 |
+
---
|
| 390 |
+
|
| 391 |
+
## 🎯 PRIORITERAD FIX-LISTA
|
| 392 |
+
|
| 393 |
+
| Prioritet | Problem | Fil | Estimerad tid |
|
| 394 |
+
|-----------|---------|-----|---------------|
|
| 395 |
+
| 🔴 P0 | Position mapping bug | `hnsw_index.py` | 2h |
|
| 396 |
+
| 🔴 P0 | Token normalization | `binary_hdv.py` | 30min |
|
| 397 |
+
| 🔴 P0 | Vector cache vid upgrade | `hnsw_index.py` | 1h |
|
| 398 |
+
| 🟠 P1 | Qdrant distance mismatch | `config.yaml` + `qdrant_store.py` | 2h |
|
| 399 |
+
| 🟠 P1 | Demotion race condition | `tier_manager.py` | 3h |
|
| 400 |
+
| 🟡 P2 | Linear search fallback | `tier_manager.py` | 4h |
|
| 401 |
+
| 🟡 P2 | Memory leak _vector_cache | `hnsw_index.py` | 30min |
|
| 402 |
+
|
| 403 |
+
---
|
| 404 |
+
|
| 405 |
+
## 🔧 REKOMMENDERAD ACTION PLAN
|
| 406 |
+
|
| 407 |
+
### Fas 1: Query Fix (Dag 1)
|
| 408 |
+
1. **Fixa `_position_to_node_id()`** — Använd insättningsordning istället för sorted IDs
|
| 409 |
+
2. **Fixa `TextEncoder.encode()`** — Konsekvent tokenisering med regex
|
| 410 |
+
3. **Alltid cacha vektorer** — Ta bort conditional `_vector_cache`
|
| 411 |
+
|
| 412 |
+
### Fas 2: Qdrant Alignment (Dag 2)
|
| 413 |
+
1. **Ändra distance metric** — `Distance.DOT` för binary vectors
|
| 414 |
+
2. **Verifiera vector unpacking** — Säkerställ 16,384 → 2,048 byte mapping
|
| 415 |
+
|
| 416 |
+
### Fas 3: Hardening (Dag 3)
|
| 417 |
+
1. **Lägg till HNSW upgrade tester**
|
| 418 |
+
2. **Fixa demotion race condition**
|
| 419 |
+
3. **Input validation för metadata**
|
| 420 |
+
|
| 421 |
+
---
|
| 422 |
+
|
| 423 |
+
## 📋 SUMMARY
|
| 424 |
+
|
| 425 |
+
**Total kod:** ~25,000 LOC (src/)
|
| 426 |
+
**Tester:** 39 passing
|
| 427 |
+
**Kritiska buggar:** 3
|
| 428 |
+
**Höga risker:** 4
|
| 429 |
+
**Medel risker:** 5
|
| 430 |
+
|
| 431 |
+
**Verdict:** v4.5.0-beta är **inte production-ready**. Query-flödet har 3 kritiska buggar som förhindrar korrekt retrieval. Arkitekturen är solid, men implementationen av HNSW/index-mapping behöver omskrivas.
|
| 432 |
+
|
| 433 |
+
**Rekommendation:**
|
| 434 |
+
1. Omedelbart fixa P0-issues (4-5 timmars arbete)
|
| 435 |
+
2. Kör regression tests
|
| 436 |
+
3. Deploy till staging för validering
|
| 437 |
+
4. Sätt Opus 4.6 + Gemini 3.1 på Fas 1-3
|
| 438 |
+
|
| 439 |
+
---
|
| 440 |
+
|
| 441 |
+
*Review genererad av Omega (GLM-5) för Robin Granberg*
|
| 442 |
+
*Senast uppdaterad: 2026-02-20 07:45 CET*
|
ROADMAP.md
ADDED
|
@@ -0,0 +1,152 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# MnemoCore Roadmap
|
| 2 |
+
|
| 3 |
+
**Open Source Infrastructure for Persistent Cognitive Memory**
|
| 4 |
+
|
| 5 |
+
Version: 4.5.0-beta | Updated: 2026-02-20
|
| 6 |
+
|
| 7 |
+
---
|
| 8 |
+
|
| 9 |
+
## Vision
|
| 10 |
+
|
| 11 |
+
MnemoCore provides the foundational memory layer for cognitive AI systems —
|
| 12 |
+
a production-ready, self-hosted alternative to cloud-dependent memory solutions.
|
| 13 |
+
|
| 14 |
+
---
|
| 15 |
+
|
| 16 |
+
## Current Status (v4.5.0-beta)
|
| 17 |
+
|
| 18 |
+
| Component | Status |
|
| 19 |
+
|-----------|--------|
|
| 20 |
+
| Binary HDV Engine | ✅ Stable |
|
| 21 |
+
| Tiered Storage (HOT/WARM/COLD) | ✅ Functional |
|
| 22 |
+
| HNSW Index | ✅ Working |
|
| 23 |
+
| Query/Store API | ✅ Operational |
|
| 24 |
+
| Qdrant Integration | ✅ Available |
|
| 25 |
+
| MCP Server | 🟡 Beta |
|
| 26 |
+
| PyPI Distribution | 🟡 Pending |
|
| 27 |
+
|
| 28 |
+
---
|
| 29 |
+
|
| 30 |
+
## Phase 5: Production Hardening
|
| 31 |
+
|
| 32 |
+
**Goal:** Battle-tested, enterprise-ready release
|
| 33 |
+
|
| 34 |
+
### 5.1 Stability & Testing
|
| 35 |
+
- [ ] Increase test coverage to 80%+
|
| 36 |
+
- [ ] Add integration tests for Qdrant backend
|
| 37 |
+
- [ ] Stress test with 100k+ memories
|
| 38 |
+
- [ ] Add chaos engineering tests (network failures, disk full)
|
| 39 |
+
|
| 40 |
+
### 5.2 Performance Optimization
|
| 41 |
+
- [ ] Benchmark query latency at scale
|
| 42 |
+
- [ ] Optimize HNSW index rebuild time
|
| 43 |
+
- [ ] Add batch operation endpoints
|
| 44 |
+
- [ ] Profile and reduce memory footprint
|
| 45 |
+
|
| 46 |
+
### 5.3 Developer Experience
|
| 47 |
+
- [ ] Complete API documentation (OpenAPI spec)
|
| 48 |
+
- [ ] Add usage examples for common patterns
|
| 49 |
+
- [ ] Create quickstart guide
|
| 50 |
+
- [ ] Add Jupyter notebook tutorials
|
| 51 |
+
|
| 52 |
+
### 5.4 Operations
|
| 53 |
+
- [ ] Docker Compose production config
|
| 54 |
+
- [ ] Kubernetes Helm chart
|
| 55 |
+
- [ ] Prometheus metrics endpoint
|
| 56 |
+
- [ ] Health check hardening
|
| 57 |
+
|
| 58 |
+
**ETA:** 2-3 weeks
|
| 59 |
+
|
| 60 |
+
---
|
| 61 |
+
|
| 62 |
+
## Phase 6: Feature Expansion
|
| 63 |
+
|
| 64 |
+
**Goal:** More cognitive capabilities
|
| 65 |
+
|
| 66 |
+
### 6.1 Advanced Retrieval
|
| 67 |
+
- [ ] Temporal queries ("memories from last week")
|
| 68 |
+
- [ ] Multi-hop associative recall
|
| 69 |
+
- [ ] Contextual ranking (personalized relevance)
|
| 70 |
+
- [ ] Negation queries ("NOT about project X")
|
| 71 |
+
|
| 72 |
+
### 6.2 Memory Enrichment
|
| 73 |
+
- [ ] Auto-tagging via LLM
|
| 74 |
+
- [ ] Entity extraction (names, dates, concepts)
|
| 75 |
+
- [ ] Sentiment scoring
|
| 76 |
+
- [ ] Importance classification
|
| 77 |
+
|
| 78 |
+
### 6.3 Multi-Modal Support
|
| 79 |
+
- [ ] Image embedding storage
|
| 80 |
+
- [ ] Audio transcript indexing
|
| 81 |
+
- [ ] Document chunk management
|
| 82 |
+
|
| 83 |
+
**ETA:** 4-6 weeks
|
| 84 |
+
|
| 85 |
+
---
|
| 86 |
+
|
| 87 |
+
## Phase 7: Ecosystem
|
| 88 |
+
|
| 89 |
+
**Goal:** Easy integration with existing AI stacks
|
| 90 |
+
|
| 91 |
+
### 7.1 Integrations
|
| 92 |
+
- [ ] LangChain memory adapter
|
| 93 |
+
- [ ] LlamaIndex integration
|
| 94 |
+
- [ ] OpenAI Assistants API compatible
|
| 95 |
+
- [ ] Claude MCP protocol
|
| 96 |
+
|
| 97 |
+
### 7.2 SDKs
|
| 98 |
+
- [ ] Python SDK (official)
|
| 99 |
+
- [ ] TypeScript/JavaScript SDK
|
| 100 |
+
- [ ] Go SDK
|
| 101 |
+
- [ ] Rust SDK
|
| 102 |
+
|
| 103 |
+
### 7.3 Community
|
| 104 |
+
- [ ] Discord/Slack community
|
| 105 |
+
- [ ] Contributing guide
|
| 106 |
+
- [ ] Feature request process
|
| 107 |
+
- [ ] Regular release cadence
|
| 108 |
+
|
| 109 |
+
**ETA:** 8-12 weeks
|
| 110 |
+
|
| 111 |
+
---
|
| 112 |
+
|
| 113 |
+
## Long-Term Vision (Phase 8+)
|
| 114 |
+
|
| 115 |
+
### Research Directions
|
| 116 |
+
- [ ] Hierarchical memory (episodic → semantic → procedural)
|
| 117 |
+
- [ ] Forgetting curves with spaced repetition
|
| 118 |
+
- [ ] Dream consolidation during idle cycles
|
| 119 |
+
- [ ] Meta-learning from usage patterns
|
| 120 |
+
|
| 121 |
+
### Platform
|
| 122 |
+
- [ ] Managed cloud offering (optional)
|
| 123 |
+
- [ ] Multi-tenant support
|
| 124 |
+
- [ ] Federation across nodes
|
| 125 |
+
- [ ] Privacy-preserving memory sharing
|
| 126 |
+
|
| 127 |
+
---
|
| 128 |
+
|
| 129 |
+
## Release Schedule
|
| 130 |
+
|
| 131 |
+
| Version | Target | Focus |
|
| 132 |
+
|---------|--------|-------|
|
| 133 |
+
| v4.5.0 | Current | Beta stabilization |
|
| 134 |
+
| v5.0.0 | +2 weeks | Production ready |
|
| 135 |
+
| v5.1.0 | +4 weeks | Performance + DX |
|
| 136 |
+
| v6.0.0 | +6 weeks | Feature expansion |
|
| 137 |
+
| v7.0.0 | +10 weeks | Ecosystem |
|
| 138 |
+
|
| 139 |
+
---
|
| 140 |
+
|
| 141 |
+
## Contributing
|
| 142 |
+
|
| 143 |
+
MnemoCore is open source under MIT license.
|
| 144 |
+
|
| 145 |
+
- **GitHub:** https://github.com/RobinALG87/MnemoCore-Infrastructure-for-Persistent-Cognitive-Memory
|
| 146 |
+
- **PyPI:** `pip install mnemocore`
|
| 147 |
+
- **Issues:** Use GitHub Issues for bugs and feature requests
|
| 148 |
+
- **PRs:** Welcome! See CONTRIBUTING.md
|
| 149 |
+
|
| 150 |
+
---
|
| 151 |
+
|
| 152 |
+
*Roadmap maintained by Robin Granberg & Omega*
|
config.yaml
CHANGED
|
@@ -13,7 +13,7 @@ haim:
|
|
| 13 |
# Memory tier thresholds
|
| 14 |
tiers:
|
| 15 |
hot:
|
| 16 |
-
max_memories:
|
| 17 |
ltp_threshold_min: 0.7
|
| 18 |
eviction_policy: "lru"
|
| 19 |
|
|
@@ -135,12 +135,12 @@ haim:
|
|
| 135 |
# =========================================================================
|
| 136 |
subconscious_ai:
|
| 137 |
# BETA FEATURE - Must be explicitly enabled
|
| 138 |
-
enabled:
|
| 139 |
beta_mode: true
|
| 140 |
|
| 141 |
# Model configuration
|
| 142 |
model_provider: "ollama" # ollama | lm_studio | openai_api | anthropic_api
|
| 143 |
-
model_name: "phi3.5:
|
| 144 |
model_url: "http://localhost:11434"
|
| 145 |
# api_key: null # For API providers
|
| 146 |
# api_base_url: null
|
|
@@ -152,16 +152,16 @@ haim:
|
|
| 152 |
|
| 153 |
# Resource management
|
| 154 |
max_cpu_percent: 30.0
|
| 155 |
-
cycle_timeout_seconds:
|
| 156 |
rate_limit_per_hour: 50
|
| 157 |
|
| 158 |
# Operations
|
| 159 |
memory_sorting_enabled: true
|
| 160 |
enhanced_dreaming_enabled: true
|
| 161 |
-
micro_self_improvement_enabled:
|
| 162 |
|
| 163 |
# Safety
|
| 164 |
-
dry_run:
|
| 165 |
log_all_decisions: true
|
| 166 |
audit_trail_path: "./data/subconscious_audit.jsonl"
|
| 167 |
max_memories_per_cycle: 10
|
|
|
|
| 13 |
# Memory tier thresholds
|
| 14 |
tiers:
|
| 15 |
hot:
|
| 16 |
+
max_memories: 3000
|
| 17 |
ltp_threshold_min: 0.7
|
| 18 |
eviction_policy: "lru"
|
| 19 |
|
|
|
|
| 135 |
# =========================================================================
|
| 136 |
subconscious_ai:
|
| 137 |
# BETA FEATURE - Must be explicitly enabled
|
| 138 |
+
enabled: true
|
| 139 |
beta_mode: true
|
| 140 |
|
| 141 |
# Model configuration
|
| 142 |
model_provider: "ollama" # ollama | lm_studio | openai_api | anthropic_api
|
| 143 |
+
model_name: "phi3.5:latest"
|
| 144 |
model_url: "http://localhost:11434"
|
| 145 |
# api_key: null # For API providers
|
| 146 |
# api_base_url: null
|
|
|
|
| 152 |
|
| 153 |
# Resource management
|
| 154 |
max_cpu_percent: 30.0
|
| 155 |
+
cycle_timeout_seconds: 120
|
| 156 |
rate_limit_per_hour: 50
|
| 157 |
|
| 158 |
# Operations
|
| 159 |
memory_sorting_enabled: true
|
| 160 |
enhanced_dreaming_enabled: true
|
| 161 |
+
micro_self_improvement_enabled: true # Enabled in this revision (previously disabled by default)
|
| 162 |
|
| 163 |
# Safety
|
| 164 |
+
dry_run: false
|
| 165 |
log_all_decisions: true
|
| 166 |
audit_trail_path: "./data/subconscious_audit.jsonl"
|
| 167 |
max_memories_per_cycle: 10
|
data/subconscious_audit.jsonl
CHANGED
|
The diff for this file is too large to render.
See raw diff
|
|
|
docs/PATTERN_LEARNER_SPEC.md
ADDED
|
@@ -0,0 +1,553 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# MnemoCore Pattern Learner — Specification Draft
|
| 2 |
+
|
| 3 |
+
**Version:** 0.1-draft
|
| 4 |
+
**Date:** 2026-02-20
|
| 5 |
+
**Status:** Draft for Review
|
| 6 |
+
**Author:** Omega (GLM-5) for Robin Granberg
|
| 7 |
+
|
| 8 |
+
---
|
| 9 |
+
|
| 10 |
+
## Executive Summary
|
| 11 |
+
|
| 12 |
+
Pattern Learner är en MnemoCore-modul som lär sig från användarinteraktioner **utan att lagra persondata**. Den extraherar statistiska mönster, topic clustering och kvalitetsmetrics som kan användas för att förbättra chatbot-performance över tid.
|
| 13 |
+
|
| 14 |
+
**Key principle:** Learn patterns, forget people.
|
| 15 |
+
|
| 16 |
+
---
|
| 17 |
+
|
| 18 |
+
## Problem Statement
|
| 19 |
+
|
| 20 |
+
### Healthcare Chatbot Challenges
|
| 21 |
+
|
| 22 |
+
| Utmaning | Konsekvens |
|
| 23 |
+
|----------|------------|
|
| 24 |
+
| GDPR/HIPAA compliance | Kan inte lagra konversationer |
|
| 25 |
+
| Multitenancy | Data får inte läcka mellan kliniker |
|
| 26 |
+
| Quality improvement | Behöver veta vad som fungerar |
|
| 27 |
+
| Knowledge gaps | Behöver identifiera vad som saknas i docs |
|
| 28 |
+
|
| 29 |
+
### Current Solutions (Limitations)
|
| 30 |
+
|
| 31 |
+
- **Stateless RAG:** Ingen inlärning alls
|
| 32 |
+
- **Full memory:** GDPR-risk, sekretessproblem
|
| 33 |
+
- **Manual analytics:** Tidskrävande, inte real-time
|
| 34 |
+
|
| 35 |
+
---
|
| 36 |
+
|
| 37 |
+
## Solution: Pattern Learner
|
| 38 |
+
|
| 39 |
+
### Core Concept
|
| 40 |
+
|
| 41 |
+
```
|
| 42 |
+
User Query ──► Anonymize ──► Extract Pattern ──► Aggregate
|
| 43 |
+
│
|
| 44 |
+
└── PII removed before storage
|
| 45 |
+
```
|
| 46 |
+
|
| 47 |
+
**What IS stored:**
|
| 48 |
+
- Topic clusters (anonymized)
|
| 49 |
+
- Query frequency distributions
|
| 50 |
+
- Response quality aggregates
|
| 51 |
+
- Knowledge gap indicators
|
| 52 |
+
|
| 53 |
+
**What is NOT stored:**
|
| 54 |
+
- User identities
|
| 55 |
+
- Clinic associations
|
| 56 |
+
- Patient data
|
| 57 |
+
- Raw conversations
|
| 58 |
+
|
| 59 |
+
---
|
| 60 |
+
|
| 61 |
+
## Architecture
|
| 62 |
+
|
| 63 |
+
### High-Level Design
|
| 64 |
+
|
| 65 |
+
```
|
| 66 |
+
┌─────────────────────────────────────────────────────────────┐
|
| 67 |
+
│ Pattern Learner Module │
|
| 68 |
+
├─────────────────────────────────────────────────────────────┤
|
| 69 |
+
│ │
|
| 70 |
+
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
|
| 71 |
+
│ │ Anonymizer │───►│Topic Extractor│───►│ Aggregator │ │
|
| 72 |
+
│ └──────────────┘ └──────────────┘ └──────────────┘ │
|
| 73 |
+
│ │ │ │ │
|
| 74 |
+
│ │ ▼ ▼ │
|
| 75 |
+
│ │ ┌──────────────┐ ┌──────────────┐ │
|
| 76 |
+
│ │ │Topic Embedder│ │ Stats Store │ │
|
| 77 |
+
│ │ │ (MnemoCore) │ │ (Encrypted) │ │
|
| 78 |
+
│ │ └──────────────┘ └──────────────┘ │
|
| 79 |
+
│ │ │ │ │
|
| 80 |
+
│ └───────────────────┴────────────────────┘ │
|
| 81 |
+
│ │ │
|
| 82 |
+
│ ▼ │
|
| 83 |
+
│ ┌──────────────┐ │
|
| 84 |
+
│ │ Insights API│ │
|
| 85 |
+
│ └──────────────┘ │
|
| 86 |
+
│ │
|
| 87 |
+
└─────────────────────────────────────────────────────────────┘
|
| 88 |
+
```
|
| 89 |
+
|
| 90 |
+
### Components
|
| 91 |
+
|
| 92 |
+
#### 1. Anonymizer
|
| 93 |
+
|
| 94 |
+
**Purpose:** Remove all PII before processing
|
| 95 |
+
|
| 96 |
+
**Methods:**
|
| 97 |
+
- Named Entity Recognition (NER) for person names
|
| 98 |
+
- Pattern matching for phone numbers, addresses
|
| 99 |
+
- Clinic/organization detection
|
| 100 |
+
- Session ID hashing
|
| 101 |
+
|
| 102 |
+
```python
|
| 103 |
+
class Anonymizer:
|
| 104 |
+
"""Remove PII from queries before pattern extraction"""
|
| 105 |
+
|
| 106 |
+
def __init__(self):
|
| 107 |
+
self.ner_model = load_ner_model("sv") # Swedish
|
| 108 |
+
self.patterns = {
|
| 109 |
+
"phone": r"\+?\d{1,3}[\s-]?\d{2,4}[\s-]?\d{2,4}[\s-]?\d{2,4}",
|
| 110 |
+
"email": r"[\w\.-]+@[\w\.-]+\.\w+",
|
| 111 |
+
"personal_number": r"\d{6,8}[-\s]?\d{4}",
|
| 112 |
+
}
|
| 113 |
+
|
| 114 |
+
def anonymize(self, text: str) -> str:
|
| 115 |
+
"""Remove all PII from text"""
|
| 116 |
+
|
| 117 |
+
# 1. NER for names
|
| 118 |
+
entities = self.ner_model.extract(text)
|
| 119 |
+
for entity in entities:
|
| 120 |
+
if entity.type in ["PER", "ORG"]:
|
| 121 |
+
text = text.replace(entity.text, "[ANON]")
|
| 122 |
+
|
| 123 |
+
# 2. Pattern matching
|
| 124 |
+
for pattern_type, pattern in self.patterns.items():
|
| 125 |
+
text = re.sub(pattern, f"[{pattern_type.upper()}]", text)
|
| 126 |
+
|
| 127 |
+
# 3. Remove clinic names (configurable blacklist)
|
| 128 |
+
for clinic_name in self.clinic_blacklist:
|
| 129 |
+
text = text.replace(clinic_name, "[KLINIK]")
|
| 130 |
+
|
| 131 |
+
return text
|
| 132 |
+
```
|
| 133 |
+
|
| 134 |
+
---
|
| 135 |
+
|
| 136 |
+
#### 2. Topic Extractor
|
| 137 |
+
|
| 138 |
+
**Purpose:** Extract semantic topics from anonymized queries
|
| 139 |
+
|
| 140 |
+
**Methods:**
|
| 141 |
+
- Keyword extraction (TF-IDF)
|
| 142 |
+
- Topic modeling (LDA, BERTopic)
|
| 143 |
+
- Embedding-based clustering
|
| 144 |
+
|
| 145 |
+
```python
|
| 146 |
+
class TopicExtractor:
|
| 147 |
+
"""Extract topics from anonymized queries"""
|
| 148 |
+
|
| 149 |
+
def __init__(self, mnemocore_engine):
|
| 150 |
+
self.engine = mnemocore_engine
|
| 151 |
+
self.topic_threshold = 0.5
|
| 152 |
+
|
| 153 |
+
async def extract_topics(self, query: str) -> List[str]:
|
| 154 |
+
"""Extract topics from anonymized query"""
|
| 155 |
+
|
| 156 |
+
# 1. Get keywords
|
| 157 |
+
keywords = self._extract_keywords(query)
|
| 158 |
+
|
| 159 |
+
# 2. Find similar topics in MnemoCore
|
| 160 |
+
similar = await self.engine.query(query, top_k=5)
|
| 161 |
+
|
| 162 |
+
# 3. Cluster into topics
|
| 163 |
+
topics = []
|
| 164 |
+
for memory_id, similarity in similar:
|
| 165 |
+
if similarity > self.topic_threshold:
|
| 166 |
+
memory = await self.engine.get_memory(memory_id)
|
| 167 |
+
topics.extend(memory.metadata.get("topics", []))
|
| 168 |
+
|
| 169 |
+
# 4. Deduplicate
|
| 170 |
+
return list(set(topics + keywords))
|
| 171 |
+
|
| 172 |
+
def _extract_keywords(self, text: str) -> List[str]:
|
| 173 |
+
"""Extract keywords using TF-IDF"""
|
| 174 |
+
# Simple implementation
|
| 175 |
+
words = text.lower().split()
|
| 176 |
+
return [w for w in words if len(w) > 3 and w not in STOPWORDS_SV]
|
| 177 |
+
```
|
| 178 |
+
|
| 179 |
+
---
|
| 180 |
+
|
| 181 |
+
#### 3. Aggregator
|
| 182 |
+
|
| 183 |
+
**Purpose:** Store statistical patterns without PII
|
| 184 |
+
|
| 185 |
+
**Data structures:**
|
| 186 |
+
|
| 187 |
+
```python
|
| 188 |
+
@dataclass
|
| 189 |
+
class TopicStats:
|
| 190 |
+
"""Statistics for a topic"""
|
| 191 |
+
topic: str
|
| 192 |
+
count: int = 0
|
| 193 |
+
first_seen: datetime = None
|
| 194 |
+
last_seen: datetime = None
|
| 195 |
+
trend: float = 0.0 # Recent increase/decrease
|
| 196 |
+
|
| 197 |
+
@dataclass
|
| 198 |
+
class ResponseQuality:
|
| 199 |
+
"""Aggregated response quality (no individual ratings)"""
|
| 200 |
+
response_signature: str # Hash of response template
|
| 201 |
+
avg_rating: float = 0.5
|
| 202 |
+
sample_count: int = 0
|
| 203 |
+
last_updated: datetime = None
|
| 204 |
+
|
| 205 |
+
@dataclass
|
| 206 |
+
class KnowledgeGap:
|
| 207 |
+
"""Topics with no good answers"""
|
| 208 |
+
topic: str
|
| 209 |
+
query_count: int = 0
|
| 210 |
+
failure_rate: float = 1.0 # % of queries that got "I don't know"
|
| 211 |
+
suggested_action: str = "" # "add documentation", "improve answer"
|
| 212 |
+
```
|
| 213 |
+
|
| 214 |
+
**Storage:**
|
| 215 |
+
|
| 216 |
+
```python
|
| 217 |
+
class PatternStore:
|
| 218 |
+
"""Store patterns (encrypted, no PII)"""
|
| 219 |
+
|
| 220 |
+
def __init__(self, encryption_key: bytes):
|
| 221 |
+
self.key = encryption_key
|
| 222 |
+
self.topics: Dict[str, TopicStats] = {}
|
| 223 |
+
self.qualities: Dict[str, ResponseQuality] = {}
|
| 224 |
+
self.gaps: Dict[str, KnowledgeGap] = {}
|
| 225 |
+
|
| 226 |
+
def record_topic(self, topic: str):
|
| 227 |
+
"""Record that a topic was queried"""
|
| 228 |
+
if topic not in self.topics:
|
| 229 |
+
self.topics[topic] = TopicStats(
|
| 230 |
+
topic=topic,
|
| 231 |
+
first_seen=datetime.utcnow()
|
| 232 |
+
)
|
| 233 |
+
|
| 234 |
+
stats = self.topics[topic]
|
| 235 |
+
stats.count += 1
|
| 236 |
+
stats.last_seen = datetime.utcnow()
|
| 237 |
+
|
| 238 |
+
def record_quality(self, response_sig: str, rating: int):
|
| 239 |
+
"""Record response quality (aggregated)"""
|
| 240 |
+
if response_sig not in self.qualities:
|
| 241 |
+
self.qualities[response_sig] = ResponseQuality(
|
| 242 |
+
response_signature=response_sig
|
| 243 |
+
)
|
| 244 |
+
|
| 245 |
+
q = self.qualities[response_sig]
|
| 246 |
+
# Exponential moving average
|
| 247 |
+
q.avg_rating = 0.9 * q.avg_rating + 0.1 * (rating / 5.0)
|
| 248 |
+
q.sample_count += 1
|
| 249 |
+
q.last_updated = datetime.utcnow()
|
| 250 |
+
|
| 251 |
+
def record_gap(self, topic: str, had_answer: bool):
|
| 252 |
+
"""Record knowledge gap"""
|
| 253 |
+
if topic not in self.gaps:
|
| 254 |
+
self.gaps[topic] = KnowledgeGap(topic=topic)
|
| 255 |
+
|
| 256 |
+
gap = self.gaps[topic]
|
| 257 |
+
gap.query_count += 1
|
| 258 |
+
if not had_answer:
|
| 259 |
+
gap.failure_rate = (gap.failure_rate * (gap.query_count - 1) + 1) / gap.query_count
|
| 260 |
+
else:
|
| 261 |
+
gap.failure_rate = (gap.failure_rate * (gap.query_count - 1)) / gap.query_count
|
| 262 |
+
```
|
| 263 |
+
|
| 264 |
+
---
|
| 265 |
+
|
| 266 |
+
#### 4. Insights API
|
| 267 |
+
|
| 268 |
+
**Purpose:** Provide actionable insights to admins/developers
|
| 269 |
+
|
| 270 |
+
**Endpoints:**
|
| 271 |
+
|
| 272 |
+
```python
|
| 273 |
+
# GET /insights/topics?top_k=10
|
| 274 |
+
{
|
| 275 |
+
"topics": [
|
| 276 |
+
{"topic": "implantat", "count": 1250, "trend": 0.15},
|
| 277 |
+
{"topic": "rotfyllning", "count": 980, "trend": -0.02},
|
| 278 |
+
{"topic": "priser", "count": 850, "trend": 0.30}
|
| 279 |
+
],
|
| 280 |
+
"period": "30d"
|
| 281 |
+
}
|
| 282 |
+
|
| 283 |
+
# GET /insights/gaps
|
| 284 |
+
{
|
| 285 |
+
"knowledge_gaps": [
|
| 286 |
+
{
|
| 287 |
+
"topic": "tandreglering vuxna",
|
| 288 |
+
"query_count": 145,
|
| 289 |
+
"failure_rate": 0.85,
|
| 290 |
+
"suggested_action": "add documentation"
|
| 291 |
+
},
|
| 292 |
+
{
|
| 293 |
+
"topic": "akut tandvård",
|
| 294 |
+
"query_count": 89,
|
| 295 |
+
"failure_rate": 0.72,
|
| 296 |
+
"suggested_action": "improve answer"
|
| 297 |
+
}
|
| 298 |
+
]
|
| 299 |
+
}
|
| 300 |
+
|
| 301 |
+
# GET /insights/quality
|
| 302 |
+
{
|
| 303 |
+
"top_responses": [
|
| 304 |
+
{"signature": "abc123", "avg_rating": 4.8, "sample_count": 520},
|
| 305 |
+
{"signature": "def456", "avg_rating": 4.5, "sample_count": 340}
|
| 306 |
+
],
|
| 307 |
+
"worst_responses": [
|
| 308 |
+
{"signature": "xyz789", "avg_rating": 2.1, "sample_count": 45}
|
| 309 |
+
]
|
| 310 |
+
}
|
| 311 |
+
```
|
| 312 |
+
|
| 313 |
+
---
|
| 314 |
+
|
| 315 |
+
## MnemoCore Integration
|
| 316 |
+
|
| 317 |
+
### Usage Pattern
|
| 318 |
+
|
| 319 |
+
```python
|
| 320 |
+
from mnemocore import HAIMEngine
|
| 321 |
+
from mnemocore.pattern_learner import PatternLearner
|
| 322 |
+
|
| 323 |
+
# Initialize MnemoCore (stores topic embeddings)
|
| 324 |
+
engine = HAIMEngine(dimension=16384)
|
| 325 |
+
await engine.initialize()
|
| 326 |
+
|
| 327 |
+
# Initialize Pattern Learner
|
| 328 |
+
learner = PatternLearner(
|
| 329 |
+
engine=engine,
|
| 330 |
+
encryption_key=get_encryption_key(),
|
| 331 |
+
anonymizer=Anonymizer()
|
| 332 |
+
)
|
| 333 |
+
|
| 334 |
+
# Process a query (automatic learning)
|
| 335 |
+
async def handle_query(user_query: str, tenant_id: str):
|
| 336 |
+
# 1. Anonymize
|
| 337 |
+
anon_query = learner.anonymize(user_query)
|
| 338 |
+
|
| 339 |
+
# 2. Extract patterns (no PII)
|
| 340 |
+
topics = await learner.extract_topics(anon_query)
|
| 341 |
+
|
| 342 |
+
# 3. Record topic usage
|
| 343 |
+
for topic in topics:
|
| 344 |
+
learner.record_topic(topic)
|
| 345 |
+
|
| 346 |
+
# 4. Get answer from RAG
|
| 347 |
+
answer = await rag_lookup(anon_query)
|
| 348 |
+
|
| 349 |
+
# 5. Record if we had an answer
|
| 350 |
+
learner.record_gap(
|
| 351 |
+
topic=topics[0] if topics else "unknown",
|
| 352 |
+
had_answer=(answer is not None)
|
| 353 |
+
)
|
| 354 |
+
|
| 355 |
+
return answer
|
| 356 |
+
|
| 357 |
+
# Get insights (admin only)
|
| 358 |
+
async def get_dashboard():
|
| 359 |
+
top_topics = learner.get_top_topics(10)
|
| 360 |
+
gaps = learner.get_knowledge_gaps()
|
| 361 |
+
quality = learner.get_response_quality()
|
| 362 |
+
|
| 363 |
+
return {
|
| 364 |
+
"popular_topics": top_topics,
|
| 365 |
+
"needs_documentation": gaps,
|
| 366 |
+
"response_performance": quality
|
| 367 |
+
}
|
| 368 |
+
```
|
| 369 |
+
|
| 370 |
+
---
|
| 371 |
+
|
| 372 |
+
## GDPR Compliance
|
| 373 |
+
|
| 374 |
+
### Data Minimization
|
| 375 |
+
|
| 376 |
+
| Data Type | Stored? | Justification |
|
| 377 |
+
|-----------|---------|---------------|
|
| 378 |
+
| Raw queries | ❌ | PII risk |
|
| 379 |
+
| User IDs | ❌ | Not needed |
|
| 380 |
+
| Session IDs | ❌ | Not needed |
|
| 381 |
+
| Clinic IDs | ❌ | Not needed |
|
| 382 |
+
| **Topic labels** | ✅ | Anonymized |
|
| 383 |
+
| **Topic counts** | ✅ | Statistical |
|
| 384 |
+
| **Quality scores** | ✅ | Aggregated |
|
| 385 |
+
| **Gap indicators** | ✅ | Anonymized |
|
| 386 |
+
|
| 387 |
+
### Right to Erasure (GDPR Art 17)
|
| 388 |
+
|
| 389 |
+
Since no PII is stored, right to erasure is **automatically satisfied**.
|
| 390 |
+
|
| 391 |
+
### Data Retention
|
| 392 |
+
|
| 393 |
+
```python
|
| 394 |
+
# Configurable retention
|
| 395 |
+
retention_policy = {
|
| 396 |
+
"topic_stats": "365d", # Keep for 1 year
|
| 397 |
+
"quality_scores": "90d", # Keep for 3 months
|
| 398 |
+
"gap_indicators": "30d", # Refresh monthly
|
| 399 |
+
}
|
| 400 |
+
|
| 401 |
+
# Automatic cleanup
|
| 402 |
+
async def cleanup_old_patterns():
|
| 403 |
+
cutoff = datetime.utcnow() - timedelta(days=retention_policy["topic_stats"])
|
| 404 |
+
for topic, stats in learner.topics.items():
|
| 405 |
+
if stats.last_seen < cutoff:
|
| 406 |
+
del learner.topics[topic]
|
| 407 |
+
```
|
| 408 |
+
|
| 409 |
+
---
|
| 410 |
+
|
| 411 |
+
## Security Considerations
|
| 412 |
+
|
| 413 |
+
### Encryption
|
| 414 |
+
|
| 415 |
+
- All pattern data encrypted at rest (AES-256)
|
| 416 |
+
- Encryption keys managed via HSM or Azure Key Vault
|
| 417 |
+
- Per-tenant encryption optional (for multi-tenant isolation)
|
| 418 |
+
|
| 419 |
+
### Access Control
|
| 420 |
+
|
| 421 |
+
```python
|
| 422 |
+
# Insights API requires admin role
|
| 423 |
+
@app.get("/insights/topics")
|
| 424 |
+
@require_role("admin")
|
| 425 |
+
async def get_topics():
|
| 426 |
+
return learner.get_top_topics(10)
|
| 427 |
+
```
|
| 428 |
+
|
| 429 |
+
### Audit Logging
|
| 430 |
+
|
| 431 |
+
```python
|
| 432 |
+
# Log all pattern access (not the patterns themselves)
|
| 433 |
+
async def log_access(user_id: str, endpoint: str, timestamp: datetime):
|
| 434 |
+
await audit_log.store({
|
| 435 |
+
"user_id": user_id,
|
| 436 |
+
"endpoint": endpoint,
|
| 437 |
+
"timestamp": timestamp.isoformat(),
|
| 438 |
+
# No pattern data logged
|
| 439 |
+
})
|
| 440 |
+
```
|
| 441 |
+
|
| 442 |
+
---
|
| 443 |
+
|
| 444 |
+
## Implementation Roadmap
|
| 445 |
+
|
| 446 |
+
### Phase 1: MVP (2 weeks)
|
| 447 |
+
|
| 448 |
+
- [ ] Anonymizer with Swedish NER
|
| 449 |
+
- [ ] Basic topic extraction (keywords)
|
| 450 |
+
- [ ] Topic counter (no MnemoCore yet)
|
| 451 |
+
- [ ] Simple insights API
|
| 452 |
+
|
| 453 |
+
### Phase 2: MnemoCore Integration (2 weeks)
|
| 454 |
+
|
| 455 |
+
- [ ] Topic embedding storage in MnemoCore
|
| 456 |
+
- [ ] Semantic topic clustering
|
| 457 |
+
- [ ] Gap detection using similarity search
|
| 458 |
+
|
| 459 |
+
### Phase 3: Quality Metrics (2 weeks)
|
| 460 |
+
|
| 461 |
+
- [ ] Response quality tracking
|
| 462 |
+
- [ ] Feedback integration
|
| 463 |
+
- [ ] Quality dashboard
|
| 464 |
+
|
| 465 |
+
### Phase 4: Production Hardening (2 weeks)
|
| 466 |
+
|
| 467 |
+
- [ ] Encryption at rest
|
| 468 |
+
- [ ] Access control
|
| 469 |
+
- [ ] Audit logging
|
| 470 |
+
- [ ] Performance optimization
|
| 471 |
+
|
| 472 |
+
---
|
| 473 |
+
|
| 474 |
+
## Business Value
|
| 475 |
+
|
| 476 |
+
### For Healthcare Organizations
|
| 477 |
+
|
| 478 |
+
| Value | Metric |
|
| 479 |
+
|-------|--------|
|
| 480 |
+
| **Documentation gaps** | Know what to add to knowledge base |
|
| 481 |
+
| **Popular topics** | Prioritize documentation efforts |
|
| 482 |
+
| **Response quality** | Improve user satisfaction |
|
| 483 |
+
| **Trend analysis** | Identify emerging needs |
|
| 484 |
+
|
| 485 |
+
### For Opus Dental (Competitive Advantage)
|
| 486 |
+
|
| 487 |
+
| Advantage | Value |
|
| 488 |
+
|-----------|-------|
|
| 489 |
+
| **Continuous improvement** | Chatbot gets smarter without storing PII |
|
| 490 |
+
| **Customer insights** | Know what clinics need |
|
| 491 |
+
| **Compliance by design** | GDPR-safe from day 1 |
|
| 492 |
+
| **Unique selling point** | "Learning chatbot" vs competitors |
|
| 493 |
+
|
| 494 |
+
---
|
| 495 |
+
|
| 496 |
+
## Technical Requirements
|
| 497 |
+
|
| 498 |
+
### Dependencies
|
| 499 |
+
|
| 500 |
+
```
|
| 501 |
+
mnemocore>=4.5.0
|
| 502 |
+
spacy[sv]>=3.7.0 # Swedish NER
|
| 503 |
+
numpy>=1.24.0
|
| 504 |
+
cryptography>=41.0.0 # Encryption
|
| 505 |
+
```
|
| 506 |
+
|
| 507 |
+
### Infrastructure
|
| 508 |
+
|
| 509 |
+
- MnemoCore instance (can be shared or per-tenant)
|
| 510 |
+
- Encrypted storage (Azure SQL, PostgreSQL with TDE)
|
| 511 |
+
- Optional: Azure Key Vault for key management
|
| 512 |
+
|
| 513 |
+
### Performance
|
| 514 |
+
|
| 515 |
+
- Topic extraction: <50ms per query
|
| 516 |
+
- Insights API: <200ms
|
| 517 |
+
- Storage: ~1KB per unique topic (highly efficient)
|
| 518 |
+
|
| 519 |
+
---
|
| 520 |
+
|
| 521 |
+
## Open Questions
|
| 522 |
+
|
| 523 |
+
1. **Topic granularity:** How specific should topics be? "Implantat" vs "Implantat pris" vs "Implantat komplikationer"
|
| 524 |
+
|
| 525 |
+
2. **Trend detection:** What time window for trend analysis? 7d? 30d?
|
| 526 |
+
|
| 527 |
+
3. **Multi-language:** Support for Finnish/Norwegian in addition to Swedish?
|
| 528 |
+
|
| 529 |
+
4. **Tenant isolation:** Should patterns be shared across tenants (anonymized) or kept separate?
|
| 530 |
+
|
| 531 |
+
5. **Feedback mechanism:** How to collect ratings? Thumbs up/down? 1-5 stars?
|
| 532 |
+
|
| 533 |
+
---
|
| 534 |
+
|
| 535 |
+
## Conclusion
|
| 536 |
+
|
| 537 |
+
Pattern Learner enables **continuous improvement** of healthcare chatbots **without GDPR risk**. It learns what users ask about, which answers work, and where documentation is missing — all without storing any personal data.
|
| 538 |
+
|
| 539 |
+
**Key innovation:** Transform "memory" into "patterns" — compliance-safe learning.
|
| 540 |
+
|
| 541 |
+
---
|
| 542 |
+
|
| 543 |
+
## Next Steps
|
| 544 |
+
|
| 545 |
+
1. Review this spec
|
| 546 |
+
2. Decide on open questions
|
| 547 |
+
3. Prioritize MVP features
|
| 548 |
+
4. Start implementation
|
| 549 |
+
|
| 550 |
+
---
|
| 551 |
+
|
| 552 |
+
*Draft by Omega (GLM-5) for Robin Granberg*
|
| 553 |
+
*2026-02-20*
|
mnemocore_verify.py
CHANGED
|
@@ -42,7 +42,7 @@ def setup_test_env():
|
|
| 42 |
@pytest.mark.asyncio
|
| 43 |
async def test_text_encoder_normalization():
|
| 44 |
"""Verify BUG-02: Text normalization fixes identical string variances"""
|
| 45 |
-
encoder = TextEncoder(dimension=
|
| 46 |
hdv1 = encoder.encode("Hello World")
|
| 47 |
hdv2 = encoder.encode("hello, world!")
|
| 48 |
|
|
@@ -51,14 +51,14 @@ async def test_text_encoder_normalization():
|
|
| 51 |
def test_hnsw_singleton():
|
| 52 |
"""Verify BUG-08: HNSWIndexManager is a thread-safe singleton"""
|
| 53 |
HNSWIndexManager._instance = None
|
| 54 |
-
idx1 = HNSWIndexManager(dimension=
|
| 55 |
-
idx2 = HNSWIndexManager(dimension=
|
| 56 |
assert idx1 is idx2, "HNSWIndexManager is not a singleton"
|
| 57 |
|
| 58 |
def test_hnsw_index_add_search():
|
| 59 |
"""Verify BUG-01 & BUG-03: Vector cache lost / Position mapping"""
|
| 60 |
HNSWIndexManager._instance = None
|
| 61 |
-
idx = HNSWIndexManager(dimension=
|
| 62 |
|
| 63 |
# Optional cleanup if it's reused
|
| 64 |
idx._id_map = []
|
|
@@ -66,8 +66,8 @@ def test_hnsw_index_add_search():
|
|
| 66 |
if idx._index:
|
| 67 |
idx._index.reset()
|
| 68 |
|
| 69 |
-
vec1 = BinaryHDV.random(
|
| 70 |
-
vec2 = BinaryHDV.random(
|
| 71 |
|
| 72 |
idx.add("test_node_1", vec1.data)
|
| 73 |
idx.add("test_node_2", vec2.data)
|
|
|
|
| 42 |
@pytest.mark.asyncio
|
| 43 |
async def test_text_encoder_normalization():
|
| 44 |
"""Verify BUG-02: Text normalization fixes identical string variances"""
|
| 45 |
+
encoder = TextEncoder(dimension=16384)
|
| 46 |
hdv1 = encoder.encode("Hello World")
|
| 47 |
hdv2 = encoder.encode("hello, world!")
|
| 48 |
|
|
|
|
| 51 |
def test_hnsw_singleton():
|
| 52 |
"""Verify BUG-08: HNSWIndexManager is a thread-safe singleton"""
|
| 53 |
HNSWIndexManager._instance = None
|
| 54 |
+
idx1 = HNSWIndexManager(dimension=16384)
|
| 55 |
+
idx2 = HNSWIndexManager(dimension=16384)
|
| 56 |
assert idx1 is idx2, "HNSWIndexManager is not a singleton"
|
| 57 |
|
| 58 |
def test_hnsw_index_add_search():
|
| 59 |
"""Verify BUG-01 & BUG-03: Vector cache lost / Position mapping"""
|
| 60 |
HNSWIndexManager._instance = None
|
| 61 |
+
idx = HNSWIndexManager(dimension=16384)
|
| 62 |
|
| 63 |
# Optional cleanup if it's reused
|
| 64 |
idx._id_map = []
|
|
|
|
| 66 |
if idx._index:
|
| 67 |
idx._index.reset()
|
| 68 |
|
| 69 |
+
vec1 = BinaryHDV.random(16384)
|
| 70 |
+
vec2 = BinaryHDV.random(16384)
|
| 71 |
|
| 72 |
idx.add("test_node_1", vec1.data)
|
| 73 |
idx.add("test_node_2", vec2.data)
|
src/mnemocore/api/main.py
CHANGED
|
@@ -153,7 +153,7 @@ async def lifespan(app: FastAPI):
|
|
| 153 |
from mnemocore.core.tier_manager import TierManager
|
| 154 |
tier_manager = TierManager(config=config, qdrant_store=container.qdrant_store)
|
| 155 |
engine = HAIMEngine(
|
| 156 |
-
persist_path=
|
| 157 |
config=config,
|
| 158 |
tier_manager=tier_manager,
|
| 159 |
working_memory=container.working_memory,
|
|
@@ -574,6 +574,37 @@ async def get_stats(engine: HAIMEngine = Depends(get_engine)):
|
|
| 574 |
return await engine.get_stats()
|
| 575 |
|
| 576 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 577 |
# Rate limit info endpoint
|
| 578 |
@app.get("/rate-limits")
|
| 579 |
async def get_rate_limits():
|
|
|
|
| 153 |
from mnemocore.core.tier_manager import TierManager
|
| 154 |
tier_manager = TierManager(config=config, qdrant_store=container.qdrant_store)
|
| 155 |
engine = HAIMEngine(
|
| 156 |
+
persist_path=config.paths.memory_file,
|
| 157 |
config=config,
|
| 158 |
tier_manager=tier_manager,
|
| 159 |
working_memory=container.working_memory,
|
|
|
|
| 574 |
return await engine.get_stats()
|
| 575 |
|
| 576 |
|
| 577 |
+
# ─────────────────────────────────────────────────────────────────────────────
|
| 578 |
+
# Maintenance Endpoints
|
| 579 |
+
# ─────────────────────────────────────────────────────────────────────────────
|
| 580 |
+
|
| 581 |
+
@app.post("/maintenance/cleanup", dependencies=[Depends(get_api_key)])
|
| 582 |
+
async def cleanup_maintenance(threshold: float = 0.1, engine: HAIMEngine = Depends(get_engine)):
|
| 583 |
+
"""Remove decayed synapses and stale index nodes."""
|
| 584 |
+
await engine.cleanup_decay(threshold=threshold)
|
| 585 |
+
return {"ok": True, "message": f"Synapse cleanup triggered with threshold {threshold}"}
|
| 586 |
+
|
| 587 |
+
|
| 588 |
+
@app.post("/maintenance/consolidate", dependencies=[Depends(get_api_key)])
|
| 589 |
+
async def consolidate_maintenance(engine: HAIMEngine = Depends(get_engine)):
|
| 590 |
+
"""Trigger manual semantic consolidation pulse."""
|
| 591 |
+
if not engine._semantic_worker:
|
| 592 |
+
raise HTTPException(status_code=503, detail="Consolidation worker not initialized")
|
| 593 |
+
|
| 594 |
+
stats = await engine._semantic_worker.run_once()
|
| 595 |
+
return {"ok": True, "stats": stats}
|
| 596 |
+
|
| 597 |
+
|
| 598 |
+
@app.post("/maintenance/sweep", dependencies=[Depends(get_api_key)])
|
| 599 |
+
async def sweep_maintenance(engine: HAIMEngine = Depends(get_engine)):
|
| 600 |
+
"""Trigger manual immunology sweep."""
|
| 601 |
+
if not engine._immunology:
|
| 602 |
+
raise HTTPException(status_code=503, detail="Immunology loop not initialized")
|
| 603 |
+
|
| 604 |
+
stats = await engine._immunology.sweep()
|
| 605 |
+
return {"ok": True, "stats": stats}
|
| 606 |
+
|
| 607 |
+
|
| 608 |
# Rate limit info endpoint
|
| 609 |
@app.get("/rate-limits")
|
| 610 |
async def get_rate_limits():
|
src/mnemocore/core/binary_hdv.py
CHANGED
|
@@ -404,9 +404,8 @@ class TextEncoder:
|
|
| 404 |
Each token is bound with its position via XOR(token, permute(position_marker, i)).
|
| 405 |
All position-bound tokens are bundled via majority vote.
|
| 406 |
"""
|
| 407 |
-
#
|
| 408 |
-
|
| 409 |
-
tokens = normalized.split()
|
| 410 |
if not tokens:
|
| 411 |
return BinaryHDV.random(self.dimension)
|
| 412 |
|
|
|
|
| 404 |
Each token is bound with its position via XOR(token, permute(position_marker, i)).
|
| 405 |
All position-bound tokens are bundled via majority vote.
|
| 406 |
"""
|
| 407 |
+
# Improved Tokenization: consistent alphanumeric extraction
|
| 408 |
+
tokens = re.findall(r'\b\w+\b', text.lower())
|
|
|
|
| 409 |
if not tokens:
|
| 410 |
return BinaryHDV.random(self.dimension)
|
| 411 |
|
src/mnemocore/core/hnsw_index.py
CHANGED
|
@@ -108,10 +108,7 @@ class HNSWIndexManager:
|
|
| 108 |
self.VECTOR_PATH = data_dir / "mnemocore_hnsw_vectors.npy"
|
| 109 |
|
| 110 |
if FAISS_AVAILABLE:
|
| 111 |
-
|
| 112 |
-
self._load()
|
| 113 |
-
else:
|
| 114 |
-
self._build_flat_index()
|
| 115 |
|
| 116 |
self._initialized = True
|
| 117 |
|
|
@@ -195,7 +192,6 @@ class HNSWIndexManager:
|
|
| 195 |
return
|
| 196 |
|
| 197 |
self._maybe_upgrade_to_hnsw()
|
| 198 |
-
self._save()
|
| 199 |
|
| 200 |
def remove(self, node_id: str) -> None:
|
| 201 |
"""
|
|
@@ -234,7 +230,6 @@ class HNSWIndexManager:
|
|
| 234 |
self._vector_store = compact_vecs
|
| 235 |
self._stale_count = 0
|
| 236 |
|
| 237 |
-
self._save()
|
| 238 |
except ValueError:
|
| 239 |
pass
|
| 240 |
|
|
@@ -251,7 +246,7 @@ class HNSWIndexManager:
|
|
| 251 |
return []
|
| 252 |
|
| 253 |
# Fetch more to account for deleted (None) entries
|
| 254 |
-
k = min(top_k + self._stale_count, len(self._id_map))
|
| 255 |
if k <= 0:
|
| 256 |
return []
|
| 257 |
|
|
|
|
| 108 |
self.VECTOR_PATH = data_dir / "mnemocore_hnsw_vectors.npy"
|
| 109 |
|
| 110 |
if FAISS_AVAILABLE:
|
| 111 |
+
self._build_flat_index()
|
|
|
|
|
|
|
|
|
|
| 112 |
|
| 113 |
self._initialized = True
|
| 114 |
|
|
|
|
| 192 |
return
|
| 193 |
|
| 194 |
self._maybe_upgrade_to_hnsw()
|
|
|
|
| 195 |
|
| 196 |
def remove(self, node_id: str) -> None:
|
| 197 |
"""
|
|
|
|
| 230 |
self._vector_store = compact_vecs
|
| 231 |
self._stale_count = 0
|
| 232 |
|
|
|
|
| 233 |
except ValueError:
|
| 234 |
pass
|
| 235 |
|
|
|
|
| 246 |
return []
|
| 247 |
|
| 248 |
# Fetch more to account for deleted (None) entries
|
| 249 |
+
k = min(top_k + self._stale_count + 50, len(self._id_map))
|
| 250 |
if k <= 0:
|
| 251 |
return []
|
| 252 |
|
src/mnemocore/core/qdrant_store.py
CHANGED
|
@@ -9,6 +9,7 @@ Phase 4.3: Temporal Recall - supports time-based filtering and indexing.
|
|
| 9 |
from typing import List, Any, Optional, Tuple, Dict
|
| 10 |
from datetime import datetime
|
| 11 |
import asyncio
|
|
|
|
| 12 |
|
| 13 |
from qdrant_client import AsyncQdrantClient, models
|
| 14 |
from loguru import logger
|
|
@@ -93,39 +94,36 @@ class QdrantStore:
|
|
| 93 |
)
|
| 94 |
)
|
| 95 |
|
| 96 |
-
|
| 97 |
-
|
| 98 |
-
|
| 99 |
-
|
| 100 |
-
|
| 101 |
-
|
| 102 |
-
|
| 103 |
-
|
| 104 |
-
|
| 105 |
-
|
| 106 |
-
|
| 107 |
-
|
| 108 |
-
|
| 109 |
-
|
| 110 |
-
|
| 111 |
-
|
| 112 |
-
)
|
| 113 |
|
| 114 |
-
|
| 115 |
-
if not await self.client.collection_exists(self.collection_warm):
|
| 116 |
-
logger.info(f"Creating WARM collection: {self.collection_warm}")
|
| 117 |
await self.client.create_collection(
|
| 118 |
-
collection_name=
|
| 119 |
vectors_config=models.VectorParams(
|
| 120 |
size=self.dim,
|
| 121 |
distance=models.Distance.DOT,
|
| 122 |
-
on_disk=
|
| 123 |
),
|
| 124 |
quantization_config=quantization_config,
|
| 125 |
hnsw_config=models.HnswConfigDiff(
|
| 126 |
m=self.hnsw_m,
|
| 127 |
ef_construct=self.hnsw_ef_construct,
|
| 128 |
-
on_disk=
|
| 129 |
)
|
| 130 |
)
|
| 131 |
|
|
@@ -173,24 +171,15 @@ class QdrantStore:
|
|
| 173 |
) -> List[models.ScoredPoint]:
|
| 174 |
"""
|
| 175 |
Async semantic search.
|
| 176 |
-
|
| 177 |
-
Args:
|
| 178 |
-
collection: Collection name to search.
|
| 179 |
-
query_vector: Query embedding vector.
|
| 180 |
-
limit: Maximum number of results.
|
| 181 |
-
score_threshold: Minimum similarity score.
|
| 182 |
-
time_range: Optional (start, end) datetime tuple for temporal filtering.
|
| 183 |
-
Phase 4.3: Enables "memories from last 48 hours" queries.
|
| 184 |
-
|
| 185 |
-
Returns:
|
| 186 |
-
List of scored points (empty list on errors).
|
| 187 |
-
|
| 188 |
-
Note:
|
| 189 |
-
This method returns an empty list on errors rather than raising,
|
| 190 |
-
as search failures should not crash the calling code.
|
| 191 |
"""
|
| 192 |
try:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 193 |
must_conditions = []
|
|
|
|
| 194 |
if time_range:
|
| 195 |
start_ts = int(time_range[0].timestamp())
|
| 196 |
end_ts = int(time_range[1].timestamp())
|
|
@@ -227,15 +216,32 @@ class QdrantStore:
|
|
| 227 |
)
|
| 228 |
)
|
| 229 |
|
| 230 |
-
|
| 231 |
self.client.query_points,
|
| 232 |
collection_name=collection,
|
| 233 |
-
query=
|
| 234 |
limit=limit,
|
| 235 |
-
score_threshold=score_threshold,
|
| 236 |
query_filter=query_filter,
|
| 237 |
search_params=search_params,
|
| 238 |
)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 239 |
except CircuitOpenError:
|
| 240 |
logger.warning(f"Qdrant search blocked for {collection}: circuit breaker open")
|
| 241 |
return []
|
|
|
|
| 9 |
from typing import List, Any, Optional, Tuple, Dict
|
| 10 |
from datetime import datetime
|
| 11 |
import asyncio
|
| 12 |
+
import numpy as np
|
| 13 |
|
| 14 |
from qdrant_client import AsyncQdrantClient, models
|
| 15 |
from loguru import logger
|
|
|
|
| 94 |
)
|
| 95 |
)
|
| 96 |
|
| 97 |
+
for collection_name, on_disk in [
|
| 98 |
+
(self.collection_hot, False),
|
| 99 |
+
(self.collection_warm, True)
|
| 100 |
+
]:
|
| 101 |
+
if await self.client.collection_exists(collection_name):
|
| 102 |
+
# Check for distance mismatch (Phase 4.5 alignment)
|
| 103 |
+
info = await self.client.get_collection(collection_name)
|
| 104 |
+
current_distance = info.config.params.vectors.distance
|
| 105 |
+
if current_distance != models.Distance.DOT:
|
| 106 |
+
logger.warning(
|
| 107 |
+
f"Collection {collection_name} has distance {current_distance}, "
|
| 108 |
+
f"but DOT is required. Recreating collection."
|
| 109 |
+
)
|
| 110 |
+
await self.client.delete_collection(collection_name)
|
| 111 |
+
else:
|
| 112 |
+
continue
|
|
|
|
| 113 |
|
| 114 |
+
logger.info(f"Creating collection: {collection_name} (DOT)")
|
|
|
|
|
|
|
| 115 |
await self.client.create_collection(
|
| 116 |
+
collection_name=collection_name,
|
| 117 |
vectors_config=models.VectorParams(
|
| 118 |
size=self.dim,
|
| 119 |
distance=models.Distance.DOT,
|
| 120 |
+
on_disk=on_disk
|
| 121 |
),
|
| 122 |
quantization_config=quantization_config,
|
| 123 |
hnsw_config=models.HnswConfigDiff(
|
| 124 |
m=self.hnsw_m,
|
| 125 |
ef_construct=self.hnsw_ef_construct,
|
| 126 |
+
on_disk=on_disk
|
| 127 |
)
|
| 128 |
)
|
| 129 |
|
|
|
|
| 171 |
) -> List[models.ScoredPoint]:
|
| 172 |
"""
|
| 173 |
Async semantic search.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 174 |
"""
|
| 175 |
try:
|
| 176 |
+
# Transform query to bipolar if it's (0, 1) (Phase 4.5)
|
| 177 |
+
q_vec = np.array(query_vector)
|
| 178 |
+
if np.all((q_vec == 0) | (q_vec == 1)):
|
| 179 |
+
q_vec = q_vec * 2.0 - 1.0
|
| 180 |
+
|
| 181 |
must_conditions = []
|
| 182 |
+
query_filter = None
|
| 183 |
if time_range:
|
| 184 |
start_ts = int(time_range[0].timestamp())
|
| 185 |
end_ts = int(time_range[1].timestamp())
|
|
|
|
| 216 |
)
|
| 217 |
)
|
| 218 |
|
| 219 |
+
response = await qdrant_breaker.call(
|
| 220 |
self.client.query_points,
|
| 221 |
collection_name=collection,
|
| 222 |
+
query=q_vec.tolist(),
|
| 223 |
limit=limit,
|
|
|
|
| 224 |
query_filter=query_filter,
|
| 225 |
search_params=search_params,
|
| 226 |
)
|
| 227 |
+
|
| 228 |
+
# Normalize scores to [0, 1] range (Phase 4.5)
|
| 229 |
+
# For Bipolar DOT, score is in range [-D, D].
|
| 230 |
+
# Similarity = (score + D) / (2 * D) = 0.5 + (score / 2D)
|
| 231 |
+
normalized_points = []
|
| 232 |
+
for hit in response.points:
|
| 233 |
+
sim = 0.5 + (hit.score / (2.0 * self.dim))
|
| 234 |
+
|
| 235 |
+
normalized_points.append(
|
| 236 |
+
models.ScoredPoint(
|
| 237 |
+
id=hit.id,
|
| 238 |
+
version=hit.version,
|
| 239 |
+
score=float(np.clip(sim, 0.0, 1.0)),
|
| 240 |
+
payload=hit.payload,
|
| 241 |
+
vector=hit.vector
|
| 242 |
+
)
|
| 243 |
+
)
|
| 244 |
+
return normalized_points
|
| 245 |
except CircuitOpenError:
|
| 246 |
logger.warning(f"Qdrant search blocked for {collection}: circuit breaker open")
|
| 247 |
return []
|
src/mnemocore/core/subconscious_ai.py
CHANGED
|
@@ -545,17 +545,19 @@ class SubconsciousAIWorker:
|
|
| 545 |
|
| 546 |
# Build prompt for categorization
|
| 547 |
memories_text = "\n".join([
|
| 548 |
-
f"
|
| 549 |
for i, m in enumerate(unsorted[:5])
|
| 550 |
])
|
| 551 |
|
| 552 |
-
prompt = f"""
|
|
|
|
|
|
|
| 553 |
|
| 554 |
-
|
| 555 |
-
{
|
| 556 |
|
| 557 |
-
|
| 558 |
-
{
|
| 559 |
|
| 560 |
response = await self._model_client.generate(prompt, max_tokens=512)
|
| 561 |
output = {"raw_response": response}
|
|
@@ -618,11 +620,12 @@ Return JSON format:
|
|
| 618 |
"""
|
| 619 |
t_start = time.monotonic()
|
| 620 |
|
| 621 |
-
# Find memories with low LTP (weak connections)
|
| 622 |
-
recent = await self.engine.tier_manager.get_hot_recent(
|
|
|
|
| 623 |
weak_memories = [
|
| 624 |
m for m in recent
|
| 625 |
-
if m.ltp_strength < 0.5 and not m.metadata.get("dream_analyzed")
|
| 626 |
][:self.cfg.max_memories_per_cycle]
|
| 627 |
|
| 628 |
if not weak_memories:
|
|
@@ -638,17 +641,18 @@ Return JSON format:
|
|
| 638 |
|
| 639 |
# Build prompt for semantic bridging
|
| 640 |
memories_text = "\n".join([
|
| 641 |
-
f"
|
| 642 |
for i, m in enumerate(weak_memories[:5])
|
| 643 |
])
|
| 644 |
|
| 645 |
-
prompt = f"""
|
|
|
|
| 646 |
|
| 647 |
-
|
| 648 |
-
{
|
| 649 |
|
| 650 |
-
|
| 651 |
-
|
| 652 |
|
| 653 |
response = await self._model_client.generate(prompt, max_tokens=512)
|
| 654 |
output = {"raw_response": response}
|
|
|
|
| 545 |
|
| 546 |
# Build prompt for categorization
|
| 547 |
memories_text = "\n".join([
|
| 548 |
+
f"ID {i+1}: {m.content[:200]}"
|
| 549 |
for i, m in enumerate(unsorted[:5])
|
| 550 |
])
|
| 551 |
|
| 552 |
+
prompt = f"""You are a memory sorting agent for Veristate Systems.
|
| 553 |
+
Categorize these 5 memories into exactly 2 categories from this set: [Market Dynamics, Structural Integrity, Human Entropy, Digital Ascension, Strategic Intent].
|
| 554 |
+
Output ONLY a valid JSON object. No explanation. No markdown.
|
| 555 |
|
| 556 |
+
Format:
|
| 557 |
+
{{"categories": ["category1", "category2"], "memory_tags": {{"1": ["tagA"], "2": ["tagB"], "3": ["tagC"], "4": ["tagD"], "5": ["tagE"]}}}}
|
| 558 |
|
| 559 |
+
Memories:
|
| 560 |
+
{memories_text}"""
|
| 561 |
|
| 562 |
response = await self._model_client.generate(prompt, max_tokens=512)
|
| 563 |
output = {"raw_response": response}
|
|
|
|
| 620 |
"""
|
| 621 |
t_start = time.monotonic()
|
| 622 |
|
| 623 |
+
# Find memories with low LTP (weak connections) or unanalyzed
|
| 624 |
+
recent = await self.engine.tier_manager.get_hot_recent(50)
|
| 625 |
+
# Phase 4.5 Tuning: Allow dreaming on nodes with LTP <= 0.5 (new nodes)
|
| 626 |
weak_memories = [
|
| 627 |
m for m in recent
|
| 628 |
+
if m.ltp_strength <= 0.5 and not m.metadata.get("dream_analyzed")
|
| 629 |
][:self.cfg.max_memories_per_cycle]
|
| 630 |
|
| 631 |
if not weak_memories:
|
|
|
|
| 641 |
|
| 642 |
# Build prompt for semantic bridging
|
| 643 |
memories_text = "\n".join([
|
| 644 |
+
f"ID {i+1}: {m.content[:150]}"
|
| 645 |
for i, m in enumerate(weak_memories[:5])
|
| 646 |
])
|
| 647 |
|
| 648 |
+
prompt = f"""You are a semantic analysis agent. Suggest connection keywords for these 5 memories.
|
| 649 |
+
Output ONLY a valid JSON object. No explanation. No markdown.
|
| 650 |
|
| 651 |
+
Format:
|
| 652 |
+
{{"bridges": {{"1": ["keyword1"], "2": ["keyword2"], "3": ["keyword3"], "4": ["keyword4"], "5": ["keyword5"]}}}}
|
| 653 |
|
| 654 |
+
Memories:
|
| 655 |
+
{memories_text}"""
|
| 656 |
|
| 657 |
response = await self._model_client.generate(prompt, max_tokens=512)
|
| 658 |
output = {"raw_response": response}
|
src/mnemocore/core/tier_manager.py
CHANGED
|
@@ -177,6 +177,10 @@ class TierManager:
|
|
| 177 |
"""Add a new memory node. New memories are always HOT initially."""
|
| 178 |
node.tier = "hot"
|
| 179 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 180 |
# Phase 1: Add to HOT tier under lock (no I/O)
|
| 181 |
victim_to_evict = None
|
| 182 |
async with self.lock:
|
|
@@ -444,9 +448,9 @@ class TierManager:
|
|
| 444 |
try:
|
| 445 |
from qdrant_client import models
|
| 446 |
|
| 447 |
-
# Unpack binary vector for Qdrant storage
|
| 448 |
bits = np.unpackbits(node.hdv.data)
|
| 449 |
-
vector = bits.astype(float).tolist()
|
| 450 |
|
| 451 |
point = models.PointStruct(
|
| 452 |
id=node.id,
|
|
|
|
| 177 |
"""Add a new memory node. New memories are always HOT initially."""
|
| 178 |
node.tier = "hot"
|
| 179 |
|
| 180 |
+
# Delta 67.4: Ensure mutual exclusion.
|
| 181 |
+
# If the node exists in WARM, remove it before adding to HOT.
|
| 182 |
+
await self._delete_from_warm(node.id)
|
| 183 |
+
|
| 184 |
# Phase 1: Add to HOT tier under lock (no I/O)
|
| 185 |
victim_to_evict = None
|
| 186 |
async with self.lock:
|
|
|
|
| 448 |
try:
|
| 449 |
from qdrant_client import models
|
| 450 |
|
| 451 |
+
# Unpack binary vector for Qdrant storage (Bipolar Phase 4.5)
|
| 452 |
bits = np.unpackbits(node.hdv.data)
|
| 453 |
+
vector = (bits.astype(float) * 2.0 - 1.0).tolist()
|
| 454 |
|
| 455 |
point = models.PointStruct(
|
| 456 |
id=node.id,
|
sync_qdrant.py
ADDED
|
@@ -0,0 +1,50 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
"""One-off maintenance script: sync HOT-tier memories into Qdrant (WARM).

Boots the MnemoCore engine, reloads the legacy ``memory.jsonl`` log when
the HOT tier is empty, then pushes every HOT node to Qdrant via
``TierManager._save_to_warm``.
"""
import asyncio
import os
import sys

# Path setup MUST happen before the mnemocore imports below; in the
# original it ran inside the __main__ guard, after the imports had
# already failed for a src/ layout checkout.
sys.path.insert(0, os.path.join(os.getcwd(), "src"))

from mnemocore.core.config import get_config
from mnemocore.core.container import build_container
from mnemocore.core.engine import HAIMEngine
from mnemocore.core.tier_manager import TierManager


async def sync_qdrant() -> None:
    """Push every HOT-tier node into the Qdrant WARM collection."""
    config = get_config()
    container = build_container(config)

    # TierManager needs the Qdrant store so _save_to_warm can upsert points.
    tier_manager = TierManager(config=config, qdrant_store=container.qdrant_store)

    engine = HAIMEngine(
        config=config,
        tier_manager=tier_manager,
        working_memory=container.working_memory,
        episodic_store=container.episodic_store,
        semantic_store=container.semantic_store,
    )
    await engine.initialize()
    try:
        print(f"Engine initialized. Memories in HOT: {len(engine.tier_manager.hot)}")

        # Force sync from memory.jsonl if HOT is empty.
        if not engine.tier_manager.hot:
            print("HOT is empty, reloading from legacy log...")
            await engine._load_legacy_if_needed()
            print(f"Memories after reload: {len(engine.tier_manager.hot)}")

        # Consolidation would eventually move nodes to WARM (Qdrant);
        # here we call _save_to_warm directly for every node instead.
        # Snapshot via list() so concurrent tier moves can't mutate the
        # dict mid-iteration.
        print("Syncing nodes to Qdrant...")
        count = 0
        for node in list(engine.tier_manager.hot.values()):
            await engine.tier_manager._save_to_warm(node)
            count += 1
            if count % 100 == 0:
                print(f"Synced {count} nodes...")

        print(f"Total synced: {count}")
    finally:
        # Always release engine resources, even if a sync step raised.
        await engine.close()


if __name__ == "__main__":
    asyncio.run(sync_qdrant())
|
test_qdrant_scores.py
ADDED
|
@@ -0,0 +1,61 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
"""Diagnostic script: verify Qdrant search score normalization.

Boots a ``QdrantStore``, fetches one point from the WARM collection and
queries with that point's own vector — first via the raw client
(unnormalized scores), then via ``QdrantStore.search`` (which is
expected to normalize scores) — and prints both result sets.
"""
import asyncio
import os
import sys

# Path setup MUST happen before the mnemocore imports below; in the
# original it ran inside the __main__ guard, after the imports had
# already failed for a src/ layout checkout.
sys.path.insert(0, os.path.join(os.getcwd(), "src"))

from mnemocore.core.config import get_config
from mnemocore.core.qdrant_store import QdrantStore


async def test_qdrant_scores() -> None:
    """Print raw vs. store-level similarity scores for a self-query."""
    config = get_config()
    store = QdrantStore(
        url=config.qdrant.url,
        api_key=None,
        dimensionality=config.dimensionality,
    )

    print("Ensuring collections (Migration Check)...")
    await store.ensure_collections()

    print(f"Searching {config.qdrant.collection_warm}...")
    try:
        info = await store.get_collection_info(config.qdrant.collection_warm)
        print(f"Collection Info: {info}")

        # Fetch one stored point (with its vector) to use as the query.
        scroll_res = await store.scroll(
            config.qdrant.collection_warm, limit=1, with_vectors=True
        )
        points = scroll_res[0]
        if not points:
            print("No points found in collection.")
            return

        target_vec = points[0].vector
        target_id = points[0].id
        print(f"Target Point: ID={target_id}")

        # Raw client search (no search_params): scores come back as
        # whatever the collection's distance metric produces.
        response = await store.client.query_points(
            collection_name=config.qdrant.collection_warm,
            query=target_vec,
            limit=3,
        )
        hits = response.points
        print(f"Basic Search Hits count: {len(hits)}")
        for i, hit in enumerate(hits):
            print(f"Hit {i}: ID={hit.id}, Score={hit.score}")

        # Store-level search: compare these scores against the raw ones.
        hits = await store.search(config.qdrant.collection_warm, target_vec, limit=3)
        print(f"Store Search Hits count: {len(hits)}")
        for i, hit in enumerate(hits):
            print(f"Hit {i}: ID={hit.id}, Score={hit.score}")
    except Exception as e:  # diagnostic tool: report the failure, don't crash silently
        import traceback

        traceback.print_exc()
        print(f"Error: {e}")
    finally:
        await store.close()


if __name__ == "__main__":
    asyncio.run(test_qdrant_scores())
|