Noo88ear committed on
Commit
7498f2c
·
0 Parent(s):

🚀 Initial deployment of Multi-Agent Job Application Assistant


✅ Features:
- Gemini 2.5 Flash AI generation across all agents
- A2A Protocol for agent communication
- Advanced AI capabilities: Parallel processing, Temporal tracking, Observability
- Resume & Cover Letter generation with ATS optimization
- Job matching and aggregation from multiple sources
- Document export: Word, PowerPoint, Excel
- MCP server integration for tool interoperability
- Comprehensive agent ecosystem with 15+ specialized agents

πŸ› οΈ Technical Stack:
- Gradio with MCP support for web interface
- Google GenerativeAI for LLM operations
- Multi-source job integration (Adzuna, JobSpy, etc.)
- Document generation libraries (python-docx, python-pptx, openpyxl)
- Advanced memory and context management

🎯 Production Ready:
- Environment-based configuration
- Robust error handling and fallbacks
- Comprehensive testing and validation
- Enterprise-grade architecture

This view is limited to 50 files because it contains too many changes. See raw diff
Files changed (50)
  1. .env.example +12 -0
  2. README.md +439 -0
  3. agents/__init__.py +1 -0
  4. agents/__pycache__/__init__.cpython-313.pyc +0 -0
  5. agents/__pycache__/context_engineer.cpython-313.pyc +0 -0
  6. agents/__pycache__/context_scaler.cpython-313.pyc +0 -0
  7. agents/__pycache__/cover_letter_agent.cpython-313.pyc +0 -0
  8. agents/__pycache__/cv_owner.cpython-313.pyc +0 -0
  9. agents/__pycache__/guidelines.cpython-313.pyc +0 -0
  10. agents/__pycache__/job_agent.cpython-313.pyc +0 -0
  11. agents/__pycache__/linkedin_manager.cpython-313.pyc +0 -0
  12. agents/__pycache__/observability.cpython-313.pyc +0 -0
  13. agents/__pycache__/orchestrator.cpython-313.pyc +0 -0
  14. agents/__pycache__/parallel_executor.cpython-313.pyc +0 -0
  15. agents/__pycache__/pipeline.cpython-313.pyc +0 -0
  16. agents/__pycache__/profile_agent.cpython-313.pyc +0 -0
  17. agents/__pycache__/router_agent.cpython-313.pyc +0 -0
  18. agents/__pycache__/temporal_tracker.cpython-313.pyc +0 -0
  19. agents/a2a_cv_owner.py +356 -0
  20. agents/context_engineer.py +540 -0
  21. agents/context_scaler.py +504 -0
  22. agents/cover_letter_agent.py +143 -0
  23. agents/cv_owner.py +441 -0
  24. agents/guidelines.py +257 -0
  25. agents/job_agent.py +29 -0
  26. agents/linkedin_manager.py +120 -0
  27. agents/observability.py +431 -0
  28. agents/orchestrator.py +232 -0
  29. agents/parallel_executor.py +425 -0
  30. agents/pipeline.py +205 -0
  31. agents/profile_agent.py +39 -0
  32. agents/router_agent.py +18 -0
  33. agents/temporal_tracker.py +464 -0
  34. app.py +65 -0
  35. hf_app.py +1613 -0
  36. mcp/__init__.py +1 -0
  37. mcp/__pycache__/__init__.cpython-313.pyc +0 -0
  38. mcp/__pycache__/cover_letter_server.cpython-313.pyc +0 -0
  39. mcp/__pycache__/cv_owner_server.cpython-313.pyc +0 -0
  40. mcp/__pycache__/orchestrator_server.cpython-313.pyc +0 -0
  41. mcp/__pycache__/server_common.cpython-313.pyc +0 -0
  42. mcp/cover_letter_server.py +27 -0
  43. mcp/cv_owner_server.py +27 -0
  44. mcp/orchestrator_server.py +31 -0
  45. mcp/server_common.py +25 -0
  46. memory/__init__.py +1 -0
  47. memory/__pycache__/__init__.cpython-313.pyc +0 -0
  48. memory/__pycache__/store.cpython-313.pyc +0 -0
  49. memory/data/anthony_test__capco_lead_ai_2024__cover_letter.json +9 -0
  50. memory/data/anthony_test__capco_lead_ai_2024__cv_owner.json +45 -0
.env.example ADDED
@@ -0,0 +1,12 @@
# API Keys (required to enable the respective provider)
ANTHROPIC_API_KEY="your_anthropic_api_key_here"    # Required. Format: sk-ant-api03-...
PERPLEXITY_API_KEY="your_perplexity_api_key_here"  # Optional. Format: pplx-...
OPENAI_API_KEY="your_openai_api_key_here"          # Optional, for OpenAI models. Format: sk-proj-...
GOOGLE_API_KEY="your_google_api_key_here"          # Optional, for Google Gemini models.
MISTRAL_API_KEY="your_mistral_key_here"            # Optional, for Mistral AI models.
XAI_API_KEY="your_xai_key_here"                    # Optional, for xAI models.
GROQ_API_KEY="your_groq_key_here"                  # Optional, for Groq models.
OPENROUTER_API_KEY="your_openrouter_key_here"      # Optional, for OpenRouter models.
AZURE_OPENAI_API_KEY="your_azure_key_here"         # Optional, for Azure OpenAI models (requires endpoint in .taskmaster/config.json).
OLLAMA_API_KEY="your_ollama_api_key_here"          # Optional, for remote Ollama servers that require authentication.
GITHUB_API_KEY="your_github_api_key_here"          # Optional, for GitHub import/export features. Format: ghp_... or github_pat_...
README.md ADDED
@@ -0,0 +1,439 @@
## Multi‑Agent Job Application Assistant (Streamlit + Gradio/Hugging Face)

A production‑ready system to discover jobs, generate ATS‑optimized resumes and cover letters, and export documents to Word/PowerPoint/Excel. Includes secure LinkedIn OAuth (optional), multi‑source job aggregation, Gemini‑powered generation, and advanced agent capabilities (parallelism, temporal tracking, observability, context engineering).

---

### What you get
- **Two UIs**: Streamlit (`app.py`) and Gradio/HF (`hf_app.py`)
- **LinkedIn OAuth 2.0** (optional; CSRF‑safe state validation)
- **Job aggregation**: Adzuna (5k/month) plus resilient fallbacks
- **ATS‑optimized drafting**: resumes + cover letters (Gemini)
- **Office exports**:
  - Word resumes and cover letters (5 templates)
  - PowerPoint CV (4 templates)
  - Excel application tracker (5 analytical sheets)
- **Advanced agents**: parallel execution, temporal memory, observability/tracing, and context engineering/flywheel
- **LangExtract integration**: structured extraction with a Gemini key; robust regex fallback in constrained environments
- **New**: Router pipeline, Temporal KG integration, parallel‑agents demo, HF minimal Space branch
- **New (Aug 2025)**: UK resume rules, action‑verb upgrades, anti‑buzzword scrub, skills proficiency, remote readiness, Muse/Reed/Novorésumé/StandOut CV checklists, and interactive output controls (exact length, cycles, layout presets)

---

## Quickstart

### 1) Environment (.env)
Create a UTF‑8 `.env` (values are optional if you want mock mode). See `.env.example` for the full list of variables:
```ini
# Behavior
MOCK_MODE=true
PORT=7860

# LLM / Research
LLM_PROVIDER=gemini
LLM_MODEL=gemini-2.5-flash
GEMINI_API_KEY=
# Optional per-agent Gemini keys
GEMINI_API_KEY_CV=
GEMINI_API_KEY_COVER=
GEMINI_API_KEY_CHAT=
GEMINI_API_KEY_PARSER=
GEMINI_API_KEY_MATCH=
GEMINI_API_KEY_TAILOR=
OPENAI_API_KEY=
ANTHROPIC_API_KEY=

TAVILY_API_KEY=

# Job APIs
ADZUNA_APP_ID=
ADZUNA_APP_KEY=

# Office MCP (optional)
POWERPOINT_MCP_URL=http://localhost:3000
WORD_MCP_URL=http://localhost:3001
EXCEL_MCP_URL=http://localhost:3002

# LangExtract uses the GEMINI key by default
LANGEXTRACT_API_KEY=
```

Hardcoded keys have been removed from utility scripts. Use `switch_api_key.py` to write keys into `.env` safely, without embedding them in code.

### 2) Install
- Windows PowerShell
```powershell
python -m venv .venv
.\.venv\Scripts\Activate.ps1
pip install -r requirements.txt
```
- Linux/macOS
```bash
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```

### 3) Run the apps
- Streamlit (PATH‑safe)
```powershell
python -m streamlit run app.py --server.port 8501
```
- Gradio / Hugging Face (avoid port conflicts)
```powershell
$env:PORT=7861; python hf_app.py
```
```bash
PORT=7861 python hf_app.py
```
The HF app binds to `0.0.0.0:$PORT`.
---

## 📊 System Architecture Overview

This is a **production-ready, multi-agent job application system** with sophisticated AI capabilities and enterprise-grade features:

### 🏗️ Core Architecture

#### **Dual Interface Design**
- **Streamlit Interface** (`app.py`) - Traditional web application for desktop use
- **Gradio/HF Interface** (`hf_app.py`) - Modern, mobile-friendly, deployable to Hugging Face Spaces

#### **Multi-Agent System** (15 Specialized Agents)

**Core Processing Agents:**
- **`OrchestratorAgent`** - Central coordinator managing workflow and job orchestration
- **`CVOwnerAgent`** - ATS-optimized resume generation with UK-specific formatting rules
- **`CoverLetterAgent`** - Personalized cover letter generation with keyword optimization
- **`ProfileAgent`** - Intelligent CV parsing and structured profile extraction
- **`JobAgent`** - Job posting analysis and requirement extraction
- **`RouterAgent`** - Dynamic routing based on payload state and workflow stage

**Advanced AI Agents:**
- **`ParallelExecutor`** - Concurrent processing for 3-5x faster multi-job handling
- **`TemporalTracker`** - Time-stamped application history and pattern analysis
- **`ObservabilityAgent`** - Real-time tracing, metrics collection, and monitoring
- **`ContextEngineer`** - Flywheel learning and context optimization
- **`ContextScaler`** - L1/L2/L3 memory management for scalable context handling
- **`LinkedInManager`** - OAuth 2.0 integration and profile synchronization
- **`MetaAgent`** - Combines outputs from multiple specialized analysis agents
- **`TriageAgent`** - Intelligent task prioritization and routing

#### **Guidelines Enforcement System** (`agents/guidelines.py`)
A comprehensive rule engine ensuring document quality:
- **UK Compliance**: British English, UK date formats (MMM YYYY), £ currency normalization
- **ATS Optimization**: Plain text formatting, keyword density, section structure
- **Content Quality**: Anti-buzzword filtering, action verb strengthening, first-person removal
- **Layout Rules**: Exact length enforcement, heading validation, bullet point formatting

### 🔌 Integration Ecosystem

#### **LLM Integration** (`services/llm.py`)
- **Multi-Provider Support**: OpenAI, Anthropic Claude, Google Gemini
- **Per-Agent API Keys**: Cost optimization through agent-specific key allocation
- **Intelligent Fallbacks**: Graceful degradation when providers are unavailable
- **Configurable Models**: Per-agent model selection for optimal performance/cost

#### **Job Aggregation** (`services/job_aggregator.py`, `services/jobspy_client.py`)
- **Primary Sources**: Adzuna API (5,000 jobs/month free tier)
- **JobSpy Integration**: Indeed, LinkedIn, Glassdoor aggregation
- **Additional APIs**: Remotive, The Muse, GitHub Jobs
- **Smart Deduplication**: Title + company matching with fuzzy logic
- **SSL Bypass**: Automatic retry for corporate environments
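The deduplication step can be sketched roughly as below. This is an illustrative stand-in, not the actual code in `services/job_aggregator.py`; the `job_key` shape and the 0.85 similarity threshold are assumptions.

```python
from difflib import SequenceMatcher

def job_key(job: dict) -> str:
    # Normalize title + company into one comparable string
    return f"{job['title'].lower().strip()}|{job['company'].lower().strip()}"

def is_duplicate(a: dict, b: dict, threshold: float = 0.85) -> bool:
    # Fuzzy match: near-identical keys count as the same posting
    return SequenceMatcher(None, job_key(a), job_key(b)).ratio() >= threshold

def dedupe(jobs: list[dict]) -> list[dict]:
    kept: list[dict] = []
    for job in jobs:
        if not any(is_duplicate(job, seen) for seen in kept):
            kept.append(job)
    return kept

jobs = [
    {"title": "Lead AI Engineer", "company": "Capco"},
    {"title": "Lead AI  Engineer", "company": "Capco"},  # near-duplicate (extra space)
    {"title": "Data Engineer", "company": "Acme Corp"},
]
unique = dedupe(jobs)
```

The same idea extends to fuzzier signals (location, normalized salary bands) if title + company alone over-merges.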

#### **Document Generation** (`services/`)
- **Word Documents** (`word_cv.py`): 5 professional templates, MCP server integration
- **PowerPoint CVs** (`powerpoint_cv.py`): 4 visual templates for presentations
- **Excel Trackers** (`excel_tracker.py`): 5 analytical sheets with metrics
- **PDF Export**: Cross-platform compatibility with formatting preservation

### 📈 Advanced Features

#### **Pipeline Architecture** (`agents/pipeline.py`)
```
User Input → Router → Profile Analysis → Job Analysis → Resume Generation → Cover Letter → Review → Memory Storage
               ↓            ↓                 ↓                ↓                 ↓           ↓
           Event Log   Profile Cache     Job Cache      Document Cache     Metrics Log  Temporal KG
```

#### **Memory & Persistence**
- **File-backed Storage** (`memory/store.py`): Atomic writes, thread-safe operations
- **Temporal Knowledge Graph**: Application tracking with time-stamped relationships
- **Event Sourcing** (`events.jsonl`): Complete audit trail of all agent actions
- **Caching System** (`utils/cache.py`): TTL-based caching with automatic eviction
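A TTL cache of the kind described above can be sketched in a few lines. The class name and API here are illustrative, not the actual interface of `utils/cache.py`; expired entries are evicted lazily on access.

```python
import time

class TTLCache:
    """Minimal sketch: entries expire ttl seconds after being set."""

    def __init__(self, ttl: float = 60.0):
        self.ttl = ttl
        self._store: dict = {}  # key -> (expires_at, value)

    def set(self, key, value) -> None:
        self._store[key] = (time.monotonic() + self.ttl, value)

    def get(self, key, default=None):
        entry = self._store.get(key)
        if entry is None:
            return default
        expires_at, value = entry
        if time.monotonic() >= expires_at:
            del self._store[key]  # automatic eviction on access
            return default
        return value

cache = TTLCache(ttl=0.05)
cache.set("profile:anthony", {"name": "Anthony"})
hit = cache.get("profile:anthony")   # fresh entry
time.sleep(0.06)
miss = cache.get("profile:anthony")  # expired by now
```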

#### **LangExtract Integration** (`services/langextract_service.py`)
- **Structured Extraction**: Job requirements, skills, company culture
- **ATS Optimization**: Keyword extraction and scoring
- **Fallback Mechanisms**: Regex-based extraction when the API is unavailable
- **Result Caching**: Performance optimization for repeated analyses

### 🛡️ Security & Configuration

#### **Authentication & Security**
- **OAuth 2.0**: LinkedIn integration with CSRF protection
- **Input Sanitization**: Path traversal and injection prevention
- **Environment Isolation**: Secrets management via `.env`
- **Rate Limiting**: API throttling and abuse prevention

#### **Configuration Management**
- **Environment Variables**: All sensitive data in `.env`
- **Agent Configuration** (`utils/config.py`): Centralized settings
- **Template System**: Customizable document templates
- **Feature Flags**: Progressive enhancement based on available services

### 📁 Project Structure

```
2096955/
├── agents/                    # Multi-agent system components
│   ├── orchestrator.py        # Main orchestration logic
│   ├── cv_owner.py            # Resume generation with guidelines
│   ├── guidelines.py          # UK rules and ATS optimization
│   ├── pipeline.py            # Application pipeline flow
│   └── ...                    # Additional specialized agents
├── services/                  # External integrations and services
│   ├── llm.py                 # Multi-provider LLM client
│   ├── job_aggregator.py      # Job source aggregation
│   ├── word_cv.py             # Word document generation
│   └── ...                    # Document and API services
├── utils/                     # Utility functions and helpers
│   ├── ats.py                 # ATS scoring and optimization
│   ├── cache.py               # TTL caching system
│   ├── consistency.py         # Contradiction detection
│   └── ...                    # Text processing and helpers
├── models/                    # Data models and schemas
│   └── schemas.py             # Pydantic models for type safety
├── mcp/                       # Model Context Protocol servers
│   ├── cv_owner_server.py
│   ├── cover_letter_server.py
│   └── orchestrator_server.py
├── memory/                    # Persistent storage
│   ├── store.py               # File-backed memory store
│   └── data/                  # Application state and history
├── app.py                     # Streamlit interface
├── hf_app.py                  # Gradio/HF interface
└── api_llm_integration.py     # REST API endpoints
```

### 🚀 Performance Optimizations

- **Parallel Processing**: Async job handling with `asyncio` and `nest_asyncio`
- **Lazy Loading**: Dependencies loaded only when needed
- **Smart Caching**: Multi-level caching (memory, file, API responses)
- **Batch Operations**: Efficient multi-job processing
- **Event-Driven**: Asynchronous event handling for responsiveness

### 🧪 Testing & Quality

- **Test Suites**: Comprehensive tests in the `tests/` directory
- **Integration Tests**: API and service integration validation
- **Mock Mode**: Development without API keys
- **Smoke Tests**: Quick validation scripts for deployment
- **Observability**: Built-in tracing and metrics collection

---

## Router pipeline (User → Router → Profile → Job → Resume → Cover → Review)
- Implemented in `agents/pipeline.py` and exposed via API in `api_llm_integration.py` (`/api/llm/pipeline_run`).
- Agents:
  - `RouterAgent`: routes based on payload state
  - `ProfileAgent`: parses the CV into a structured profile (LLM with fallback)
  - `JobAgent`: analyzes the job posting (LLM with fallback)
  - `CVOwnerAgent` and `CoverLetterAgent`: draft documents (Gemini, per-agent keys)
  - Review: contradiction checks and memory persistence
- Temporal tracking: on review, a `drafted` status is recorded in the temporal KG with issues metadata.

**Flow diagram**
```mermaid
flowchart TD
    U["User"] --> R["RouterAgent"]
    R -->|cv_text present| P["ProfileAgent (LLM)"]
    R -->|job_posting present| J["JobAgent (LLM)"]
    P --> RESUME["CVOwnerAgent"]
    J --> RESUME
    RESUME --> COVER["CoverLetterAgent"]
    COVER --> REVIEW["Orchestrator Review"]
    REVIEW --> M["MemoryStore (file-backed)"]
    REVIEW --> TKG["Temporal KG (triplets)"]
    subgraph LLM["LLM Client (Gemini 2.5 Flash, per-agent keys)"]
        P
        J
        RESUME
        COVER
    end
    subgraph UI["Gradio (HF)"]
        U
    end
    subgraph API["Flask API"]
        PR["/api/llm/pipeline_run"]
    end
    U -. optional .-> PR
```
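The routing decision at the top of this flow is essentially a check on which payload fields are already filled. The function below is a minimal sketch of that idea, not the actual `RouterAgent` implementation; the field names are assumptions based on the pipeline described here.

```python
def route(payload: dict) -> str:
    """Pick the next pipeline stage from the payload state (illustrative)."""
    if payload.get("cv_text") and "profile" not in payload:
        return "profile"        # CV text present but not yet parsed
    if payload.get("job_posting") and "job" not in payload:
        return "job"            # job posting present but not yet analyzed
    if "resume" not in payload:
        return "resume"         # draft the resume next
    if "cover_letter" not in payload:
        return "cover"          # then the cover letter
    return "review"             # everything drafted: run review + persist

stage = route({"cv_text": "Anthony Lui, Lead AI Engineer..."})
```

Driving the loop is then just repeated calls to `route` with the payload each agent enriches.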

---

## Hugging Face / Gradio (interactive controls)
- In the CV Analysis tab, you can now set:
  - **Refinement cycles** (1–5)
  - **Exact target length** (characters) to enforce resume and cover‑letter length deterministically
  - **Layout preset**: `classic`, `modern`, `minimalist`, `executive`
    - classic: Summary → Skills → Experience → Education (Summary/Skills above the fold)
    - modern: Summary → Experience → Skills → Projects/Certifications → Education
    - minimalist: concise Summary → Skills → Experience → Education
    - executive: Summary → Selected Achievements (3–5) → Experience → Skills → Education → Certifications

---

## UK resume/cover rules (built-in)
- UK English and dates (MMM YYYY)
- Current role in present tense; previous roles in past tense
- Digits for numbers; £ and % normalization
- Remove first‑person pronouns in resume bullets; maintain active voice
- Hard skills first (max ~10), then soft skills; verbatim critical JD keywords in bullets
- Strip DOB/photo lines; compress older roles (>15 years) to title/company/dates

These rules are applied by `agents/cv_owner.py` and validated by checklists.
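To make the rules concrete, here is a tiny sketch of three of them (£ normalization, MMM YYYY dates, first-person scrub) as regex passes. This is illustrative only; the real implementation lives in `agents/cv_owner.py` and `agents/guidelines.py` and covers far more cases.

```python
import re

MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]

def normalize_uk(text: str) -> str:
    """Apply a few illustrative UK resume rules to a bullet line."""
    # "GBP 2m" -> "£2m"
    text = re.sub(r"\bGBP\s?", "£", text)
    # "03/2021" -> "Mar 2021"
    text = re.sub(
        r"\b(0?[1-9]|1[0-2])/(\d{4})\b",
        lambda m: f"{MONTHS[int(m.group(1)) - 1]} {m.group(2)}",
        text,
    )
    # Drop a leading first-person pronoun in bullet lines
    text = re.sub(r"(?m)^(\s*[-*]\s*)I\s+", r"\1", text)
    return text

sample = "- I delivered savings of GBP 2m between 03/2021 and 11/2023"
out = normalize_uk(sample)
```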

---

## Checklists and observability
- Checklists integrate guidance from:
  - Reed: CV layout and mistakes
  - The Muse: action verbs and layout basics
  - Novorésumé: one‑page bias, clean sections, links
  - StandOut CV: quantification, bullet density, recent‑role focus
- The Observability tab aggregates per‑agent events and displays checklist outcomes. Events are stored in `memory/data/events.jsonl`.

---

## Scripts (headless runs)
- Capco (Anthony Lui → Capco):
```powershell
python .\scripts\run_with_env.py .\scripts\run_anthony_capco.py
```
- Anthropic (Anthony Lui → Anthropic):
```powershell
python .\scripts\run_with_env.py .\scripts\run_anthropic_job.py
```
- Pipeline (Router + Agents + Review + Events):
```powershell
python .\scripts\run_with_env.py .\scripts\pipeline_anthony_capco.py
```

These scripts print document lengths, agent diagnostics, and whether Gemini is enabled. Set `.env` with `LLM_PROVIDER=gemini`, `LLM_MODEL=gemini-2.5-flash`, and `GEMINI_API_KEY`.

---

## Temporal knowledge graph (micro‑memory)
- `agents/temporal_tracker.py` stores time‑stamped triplets with non‑destructive invalidation.
- Integrated into the pipeline review step to track job application states and history.
- Utilities for timelines, active applications, and pattern analysis are included.
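The core idea, time-stamped triplets where a new fact invalidates but does not delete its predecessor, can be sketched as follows. The class and field names are illustrative, not the actual API of `agents/temporal_tracker.py`.

```python
from datetime import datetime, timezone

class TemporalKG:
    """Sketch: time-stamped triplets with non-destructive invalidation."""

    def __init__(self):
        self.triplets: list[dict] = []

    def assert_fact(self, subject: str, predicate: str, obj: str) -> None:
        now = datetime.now(timezone.utc).isoformat()
        # Mark any live fact for this subject/predicate as invalidated,
        # but keep it so the history remains queryable.
        for t in self.triplets:
            if (t["subject"] == subject and t["predicate"] == predicate
                    and t["invalid_from"] is None):
                t["invalid_from"] = now
        self.triplets.append({
            "subject": subject, "predicate": predicate, "object": obj,
            "valid_from": now, "invalid_from": None,
        })

    def current(self, subject: str, predicate: str):
        for t in reversed(self.triplets):
            if (t["subject"] == subject and t["predicate"] == predicate
                    and t["invalid_from"] is None):
                return t["object"]
        return None

kg = TemporalKG()
kg.assert_fact("capco_lead_ai_2024", "status", "drafted")
kg.assert_fact("capco_lead_ai_2024", "status", "submitted")
```

A timeline query is then just a scan of `triplets` ordered by `valid_from`, which is what makes pattern analysis over past applications possible.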

---

## Parallel agents + meta‑agent demo
- Notebook: `notebooks/agents_parallel_demo.ipynb`
- Runs 4 analysis agents in parallel and combines their outputs via a meta‑agent, with a timeline plot.
- Uses the central LLM client (`services/llm.py`) with `LLM_PROVIDER=gemini` and `LLM_MODEL=gemini-2.5-flash`.

Run (Jupyter/VS Code):
```python
%pip install nest_asyncio matplotlib
# Ensure GEMINI_API_KEY is set in your environment
```
Then open and run the notebook cells.
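The fan-out/fan-in pattern the notebook demonstrates can be sketched with plain `asyncio.gather`. The agent names and the stub `run_agent` are illustrative; the notebook calls the real LLM client in `services/llm.py` instead.

```python
import asyncio

async def run_agent(name: str, job_text: str) -> str:
    # Stand-in for an LLM call (the notebook uses services/llm.py here)
    await asyncio.sleep(0.01)
    return f"{name}: analysis of {len(job_text)} chars"

async def meta_analyze(job_text: str) -> str:
    agents = ["skills", "culture", "requirements", "ats"]
    # Fan out: all four analyses run concurrently
    results = await asyncio.gather(*(run_agent(a, job_text) for a in agents))
    # Fan in: the meta-agent combines the parallel outputs
    return "\n".join(results)

summary = asyncio.run(meta_analyze("Senior Python Engineer role at ..."))
```

Because `gather` preserves argument order, the meta-agent can rely on result position even though the calls complete concurrently.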

---

## LinkedIn OAuth (optional)
1) Create a LinkedIn Developer App, then add redirect URLs:
```
http://localhost:8501
http://localhost:8501/callback
```
2) Products: enable "Sign In with LinkedIn using OpenID Connect".
3) Update `.env` and set `MOCK_MODE=false`.
4) In the UI, use the "LinkedIn Authentication" section to kick off the flow.

Notes:
- The LinkedIn Jobs API is enterprise‑only. The system uses Adzuna and other sources for job data.

---

## Job sources
- **Adzuna**: global coverage, 5,000 free jobs/month
- **Resilient aggregator** and optional **JobSpy MCP** for broader search
- **Custom jobs**: add your own postings in the UI
- Corporate SSL environments: Adzuna calls automatically retry with a `verify=False` fallback

---

## LLMs and configuration
- The central client supports OpenAI, Anthropic, and Gemini with per‑agent Gemini keys (`services/llm.py`).
- Recommended defaults for this project:
  - `LLM_PROVIDER=gemini`
  - `LLM_MODEL=gemini-2.5-flash`
- Agents pass `agent="cv|cover|parser|match|tailor|chat"` to use per‑agent keys when provided.
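The per-agent key lookup amounts to "agent-specific variable first, shared `GEMINI_API_KEY` as fallback". A minimal sketch, assuming the variable names from `.env.example` (the function itself is illustrative, not the actual `services/llm.py` code):

```python
import os

PER_AGENT_KEYS = {
    "cv": "GEMINI_API_KEY_CV",
    "cover": "GEMINI_API_KEY_COVER",
    "chat": "GEMINI_API_KEY_CHAT",
    "parser": "GEMINI_API_KEY_PARSER",
    "match": "GEMINI_API_KEY_MATCH",
    "tailor": "GEMINI_API_KEY_TAILOR",
}

def gemini_key_for(agent: str):
    """Return the agent-specific key if set, else the shared key, else None."""
    specific = os.getenv(PER_AGENT_KEYS.get(agent, ""), "")
    return specific or os.getenv("GEMINI_API_KEY") or None

# Demo values only; real keys come from .env
os.environ["GEMINI_API_KEY"] = "shared-key"
os.environ["GEMINI_API_KEY_CV"] = "cv-key"
```

This is what lets you route, say, high-volume parser calls to a cheaper key while keeping drafting on a separate quota.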

---

## Advanced agents (built‑in)
- **Parallel processing**: 3–5× faster multi‑job drafting
- **Temporal tracking**: time‑stamped history and pattern analysis
- **Observability**: tracing, metrics, timeline visualization
- **Context engineering**: flywheel learning, L1/L2/L3 memory, scalable context

Toggle these in the HF app under "🚀 Advanced AI Features".

---

## LangExtract + Gemini
- Uses the same `GEMINI_API_KEY` (auto‑applied to `LANGEXTRACT_API_KEY` when empty)
- The official `langextract.extract(...)` requires examples; the UI also exposes a robust regex‑based fallback (`services/langextract_service.py`) so features work even when cloud extraction is constrained
- In the HF app ("🔍 Enhanced Job Analysis"), you can:
  - Analyze job postings (structured fields + skills)
  - Optimize a resume for ATS (score + missing keywords)
  - Bulk analyze multiple jobs
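The regex fallback and ATS scoring can be sketched together: extract known skill keywords from the posting, then score the resume by keyword coverage. The pattern list and scoring formula are assumptions for illustration, not the actual logic in `services/langextract_service.py` or `utils/ats.py`.

```python
import re

SKILL_PATTERNS = [r"\bPython\b", r"\bSQL\b", r"\bAWS\b", r"\bKubernetes\b"]

def extract_skills(job_posting: str) -> list[str]:
    """Regex fallback: return skill keywords found in the posting."""
    found = []
    for pattern in SKILL_PATTERNS:
        m = re.search(pattern, job_posting, flags=re.IGNORECASE)
        if m and m.group(0) not in found:
            found.append(m.group(0))
    return found

def ats_score(resume: str, required: list[str]) -> float:
    """Fraction of required keywords present in the resume."""
    if not required:
        return 1.0
    hits = sum(1 for kw in required
               if re.search(rf"\b{re.escape(kw)}\b", resume, re.IGNORECASE))
    return hits / len(required)

posting = "We need Python and SQL; AWS a plus."
skills = extract_skills(posting)
score = ats_score("Python developer with AWS experience", skills)
```

The "missing keywords" report is then just the required keywords that did not hit.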

---

## Office exports
- **Word** (`services/word_cv.py`): resumes + cover letters (5 templates; `python-docx` fallback)
- **PowerPoint** (`services/powerpoint_cv.py`): visual CV (4 templates; `python-pptx` fallback)
- **Excel** (`services/excel_tracker.py`): tracker with 5 analytical sheets (`openpyxl` fallback)
- MCP servers are used when available; local libraries otherwise

In the HF app, after generation, expand:
- "📊 Export to PowerPoint CV"
- "📝 Export to Word Documents"
- "📈 Export Excel Tracker"

---

## Hugging Face minimal Space branch
- A clean branch containing only `app.py` and `requirements.txt` for Spaces.
- Branch name: `hf-space-min` (push from a clean worktree).
- `.gitignore` includes `.env` and `.env.*` to avoid leaking secrets.

---

## Tests & scripts
- Run the test suites in `tests/`
- Useful scripts: `test_*` files in the project root (integration checks)

---

## Security
- OAuth state validation; input/path/URL sanitization
- Sensitive data via environment variables; avoid committing secrets
- Atomic writes in the memory store

---

## Run summary
- Streamlit: `python -m streamlit run app.py --server.port 8501`
- Gradio/HF: `PORT=7861 python hf_app.py`

The system is fully documented here in one place and ready for local or HF deployment.
agents/__init__.py ADDED
@@ -0,0 +1 @@
# agents package
agents/__pycache__/__init__.cpython-313.pyc ADDED
Binary file (186 Bytes).
agents/__pycache__/context_engineer.cpython-313.pyc ADDED
Binary file (24.9 kB).
agents/__pycache__/context_scaler.cpython-313.pyc ADDED
Binary file (21.9 kB).
agents/__pycache__/cover_letter_agent.cpython-313.pyc ADDED
Binary file (8.97 kB).
agents/__pycache__/cv_owner.cpython-313.pyc ADDED
Binary file (27.1 kB).
agents/__pycache__/guidelines.cpython-313.pyc ADDED
Binary file (15 kB).
agents/__pycache__/job_agent.cpython-313.pyc ADDED
Binary file (1.46 kB).
agents/__pycache__/linkedin_manager.cpython-313.pyc ADDED
Binary file (6.72 kB).
agents/__pycache__/observability.cpython-313.pyc ADDED
Binary file (18.7 kB).
agents/__pycache__/orchestrator.cpython-313.pyc ADDED
Binary file (11.2 kB).
agents/__pycache__/parallel_executor.cpython-313.pyc ADDED
Binary file (15.7 kB).
agents/__pycache__/pipeline.cpython-313.pyc ADDED
Binary file (12.6 kB).
agents/__pycache__/profile_agent.cpython-313.pyc ADDED
Binary file (2.06 kB).
agents/__pycache__/router_agent.cpython-313.pyc ADDED
Binary file (1.51 kB).
agents/__pycache__/temporal_tracker.cpython-313.pyc ADDED
Binary file (20.6 kB).
agents/a2a_cv_owner.py ADDED
@@ -0,0 +1,356 @@
"""
A2A Protocol Implementation for CV Owner Agent
Proof of Concept showing how agents can communicate via A2A protocol
"""

import json
import asyncio
from typing import Dict, Any, List, Optional
from datetime import datetime
from dataclasses import dataclass, asdict
import aiohttp
from aiohttp import web
import logging

# Import existing CV Owner logic
from agents.cv_owner import CVOwnerAgent as OriginalCVOwner
from models.schemas import JobPosting, ResumeDraft

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)


@dataclass
class AgentCard:
    """Agent discovery card following A2A specification"""
    name: str
    description: str
    version: str
    endpoint: str
    capabilities: List[str]
    interaction_modes: List[str]
    auth_required: bool = False

    def to_dict(self) -> Dict[str, Any]:
        return asdict(self)


class A2ACVOwnerAgent:
    """CV Owner Agent implementing A2A Protocol"""

    def __init__(self, port: int = 8001):
        self.port = port
        self.name = "cv_owner_service"
        self.version = "1.0.0"
        self.original_agent = OriginalCVOwner()
        self.app = web.Application()
        self.setup_routes()

        # Agent Card for discovery
        self.card = AgentCard(
            name=self.name,
            description="ATS-optimized resume generation with UK formatting rules",
            version=self.version,
            endpoint=f"http://localhost:{self.port}",
            capabilities=[
                "resume.generate",
                "resume.refine",
                "resume.optimize_ats",
                "resume.validate_uk_format"
            ],
            interaction_modes=["sync", "async", "stream"],
            auth_required=False
        )

    def setup_routes(self):
        """Setup A2A JSON-RPC 2.0 routes"""
        self.app.router.add_post('/rpc', self.handle_rpc)
        self.app.router.add_get('/agent-card', self.get_agent_card)
        self.app.router.add_get('/health', self.health_check)

    async def get_agent_card(self, request: web.Request) -> web.Response:
        """Return agent discovery card"""
        return web.json_response(self.card.to_dict())

    async def health_check(self, request: web.Request) -> web.Response:
        """Health check endpoint"""
        return web.json_response({
            "status": "healthy",
            "agent": self.name,
            "version": self.version,
            "timestamp": datetime.now().isoformat()
        })

    async def handle_rpc(self, request: web.Request) -> web.Response:
        """Handle JSON-RPC 2.0 requests"""
        try:
            data = await request.json()

            # Validate JSON-RPC request
            if "jsonrpc" not in data or data["jsonrpc"] != "2.0":
                return self.error_response(
                    -32600, "Invalid Request", data.get("id")
                )

            method = data.get("method")
            params = data.get("params", {})
            request_id = data.get("id")

            # Route to appropriate method
            if method == "resume.generate":
                result = await self.generate_resume(params)
            elif method == "resume.refine":
                result = await self.refine_resume(params)
            elif method == "resume.optimize_ats":
                result = await self.optimize_ats(params)
            elif method == "resume.validate_uk_format":
                result = await self.validate_uk_format(params)
            elif method == "_capabilities":
                result = self.get_capabilities()
            else:
                return self.error_response(
                    -32601, f"Method not found: {method}", request_id
                )

            # Return success response
            return web.json_response({
                "jsonrpc": "2.0",
                "result": result,
                "id": request_id
            })

        except Exception as e:
            logger.error(f"RPC error: {str(e)}")
            return self.error_response(
                -32603, f"Internal error: {str(e)}",
                data.get("id") if "data" in locals() else None
            )

    def error_response(self, code: int, message: str, request_id: Any) -> web.Response:
        """Create JSON-RPC error response"""
        return web.json_response({
            "jsonrpc": "2.0",
            "error": {
                "code": code,
                "message": message
            },
            "id": request_id
        })

    async def generate_resume(self, params: Dict[str, Any]) -> Dict[str, Any]:
        """Generate resume via A2A protocol"""
        try:
            # Extract parameters
            job_data = params.get("job", {})
            cv_text = params.get("cv_text", "")
            target_length = params.get("target_length", 4000)

            # Convert to JobPosting object
            job = JobPosting(
                id=job_data.get("id", "unknown"),
                title=job_data.get("title", ""),
                company=job_data.get("company", ""),
                description=job_data.get("description", ""),
                location=job_data.get("location", ""),
                salary_min=job_data.get("salary_min"),
                salary_max=job_data.get("salary_max")
            )

            # Generate using original agent
            result = self.original_agent.generate_resume(
                job, cv_text, target_length=target_length
            )

            # Return A2A-formatted response
            return {
                "resume_text": result.text,
                "metadata": result.metadata,
                "ats_score": getattr(result, "ats_score", 0.85),
                "keywords": getattr(result, "keywords", []),
                "generation_time": datetime.now().isoformat(),
                "agent": self.name
            }

        except Exception as e:
            logger.error(f"Resume generation error: {str(e)}")
            raise

    async def refine_resume(self, params: Dict[str, Any]) -> Dict[str, Any]:
        """Refine existing resume"""
        try:
            resume_text = params.get("resume_text", "")
            feedback = params.get("feedback", {})

            # Use original agent's refinement logic
            refined = self.original_agent.refine_resume(
                resume_text, feedback
            )

            return {
                "refined_text": refined.text,
                "changes_made": refined.metadata.get("changes", []),
                "refinement_time": datetime.now().isoformat()
            }

        except Exception as e:
            logger.error(f"Resume refinement error: {str(e)}")
            raise

    async def optimize_ats(self, params: Dict[str, Any]) -> Dict[str, Any]:
        """Optimize resume for ATS"""
        resume_text = params.get("resume_text", "")
        job_description = params.get("job_description", "")

        # Perform ATS optimization
        optimized = self.original_agent.optimize_for_ats(
            resume_text, job_description
        )

        return {
            "optimized_text": optimized["text"],
            "ats_score": optimized["score"],
            "keywords_added": optimized["keywords"],
            "optimization_time": datetime.now().isoformat()
        }

    async def validate_uk_format(self, params: Dict[str, Any]) -> Dict[str, Any]:
217
+ """Validate UK formatting rules"""
218
+ resume_text = params.get("resume_text", "")
219
+
220
+ # Check UK formatting
221
+ issues = []
222
+
223
+ # Check for US date formats
224
+ if "January 2024" not in resume_text and "/2024" in resume_text:
225
+ issues.append("Use UK date format (MMM YYYY)")
226
+
227
+ # Check for US spelling
228
+ us_words = ["optimize", "analyze", "organization"]
229
+ for word in us_words:
230
+ if word in resume_text.lower():
231
+ issues.append(f"Use UK spelling for '{word}'")
232
+
233
+ return {
234
+ "is_valid": len(issues) == 0,
235
+ "issues": issues,
236
+ "validation_time": datetime.now().isoformat()
237
+ }
238
+
239
+ def get_capabilities(self) -> Dict[str, Any]:
240
+ """Return agent capabilities"""
241
+ return {
242
+ "capabilities": self.card.capabilities,
243
+ "version": self.version,
244
+ "interaction_modes": self.card.interaction_modes,
245
+ "max_resume_length": 5000,
246
+ "supported_formats": ["text", "markdown"],
247
+ "uk_formatting": True,
248
+ "ats_optimization": True
249
+ }
250
+
251
+ async def register_with_registry(self, registry_url: str):
252
+ """Register this agent with A2A registry"""
253
+ async with aiohttp.ClientSession() as session:
254
+ try:
255
+ async with session.post(
256
+ f"{registry_url}/register",
257
+ json=self.card.to_dict()
258
+ ) as response:
259
+ if response.status == 200:
260
+ logger.info(f"Registered {self.name} with registry")
261
+ else:
262
+ logger.error(f"Registration failed: {await response.text()}")
263
+ except Exception as e:
264
+ logger.error(f"Could not register with registry: {e}")
265
+
266
+ def run(self):
267
+ """Start the A2A agent server"""
268
+ logger.info(f"Starting {self.name} on port {self.port}")
269
+ logger.info(f"Agent Card available at http://localhost:{self.port}/agent-card")
270
+ logger.info(f"RPC endpoint at http://localhost:{self.port}/rpc")
271
+
272
+ web.run_app(self.app, host='0.0.0.0', port=self.port)
273
+
274
+
275
+ class A2AClient:
276
+ """Client for communicating with A2A agents"""
277
+
278
+ def __init__(self, agent_endpoint: str):
279
+ self.endpoint = agent_endpoint
280
+ self.session = None
281
+
282
+ async def __aenter__(self):
283
+ self.session = aiohttp.ClientSession()
284
+ return self
285
+
286
+ async def __aexit__(self, exc_type, exc_val, exc_tb):
287
+ if self.session:
288
+ await self.session.close()
289
+
290
+ async def call(self, method: str, params: Dict[str, Any] = None) -> Any:
291
+ """Call an A2A agent method"""
292
+ if not self.session:
293
+ self.session = aiohttp.ClientSession()
294
+
295
+ request = {
296
+ "jsonrpc": "2.0",
297
+ "method": method,
298
+ "params": params or {},
299
+ "id": datetime.now().timestamp()
300
+ }
301
+
302
+ async with self.session.post(
303
+ f"{self.endpoint}/rpc",
304
+ json=request
305
+ ) as response:
306
+ data = await response.json()
307
+
308
+ if "error" in data:
309
+ raise Exception(f"RPC Error: {data['error']}")
310
+
311
+ return data.get("result")
312
+
313
+ async def get_agent_card(self) -> Dict[str, Any]:
314
+ """Get agent's discovery card"""
315
+ if not self.session:
316
+ self.session = aiohttp.ClientSession()
317
+
318
+ async with self.session.get(
319
+ f"{self.endpoint}/agent-card"
320
+ ) as response:
321
+ return await response.json()
322
+
323
+
324
+ async def test_a2a_agent():
325
+ """Test the A2A CV Owner Agent"""
326
+ # Start agent in background
327
+ agent = A2ACVOwnerAgent()
328
+
329
+ # In production, this would run in separate process
330
+ # For testing, we'll use the client
331
+
332
+ async with A2AClient("http://localhost:8001") as client:
333
+ # Get agent card
334
+ card = await client.get_agent_card()
335
+ print(f"Agent: {card['name']}")
336
+ print(f"Capabilities: {card['capabilities']}")
337
+
338
+ # Generate resume
339
+ result = await client.call("resume.generate", {
340
+ "job": {
341
+ "id": "test_job",
342
+ "title": "Senior AI Engineer",
343
+ "company": "TechCorp",
344
+ "description": "Looking for AI expert with LLM experience..."
345
+ },
346
+ "cv_text": "John Doe, AI Engineer with 5 years experience..."
347
+ })
348
+
349
+ print(f"Generated resume: {result['resume_text'][:200]}...")
350
+ print(f"ATS Score: {result['ats_score']}")
351
+
352
+
353
+ if __name__ == "__main__":
354
+ # Run the A2A agent
355
+ agent = A2ACVOwnerAgent()
356
+ agent.run()
agents/context_engineer.py ADDED
@@ -0,0 +1,540 @@
+"""
+Context Engineering System
+Implements the complete context engineering framework for optimal LLM performance
+Based on the three-step evolution: Retrieval/Generation → Processing → Management
+"""
+
+import json
+import logging
+from typing import Dict, List, Any, Optional, Tuple
+from datetime import datetime, timedelta
+from dataclasses import dataclass, field
+import hashlib
+from collections import deque
+import numpy as np
+from pathlib import Path
+
+logger = logging.getLogger(__name__)
+
+
+@dataclass
+class ContextChunk:
+    """A unit of context with metadata"""
+    content: str
+    source: str
+    timestamp: datetime
+    relevance_score: float = 0.0
+    token_count: int = 0
+    embedding: Optional[np.ndarray] = None
+    metadata: Dict = field(default_factory=dict)
+    compression_ratio: float = 1.0
+    access_count: int = 0
+    last_accessed: Optional[datetime] = None
+
+    def update_access(self):
+        """Update access statistics"""
+        self.access_count += 1
+        self.last_accessed = datetime.now()
+
+
+class DataFlywheel:
+    """
+    NVIDIA's concept: continuous improvement through input/output pairing.
+    Learns from successful context usage to optimize future retrievals.
+    """
+
+    def __init__(self, storage_path: str = "flywheel_data.json"):
+        self.storage_path = Path(storage_path)
+        self.successful_contexts: List[Dict] = []
+        self.feedback_pairs: List[Tuple[str, str, float]] = []  # (input, output, score)
+        self.pattern_cache: Dict[str, List[str]] = {}
+        self.load()
+
+    def record_success(
+        self,
+        input_context: str,
+        output: str,
+        success_score: float,
+        context_chunks: List[ContextChunk]
+    ):
+        """Record successful context usage for learning"""
+        self.successful_contexts.append({
+            'timestamp': datetime.now().isoformat(),
+            'input': input_context[:500],  # Truncate for storage
+            'output': output[:500],
+            'score': success_score,
+            'chunks_used': [c.source for c in context_chunks],
+            'avg_relevance': np.mean([c.relevance_score for c in context_chunks])
+        })
+
+        # Update pattern cache
+        key = self._generate_pattern_key(input_context)
+        if key not in self.pattern_cache:
+            self.pattern_cache[key] = []
+        self.pattern_cache[key].extend([c.source for c in context_chunks])
+
+        self.save()
+
+    def get_recommended_sources(self, query: str) -> List[str]:
+        """Get recommended context sources based on past successes"""
+        key = self._generate_pattern_key(query)
+
+        if key in self.pattern_cache:
+            # Return the most frequently used sources for similar queries
+            sources = self.pattern_cache[key]
+            from collections import Counter
+            return [s for s, _ in Counter(sources).most_common(5)]
+
+        return []
+
+    def _generate_pattern_key(self, text: str) -> str:
+        """Generate a pattern key for caching"""
+        # Simple keyword extraction for pattern matching
+        keywords = sorted(set(text.lower().split()[:10]))
+        return hashlib.md5('_'.join(keywords).encode()).hexdigest()[:8]
+
+    def save(self):
+        """Persist flywheel data"""
+        data = {
+            'successful_contexts': self.successful_contexts[-100:],  # Keep last 100
+            'pattern_cache': {k: v[-20:] for k, v in self.pattern_cache.items()}  # Keep last 20 per pattern
+        }
+        with open(self.storage_path, 'w') as f:
+            json.dump(data, f, indent=2)
+
+    def load(self):
+        """Load flywheel data"""
+        if self.storage_path.exists():
+            try:
+                with open(self.storage_path, 'r') as f:
+                    data = json.load(f)
+                self.successful_contexts = data.get('successful_contexts', [])
+                self.pattern_cache = data.get('pattern_cache', {})
+            except Exception as e:
+                logger.error(f"Error loading flywheel data: {e}")
+
+
+class ContextProcessor:
+    """
+    Step 2: Process and refine raw context.
+    Handles chunking, embedding, relevance scoring, and compression.
+    """
+
+    def __init__(self, max_chunk_size: int = 500, overlap: int = 50):
+        self.max_chunk_size = max_chunk_size
+        self.overlap = overlap
+
+    def process_context(
+        self,
+        raw_context: str,
+        query: str,
+        source: str = "unknown"
+    ) -> List[ContextChunk]:
+        """Process raw context into optimized chunks"""
+
+        # 1. Chunk the context
+        chunks = self._chunk_text(raw_context)
+
+        # 2. Create ContextChunk objects
+        context_chunks = []
+        for chunk_text in chunks:
+            chunk = ContextChunk(
+                content=chunk_text,
+                source=source,
+                timestamp=datetime.now(),
+                token_count=len(chunk_text.split()),
+                relevance_score=self._calculate_relevance(chunk_text, query)
+            )
+
+            # 3. Apply compression if needed
+            if chunk.token_count > 100:
+                chunk.content, chunk.compression_ratio = self._compress_text(chunk_text)
+
+            context_chunks.append(chunk)
+
+        # 4. Sort by relevance
+        context_chunks.sort(key=lambda c: c.relevance_score, reverse=True)
+
+        return context_chunks
+
+    def _chunk_text(self, text: str) -> List[str]:
+        """Smart chunking with overlap"""
+        words = text.split()
+        chunks = []
+
+        for i in range(0, len(words), self.max_chunk_size - self.overlap):
+            chunk = ' '.join(words[i:i + self.max_chunk_size])
+            chunks.append(chunk)
+
+        return chunks
+
+    def _calculate_relevance(self, chunk: str, query: str) -> float:
+        """Calculate the relevance score between a chunk and a query"""
+        # Simple keyword-overlap scoring (would use embeddings in production)
+        query_words = set(query.lower().split())
+        chunk_words = set(chunk.lower().split())
+
+        if not query_words:
+            return 0.0
+
+        overlap = len(query_words & chunk_words)
+        return overlap / len(query_words)
+
+    def _compress_text(self, text: str) -> Tuple[str, float]:
+        """Compress text by removing redundancy"""
+        # Simple compression: remove duplicate sentences
+        sentences = text.split('.')
+        unique_sentences = []
+        seen = set()
+
+        for sent in sentences:
+            sent_clean = sent.strip().lower()
+            if sent_clean and sent_clean not in seen:
+                unique_sentences.append(sent.strip())
+                seen.add(sent_clean)
+
+        compressed = '. '.join(unique_sentences)
+        if unique_sentences and not compressed.endswith('.'):
+            compressed += '.'
+
+        compression_ratio = len(compressed) / len(text) if text else 1.0
+        return compressed, compression_ratio
+
+
+class MemoryHierarchy:
+    """
+    Hierarchical memory system with different levels:
+    L1: hot cache (immediate access)
+    L2: working memory (recent contexts)
+    L3: long-term storage (compressed historical)
+    """
+
+    def __init__(
+        self,
+        l1_size: int = 10,
+        l2_size: int = 100,
+        l3_path: str = "long_term_memory.json"
+    ):
+        self.l1_cache: deque = deque(maxlen=l1_size)  # Most recent/relevant
+        self.l2_memory: deque = deque(maxlen=l2_size)  # Working memory
+        self.l3_storage_path = Path(l3_path)
+        self.l3_index: Dict[str, Dict] = {}  # Index for long-term storage
+        self.load_l3()
+
+    def add_context(self, chunk: ContextChunk):
+        """Add context to the appropriate memory level"""
+        # High relevance goes to L1
+        if chunk.relevance_score > 0.8:
+            self.l1_cache.append(chunk)
+        # Medium relevance to L2
+        elif chunk.relevance_score > 0.5:
+            self.l2_memory.append(chunk)
+        # Everything gets indexed in L3
+        self._add_to_l3(chunk)
+
+    def retrieve(
+        self,
+        query: str,
+        max_chunks: int = 10,
+        recency_weight: float = 0.3
+    ) -> List[ContextChunk]:
+        """Retrieve relevant context from all memory levels"""
+        all_chunks = []
+
+        # Get from all levels
+        all_chunks.extend(list(self.l1_cache))
+        all_chunks.extend(list(self.l2_memory))
+
+        # Score chunks considering both relevance and recency
+        now = datetime.now()
+        for chunk in all_chunks:
+            # Calculate recency score (0-1, where 1 is most recent)
+            age_hours = (now - chunk.timestamp).total_seconds() / 3600
+            recency_score = max(0, 1 - (age_hours / 168))  # Decay over a week
+
+            # Combine relevance and recency
+            chunk.metadata['combined_score'] = (
+                chunk.relevance_score * (1 - recency_weight) +
+                recency_score * recency_weight
+            )
+
+        # Sort by combined score
+        all_chunks.sort(
+            key=lambda c: c.metadata.get('combined_score', 0),
+            reverse=True
+        )
+
+        # Update access statistics
+        for chunk in all_chunks[:max_chunks]:
+            chunk.update_access()
+
+        return all_chunks[:max_chunks]
+
+    def _add_to_l3(self, chunk: ContextChunk):
+        """Add to the long-term storage index"""
+        key = hashlib.md5(chunk.content.encode()).hexdigest()[:16]
+
+        self.l3_index[key] = {
+            'source': chunk.source,
+            'timestamp': chunk.timestamp.isoformat(),
+            'relevance': chunk.relevance_score,
+            'summary': chunk.content[:100],  # Store summary only
+            'access_count': chunk.access_count
+        }
+
+        # Periodically save
+        if len(self.l3_index) % 10 == 0:
+            self.save_l3()
+
+    def save_l3(self):
+        """Save long-term memory to disk"""
+        with open(self.l3_storage_path, 'w') as f:
+            json.dump(self.l3_index, f, indent=2)
+
+    def load_l3(self):
+        """Load long-term memory from disk"""
+        if self.l3_storage_path.exists():
+            try:
+                with open(self.l3_storage_path, 'r') as f:
+                    self.l3_index = json.load(f)
+            except Exception as e:
+                logger.error(f"Error loading L3 memory: {e}")
+
+
+class MultiModalContext:
+    """
+    Handle types of context beyond text:
+    temporal, spatial, participant states, intentional, cultural.
+    """
+
+    def __init__(self):
+        self.temporal_context: List[Dict] = []  # Time-based relationships
+        self.spatial_context: Dict = {}  # Location/geometry
+        self.participant_states: Dict[str, Dict] = {}  # Entity tracking
+        self.intentional_context: Dict = {}  # Goals and motivations
+        self.cultural_context: Dict = {}  # Social/cultural nuances
+
+    def add_temporal_context(
+        self,
+        event: str,
+        timestamp: datetime,
+        duration: Optional[timedelta] = None,
+        related_events: Optional[List[str]] = None
+    ):
+        """Add time-based context"""
+        self.temporal_context.append({
+            'event': event,
+            'timestamp': timestamp,
+            'duration': duration,
+            'related': related_events or []
+        })
+
+        # Sort by timestamp
+        self.temporal_context.sort(key=lambda x: x['timestamp'])
+
+    def add_participant_state(
+        self,
+        participant_id: str,
+        state: Dict,
+        timestamp: Optional[datetime] = None
+    ):
+        """Track participant/entity states over time"""
+        if participant_id not in self.participant_states:
+            self.participant_states[participant_id] = {
+                'current': state,
+                'history': []
+            }
+        else:
+            # Archive current state
+            self.participant_states[participant_id]['history'].append({
+                'state': self.participant_states[participant_id]['current'],
+                'timestamp': timestamp or datetime.now()
+            })
+            self.participant_states[participant_id]['current'] = state
+
+    def add_intentional_context(
+        self,
+        goal: str,
+        motivation: str,
+        constraints: Optional[List[str]] = None,
+        priority: float = 0.5
+    ):
+        """Add goals and motivations"""
+        self.intentional_context[goal] = {
+            'motivation': motivation,
+            'constraints': constraints or [],
+            'priority': priority,
+            'added': datetime.now()
+        }
+
+    def get_multimodal_summary(self) -> Dict:
+        """Get a summary of all context types"""
+        return {
+            'temporal_events': len(self.temporal_context),
+            'tracked_participants': len(self.participant_states),
+            'active_goals': len(self.intentional_context),
+            'has_spatial': bool(self.spatial_context),
+            'has_cultural': bool(self.cultural_context)
+        }
+
+
+class ContextEngineer:
+    """
+    Main context engineering orchestrator.
+    Implements the complete three-step framework.
+    """
+
+    def __init__(self):
+        self.flywheel = DataFlywheel()
+        self.processor = ContextProcessor()
+        self.memory = MemoryHierarchy()
+        self.multimodal = MultiModalContext()
+
+    def engineer_context(
+        self,
+        query: str,
+        raw_sources: List[Tuple[str, str]],  # (source_name, content)
+        multimodal_data: Optional[Dict] = None
+    ) -> Dict[str, Any]:
+        """
+        Complete context engineering pipeline:
+        Step 1: Retrieval & Generation
+        Step 2: Processing
+        Step 3: Management
+        """
+
+        # Step 1: Retrieval & Generation
+        # Get recommended sources from the flywheel
+        recommended = self.flywheel.get_recommended_sources(query)
+
+        # Prioritize recommended sources
+        prioritized_sources = []
+        for source_name, content in raw_sources:
+            priority = 2.0 if source_name in recommended else 1.0
+            prioritized_sources.append((source_name, content, priority))
+
+        # Step 2: Processing
+        all_chunks = []
+        for source_name, content, priority in prioritized_sources:
+            chunks = self.processor.process_context(content, query, source_name)
+
+            # Apply priority boost
+            for chunk in chunks:
+                chunk.relevance_score *= priority
+
+            all_chunks.extend(chunks)
+
+        # Add to the memory hierarchy
+        for chunk in all_chunks:
+            self.memory.add_context(chunk)
+
+        # Step 3: Management
+        # Retrieve optimized context
+        final_chunks = self.memory.retrieve(query, max_chunks=10)
+
+        # Add multimodal context if provided
+        if multimodal_data:
+            for key, value in multimodal_data.items():
+                if key == 'temporal':
+                    for event in value:
+                        self.multimodal.add_temporal_context(**event)
+                elif key == 'participants':
+                    for pid, state in value.items():
+                        self.multimodal.add_participant_state(pid, state)
+                elif key == 'goals':
+                    for goal, details in value.items():
+                        self.multimodal.add_intentional_context(goal, **details)
+
+        # Build final context
+        context = {
+            'primary_context': '\n\n'.join([c.content for c in final_chunks[:5]]),
+            'supporting_context': '\n'.join([c.content for c in final_chunks[5:10]]),
+            'metadata': {
+                'total_chunks': len(all_chunks),
+                'selected_chunks': len(final_chunks),
+                'avg_relevance': np.mean([c.relevance_score for c in final_chunks]) if final_chunks else 0,
+                'compression_ratio': np.mean([c.compression_ratio for c in final_chunks]) if final_chunks else 1,
+                'sources_used': list(set(c.source for c in final_chunks)),
+                'multimodal': self.multimodal.get_multimodal_summary()
+            },
+            'chunks': final_chunks  # For the feedback loop
+        }
+
+        return context
+
+    def record_feedback(
+        self,
+        context: Dict,
+        output: str,
+        success_score: float
+    ):
+        """Record feedback for continuous improvement"""
+        self.flywheel.record_success(
+            context['primary_context'],
+            output,
+            success_score,
+            context['chunks']
+        )
+
+    def optimize_memory(self):
+        """Optimize memory by removing low-value chunks"""
+        # This would implement memory pruning based on:
+        # - access frequency
+        # - age
+        # - relevance scores
+        # - compression potential
+        pass
+
+
+# Demo usage
+def demo_context_engineering():
+    """Demonstrate context engineering"""
+
+    engineer = ContextEngineer()
+
+    # Sample sources
+    sources = [
+        ("resume", "10 years experience in Python, AI, Machine Learning..."),
+        ("job_description", "Looking for senior AI engineer with Python skills..."),
+        ("company_research", "TechCorp is a leading AI company focused on NLP...")
+    ]
+
+    # Multimodal context
+    multimodal = {
+        'temporal': [
+            {
+                'event': 'Application deadline',
+                'timestamp': datetime.now() + timedelta(days=7)
+            }
+        ],
+        'participants': {
+            'applicant': {'status': 'preparing', 'confidence': 0.8}
+        },
+        'goals': {
+            'get_interview': {
+                'motivation': 'Career advancement',
+                'constraints': ['Remote only'],
+                'priority': 0.9
+            }
+        }
+    }
+
+    # Engineer context
+    context = engineer.engineer_context(
+        query="Write a cover letter for AI engineer position",
+        raw_sources=sources,
+        multimodal_data=multimodal
+    )
+
+    print("Engineered Context:")
+    print(f"Primary: {context['primary_context'][:200]}...")
+    print(f"Metadata: {context['metadata']}")
+
+    # Simulate success and record feedback
+    engineer.record_feedback(context, "Generated cover letter...", 0.9)
+
+    print("\nFlywheel learned patterns for future use!")
+
+
+if __name__ == "__main__":
+    demo_context_engineering()
agents/context_scaler.py ADDED
@@ -0,0 +1,504 @@
+"""
+Context Scaling System
+Handles length scaling (millions of tokens) and multi-modal/structural scaling
+Implements advanced attention methods and memory techniques from the article
+"""
+
+import logging
+from typing import Dict, List, Any, Optional, Tuple
+from dataclasses import dataclass
+import numpy as np
+from datetime import datetime
+import heapq
+
+logger = logging.getLogger(__name__)
+
+
+@dataclass
+class ScaledContext:
+    """Context that can scale to millions of tokens"""
+    segments: List[str]  # Segmented content
+    attention_map: np.ndarray  # Attention weights for segments
+    token_count: int
+    compression_level: int  # 0=none, 1=light, 2=medium, 3=heavy
+    modalities: Dict[str, Any]  # Different context modalities
+
+
+class AttentionOptimizer:
+    """
+    Advanced attention methods for handling extremely long contexts.
+    Implements sliding-window, sparse, and hierarchical attention.
+    """
+
+    def __init__(self, window_size: int = 512, stride: int = 256):
+        self.window_size = window_size
+        self.stride = stride
+
+    def sliding_window_attention(
+        self,
+        context: str,
+        query: str,
+        max_windows: int = 10
+    ) -> List[Tuple[str, float]]:
+        """
+        Process context using sliding-window attention.
+        Returns relevant windows with attention scores.
+        """
+        tokens = context.split()
+        windows = []
+
+        # Create sliding windows
+        for i in range(0, len(tokens) - self.window_size + 1, self.stride):
+            window = ' '.join(tokens[i:i + self.window_size])
+            score = self._calculate_attention_score(window, query)
+            windows.append((window, score))
+
+        # Return top windows
+        windows.sort(key=lambda x: x[1], reverse=True)
+        return windows[:max_windows]
+
+    def hierarchical_attention(
+        self,
+        context: str,
+        query: str,
+        levels: int = 3
+    ) -> Dict[int, List[str]]:
+        """
+        Multi-level hierarchical attention.
+        Higher levels = more compressed/abstract.
+        """
+        hierarchy = {}
+        current_text = context
+
+        for level in range(levels):
+            if level == 0:
+                # Finest level - full detail
+                hierarchy[level] = self._segment_text(current_text, 500)
+            elif level == 1:
+                # Middle level - paragraphs/sections
+                hierarchy[level] = self._extract_key_sentences(current_text)
+            else:
+                # Highest level - summary
+                hierarchy[level] = [self._generate_summary(current_text)]
+
+            # Compress for the next level
+            current_text = ' '.join(hierarchy[level])
+
+        return hierarchy
+
+    def sparse_attention(
+        self,
+        context: str,
+        query: str,
+        sparsity: float = 0.1
+    ) -> List[str]:
+        """
+        Sparse attention - only attend to the most relevant tokens.
+        Reduces computation from O(n²) to O(n*k).
+        """
+        tokens = context.split()
+        query_tokens = set(query.lower().split())
+
+        # Calculate relevance for each token
+        token_scores = []
+        for i, token in enumerate(tokens):
+            score = 1.0 if token.lower() in query_tokens else np.random.random() * 0.5
+            token_scores.append((i, token, score))
+
+        # Keep only the top k% of tokens
+        k = int(len(tokens) * sparsity)
+        top_tokens = heapq.nlargest(k, token_scores, key=lambda x: x[2])
+
+        # Sort by original position to maintain order
+        top_tokens.sort(key=lambda x: x[0])
+
+        # Reconstruct the sparse context
+        sparse_context = []
+        last_idx = -1
+        for idx, token, score in top_tokens:
+            if idx > last_idx + 1:
+                sparse_context.append("...")
+            sparse_context.append(token)
+            last_idx = idx
+
+        return sparse_context
+
+    def _calculate_attention_score(self, window: str, query: str) -> float:
+        """Calculate the attention score between a window and a query"""
+        window_words = set(window.lower().split())
+        query_words = set(query.lower().split())
+
+        if not query_words:
+            return 0.0
+
+        overlap = len(window_words & query_words)
+        return overlap / len(query_words)
+
+    def _segment_text(self, text: str, segment_size: int) -> List[str]:
+        """Segment text into chunks"""
+        words = text.split()
+        segments = []
+        for i in range(0, len(words), segment_size):
+            segments.append(' '.join(words[i:i + segment_size]))
+        return segments
+
+    def _extract_key_sentences(self, text: str) -> List[str]:
+        """Extract key sentences (simplified)"""
+        sentences = text.split('.')
+        # Keep sentences with more than 10 words (likely more informative)
+        key_sentences = [s.strip() + '.' for s in sentences if len(s.split()) > 10]
+        return key_sentences[:10]  # Top 10 sentences
+
+    def _generate_summary(self, text: str) -> str:
+        """Generate a summary (simplified - would use an LLM in production)"""
+        sentences = text.split('.')[:3]  # First 3 sentences as summary
+        return '. '.join(sentences) + '.'
+
+
+class LengthScaler:
+    """
+    Handle context scaling from thousands to millions of tokens.
+    Maintains coherence across long documents.
+    """
+
+    def __init__(self, max_tokens: int = 1000000):
+        self.max_tokens = max_tokens
+        self.attention_optimizer = AttentionOptimizer()
+
+    def scale_context(
+        self,
+        context: str,
+        query: str,
+        target_tokens: int = 2000
+    ) -> ScaledContext:
+        """Scale context to a target token count while maintaining relevance"""
+
+        tokens = context.split()
+        current_tokens = len(tokens)
+
+        # Determine the compression level needed
+        compression_ratio = current_tokens / target_tokens
+
+        if compression_ratio <= 1:
+            # No compression needed
+            return ScaledContext(
+                segments=[context],
+                attention_map=np.array([1.0]),
+                token_count=current_tokens,
+                compression_level=0,
+                modalities={}
+            )
+
+        # Apply the appropriate scaling strategy
+        if compression_ratio < 5:
+            # Light compression - sliding window
+            segments = self._light_compression(context, query, target_tokens)
+            compression_level = 1
+        elif compression_ratio < 20:
+            # Medium compression - hierarchical
+            segments = self._medium_compression(context, query, target_tokens)
+            compression_level = 2
+        else:
+            # Heavy compression - sparse attention
+            segments = self._heavy_compression(context, query, target_tokens)
+            compression_level = 3
+
+        # Calculate the attention map
+        attention_map = self._calculate_attention_map(segments, query)
+
+        return ScaledContext(
+            segments=segments,
+            attention_map=attention_map,
+            token_count=sum(len(s.split()) for s in segments),
+            compression_level=compression_level,
+            modalities={}
+        )
+
+    def _light_compression(
+        self,
+        context: str,
+        query: str,
+        target_tokens: int
+    ) -> List[str]:
+        """Light compression using sliding windows"""
+        windows = self.attention_optimizer.sliding_window_attention(
+            context, query, max_windows=target_tokens // 100
+        )
+        return [w for w, _ in windows]
+
+    def _medium_compression(
+        self,
+        context: str,
+        query: str,
+        target_tokens: int
+    ) -> List[str]:
+        """Medium compression using hierarchical attention"""
+        hierarchy = self.attention_optimizer.hierarchical_attention(context, query)
+
+        segments = []
+        remaining_tokens = target_tokens
+
+        # Add from each level based on available tokens
+        for level in sorted(hierarchy.keys()):
+            level_segments = hierarchy[level]
+            for segment in level_segments:
+                segment_tokens = len(segment.split())
+                if segment_tokens <= remaining_tokens:
+                    segments.append(segment)
+                    remaining_tokens -= segment_tokens
+                if remaining_tokens <= 0:
+                    break
+
+        return segments
+
+    def _heavy_compression(
+        self,
+        context: str,
+        query: str,
+ target_tokens: int
259
+ ) -> List[str]:
260
+ """Heavy compression using sparse attention"""
261
+ sparsity = target_tokens / len(context.split())
262
+ sparse_tokens = self.attention_optimizer.sparse_attention(
263
+ context, query, sparsity=min(sparsity, 0.3)
264
+ )
265
+
266
+ # Group sparse tokens into segments
267
+ segments = []
268
+ current_segment = []
269
+ for token in sparse_tokens:
270
+ if token == "...":
271
+ if current_segment:
272
+ segments.append(' '.join(current_segment))
273
+ current_segment = []
274
+ segments.append("...")
275
+ else:
276
+ current_segment.append(token)
277
+
278
+ if current_segment:
279
+ segments.append(' '.join(current_segment))
280
+
281
+ return segments
282
+
283
+ def _calculate_attention_map(
284
+ self,
285
+ segments: List[str],
286
+ query: str
287
+ ) -> np.ndarray:
288
+ """Calculate attention weights for each segment"""
289
+ query_words = set(query.lower().split())
290
+ attention_scores = []
291
+
292
+ for segment in segments:
293
+ if segment == "...":
294
+ attention_scores.append(0.0)
295
+ else:
296
+ segment_words = set(segment.lower().split())
297
+ overlap = len(query_words & segment_words)
298
+ score = overlap / max(len(query_words), 1)
299
+ attention_scores.append(score)
300
+
301
+ # Normalize
302
+ scores = np.array(attention_scores)
303
+ if scores.sum() > 0:
304
+ scores = scores / scores.sum()
305
+
306
+ return scores
307
+
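The tier selection in `scale_context` reduces to a pure function of the token ratio. A standalone sketch (helper name is hypothetical; the thresholds mirror the branches above):

```python
def select_compression_level(current_tokens: int, target_tokens: int) -> int:
    """Mirror of scale_context's tier selection (illustrative helper)."""
    ratio = current_tokens / max(target_tokens, 1)
    if ratio <= 1:
        return 0  # fits already: no compression
    if ratio < 5:
        return 1  # light: sliding-window attention
    if ratio < 20:
        return 2  # medium: hierarchical attention
    return 3      # heavy: sparse attention
```

So a 100k-word context scaled to a 2k-word target (ratio 50) lands in the heavy tier, matching the demo output further down.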
308
+
309
+ class MultiModalScaler:
310
+ """
311
+ Handle multi-modal and structural context scaling
312
+ Temporal, spatial, participant states, intentional, cultural
313
+ """
314
+
315
+ def __init__(self):
316
+ self.modality_handlers = {
317
+ 'temporal': self._scale_temporal,
318
+ 'spatial': self._scale_spatial,
319
+ 'participant': self._scale_participant,
320
+ 'intentional': self._scale_intentional,
321
+ 'cultural': self._scale_cultural
322
+ }
323
+
324
+ def scale_multimodal(
325
+ self,
326
+ modalities: Dict[str, Any],
327
+ importance_weights: Optional[Dict[str, float]] = None
328
+ ) -> Dict[str, Any]:
329
+ """Scale multiple modalities based on importance"""
330
+
331
+ if importance_weights is None:
332
+ importance_weights = {
333
+ 'temporal': 0.3,
334
+ 'spatial': 0.1,
335
+ 'participant': 0.3,
336
+ 'intentional': 0.2,
337
+ 'cultural': 0.1
338
+ }
339
+
340
+ scaled = {}
341
+ for modality, data in modalities.items():
342
+ if modality in self.modality_handlers:
343
+ weight = importance_weights.get(modality, 0.1)
344
+ scaled[modality] = self.modality_handlers[modality](data, weight)
345
+
346
+ return scaled
347
+
348
+ def _scale_temporal(self, data: List[Dict], weight: float) -> List[Dict]:
349
+ """Scale temporal context - keep most recent and important events"""
350
+ # Sort by timestamp
351
+ sorted_data = sorted(data, key=lambda x: x.get('timestamp', datetime.min), reverse=True)
352
+
353
+ # Keep based on weight (more weight = more events kept)
354
+ keep_count = max(1, int(len(sorted_data) * weight))
355
+ return sorted_data[:keep_count]
356
+
357
+ def _scale_spatial(self, data: Dict, weight: float) -> Dict:
358
+ """Scale spatial context - simplify based on importance"""
359
+ if weight < 0.3:
360
+ # Low importance - just keep basic location
361
+ return {'location': data.get('primary_location', 'unknown')}
362
+ else:
363
+ # Higher importance - keep more detail
364
+ return data
365
+
366
+ def _scale_participant(self, data: Dict, weight: float) -> Dict:
367
+ """Scale participant states - keep most active participants"""
368
+ if not data:
369
+ return {}
370
+
371
+ # Sort by activity level (approximated by state changes)
372
+ participants = []
373
+ for pid, pdata in data.items():
374
+ activity = len(pdata.get('history', []))
375
+ participants.append((pid, pdata, activity))
376
+
377
+ participants.sort(key=lambda x: x[2], reverse=True)
378
+
379
+ # Keep based on weight
380
+ keep_count = max(1, int(len(participants) * weight))
381
+
382
+ return {pid: pdata for pid, pdata, _ in participants[:keep_count]}
383
+
384
+ def _scale_intentional(self, data: Dict, weight: float) -> Dict:
385
+ """Scale intentional context - keep high priority goals"""
386
+ if not data:
387
+ return {}
388
+
389
+ # Sort by priority
390
+ goals = [(k, v) for k, v in data.items()]
391
+ goals.sort(key=lambda x: x[1].get('priority', 0), reverse=True)
392
+
393
+ # Keep based on weight
394
+ keep_count = max(1, int(len(goals) * weight))
395
+
396
+ return {k: v for k, v in goals[:keep_count]}
397
+
398
+ def _scale_cultural(self, data: Dict, weight: float) -> Dict:
399
+ """Scale cultural context - keep if important"""
400
+ if weight < 0.2:
401
+ return {} # Skip if low importance
402
+ return data
403
+
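The temporal, participant, and intentional handlers above all share one pruning rule: keep a weight-proportional slice of the items, but never fewer than one. Extracted as a sketch (function name is hypothetical):

```python
def kept_count(total_items: int, weight: float) -> int:
    # Shared rule from the modality handlers above: keep a
    # weight-proportional slice, with a floor of 1 item.
    return max(1, int(total_items * weight))
```

The floor matters at low weights: even a near-zero modality weight leaves one representative item rather than silently dropping the modality.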
404
+
405
+ class ContextScalingOrchestrator:
406
+ """
407
+ Main orchestrator for context scaling
408
+ Combines length and multi-modal scaling
409
+ """
410
+
411
+ def __init__(self, max_context_tokens: int = 100000):
412
+ self.length_scaler = LengthScaler(max_context_tokens)
413
+ self.multimodal_scaler = MultiModalScaler()
414
+
415
+ def scale_complete_context(
416
+ self,
417
+ text_context: str,
418
+ multimodal_context: Dict[str, Any],
419
+ query: str,
420
+ target_tokens: int = 2000,
421
+ modality_weights: Optional[Dict[str, float]] = None
422
+ ) -> Dict[str, Any]:
423
+ """
424
+ Scale both text and multi-modal context
425
+ Returns optimally scaled context
426
+ """
427
+
428
+ # Scale text context
429
+ scaled_text = self.length_scaler.scale_context(
430
+ text_context, query, target_tokens
431
+ )
432
+
433
+ # Scale multi-modal context
434
+ scaled_multimodal = self.multimodal_scaler.scale_multimodal(
435
+ multimodal_context, modality_weights
436
+ )
437
+
438
+ # Combine
439
+ result = {
440
+ 'text': {
441
+ 'segments': scaled_text.segments,
442
+ 'attention_map': scaled_text.attention_map.tolist(),
443
+ 'token_count': scaled_text.token_count,
444
+ 'compression_level': scaled_text.compression_level
445
+ },
446
+ 'multimodal': scaled_multimodal,
447
+ 'metadata': {
448
+ 'original_tokens': len(text_context.split()),
449
+ 'scaled_tokens': scaled_text.token_count,
450
+ 'compression_ratio': len(text_context.split()) / max(scaled_text.token_count, 1),
451
+ 'modalities_preserved': list(scaled_multimodal.keys())
452
+ }
453
+ }
454
+
455
+ return result
456
+
457
+
458
+ # Demo usage
459
+ def demo_context_scaling():
460
+ """Demonstrate context scaling capabilities"""
461
+
462
+ # Create a very long context
463
+ long_context = " ".join([
464
+ f"Sentence {i} about various topics including AI, engineering, and software development."
465
+ for i in range(10000)
466
+ ]) # ~100k tokens
467
+
468
+ # Multi-modal context
469
+ multimodal = {
470
+ 'temporal': [
471
+ {'event': f'Event {i}', 'timestamp': datetime.now()}
472
+ for i in range(50)
473
+ ],
474
+ 'participant': {
475
+ f'person_{i}': {'state': 'active', 'history': []}
476
+ for i in range(20)
477
+ },
478
+ 'intentional': {
479
+ f'goal_{i}': {'priority': np.random.random()}
480
+ for i in range(10)
481
+ }
482
+ }
483
+
484
+ # Scale the context
485
+ orchestrator = ContextScalingOrchestrator()
486
+ scaled = orchestrator.scale_complete_context(
487
+ text_context=long_context,
488
+ multimodal_context=multimodal,
489
+ query="AI engineering position requirements",
490
+ target_tokens=2000
491
+ )
492
+
493
+ print("Scaling Results:")
494
+ print(f"Original tokens: {scaled['metadata']['original_tokens']}")
495
+ print(f"Scaled tokens: {scaled['metadata']['scaled_tokens']}")
496
+ print(f"Compression ratio: {scaled['metadata']['compression_ratio']:.2f}x")
497
+ print(f"Compression level: {scaled['text']['compression_level']}")
498
+ print(f"Modalities preserved: {scaled['metadata']['modalities_preserved']}")
499
+ print(f"Text segments: {len(scaled['text']['segments'])}")
500
+ print(f"Temporal events kept: {len(scaled['multimodal'].get('temporal', []))}")
501
+
502
+
503
+ if __name__ == "__main__":
504
+ demo_context_scaling()
agents/cover_letter_agent.py ADDED
@@ -0,0 +1,143 @@
1
+ from __future__ import annotations
2
+ from typing import List, Optional
3
+ import re
4
+
5
+ from models.schemas import UserProfile, JobPosting, CoverLetterDraft
6
+ from memory.store import memory_store
7
+ from utils.text import extract_keywords_from_text, clamp_to_char_limit
8
+ from utils.ats import basic_cover_letter_template, strengthen_action_verbs
9
+ from utils.consistency import allowed_keywords_from_profile, coverage_score, conciseness_score
10
+ from services.web_research import get_role_guidelines, cover_letter_inspiration_from_url
11
+ from services.llm import llm
12
+ from utils.langextractor import distill_text
13
+
14
+
15
+ class CoverLetterAgent:
16
+ def __init__(self) -> None:
17
+ self.name = "cover_letter"
18
+ self.max_chars = 4000
19
+
20
+ def create_cover_letter(self, profile: UserProfile, job: JobPosting, user_id: str = "default_user", user_chat: Optional[str] = None, seed_text: Optional[str] = None, agent2_notes: Optional[str] = None, inspiration_url: Optional[str] = None) -> CoverLetterDraft:
21
+ jd_keywords: List[str] = extract_keywords_from_text(job.description or "", top_k=25)
22
+ allowed = allowed_keywords_from_profile(profile.skills, profile.experiences)
23
+
24
+ greeting = "Hiring Manager,"
25
+ body = [
26
+ (
27
+ f"I am excited to apply for the {job.title} role at {job.company}. "
28
+ f"With experience across {', '.join(profile.skills[:8])}, I can quickly contribute to your team."
29
+ ),
30
+ (
31
+ "In my recent work, I delivered outcomes such as driving cost reductions, building scalable platforms, "
32
+ "and improving reliability. I have hands-on experience with the tools and practices highlighted "
33
+ f"in your description, including {', '.join(jd_keywords[:8])}."
34
+ ),
35
+ (
36
+ "I am particularly interested in this opportunity because it aligns with my background and career goals. "
37
+ "I value impact, ownership, and collaboration."
38
+ ),
39
+ ]
40
+ closing = "Thank you for your time and consideration."
41
+ signature = profile.full_name
42
+
43
+ base_text = seed_text.strip() if seed_text else None
44
+ draft = base_text or basic_cover_letter_template(greeting, body, closing, signature)
45
+ if base_text and len(base_text) > 1500:
46
+ bullets = distill_text(base_text, max_points=10)
47
+ draft = ("\n".join(f"- {b}" for b in bullets) + "\n\n") + draft[:3000]
48
+
49
+ guidance = get_role_guidelines(job.title, job.description)
50
+ humor_notes = cover_letter_inspiration_from_url(inspiration_url) if inspiration_url else ""
51
+ used_keywords: List[str] = []
52
+
53
+ # Detect low overlap between profile and JD keywords to hint a career pivot narrative
54
+ overlap_count = sum(1 for k in jd_keywords if k.lower() in allowed)
55
+ overlap_ratio = overlap_count / max(1, len(jd_keywords[:15]))
56
+ career_change_hint = overlap_ratio < 0.25
57
+
58
+ # Prepare transferable skills (top profile skills), and pull 1-2 achievements across experiences
59
+ transferable_skills = profile.skills[:6] if profile.skills else []
60
+ sample_achievements: List[str] = []
61
+ for e in profile.experiences:
62
+ if e.achievements:
63
+ for a in e.achievements:
64
+ if a and len(sample_achievements) < 2:
65
+ sample_achievements.append(a.strip())
66
+
67
+ for cycle in range(3):
68
+ new_mentions = []
69
+ for kw in jd_keywords[:12]:
70
+ if kw.lower() in allowed and kw.lower() not in draft.lower():
71
+ new_mentions.append(kw)
72
+ if new_mentions:
73
+ draft = draft.rstrip() + "\n\nRelevant focus: " + ", ".join(new_mentions[:8]) + "\n"
74
+ used_keywords = list({*used_keywords, *new_mentions[:8]})
75
+
76
+ if llm.enabled:
77
+ system = (
78
+ "You refine cover letters. Preserve factual accuracy. Be concise (<= 1 page). "
79
+ "Keep ATS-friendly text; avoid flowery language. "
80
+ f"Apply latest guidance: {guidance}. "
81
+ "Emphasize transferable skills and a positive pivot narrative when the candidate is changing careers. "
82
+ "Structure: concise hook; 1–2 quantified achievements (STAR compressed); alignment to role/company; clear close/CTA. "
83
+ "Use active voice and strong action verbs; avoid clichés/buzzwords. UK English. Use digits for numbers and £ for currency. "
84
+ )
85
+ humor = f"\nInspiration guideline (do not copy text): {humor_notes}" if humor_notes else ""
86
+ notes = (f"\nNotes from Agent 2: {agent2_notes}" if agent2_notes else "")
87
+ custom = f"\nUser instructions: {user_chat}" if user_chat else ""
88
+ pivot = "\nCareer change: true — highlight transferable skills and motivation for the pivot." if career_change_hint else ""
89
+ examples = ("\nAchievements to consider: " + "; ".join(sample_achievements)) if sample_achievements else ""
90
+ tskills = ("\nTransferable skills: " + ", ".join(transferable_skills)) if transferable_skills else ""
91
+ user = (
92
+ f"Role: {job.title}. Company: {job.company}.\n"
93
+ f"Job keywords: {', '.join(jd_keywords[:20])}.\n"
94
+ f"Allowed keywords (from user profile): {', '.join(sorted(list(allowed))[:40])}.\n"
95
+ f"Rewrite the following cover letter to strengthen alignment without inventing new skills.{custom}{notes}{humor}{pivot}{examples}{tskills}\n"
96
+ f"Keep within {self.max_chars} characters.\n\n"
97
+ f"Cover letter content:\n{draft}"
98
+ )
99
+ draft = llm.generate(system, user, max_tokens=800, agent="cover")
100
+
101
+ # Simple buzzword scrub
102
+ lower = draft.lower()
103
+ for bad in [
104
+ "results-driven", "team player", "works well alone", "people person",
105
+ "perfectionist", "multi-tasker", "multi tasker", "dynamic go-getter",
106
+ ]:
107
+ if bad in lower:
108
+ draft = re.sub(re.escape(bad), "", draft, flags=re.IGNORECASE)
109
+ lower = draft.lower()
110
+ # Strengthen weak openers
111
+ draft = strengthen_action_verbs(draft)
112
+ # Normalise £/% hints
113
+ draft = draft.replace("GBP", "£")
114
+ draft = re.sub(r"\bpercent\b", "%", draft, flags=re.IGNORECASE)
115
+
116
+ cov = coverage_score(draft, jd_keywords)
117
+ conc = conciseness_score(draft, self.max_chars)
118
+ if conc < 1.0:
119
+ draft = clamp_to_char_limit(draft, self.max_chars)
120
+
121
+ memory_store.save(user_id, self.name, {
122
+ "job_id": job.id,
123
+ "cycle": cycle + 1,
124
+ "coverage": cov,
125
+ "conciseness": conc,
126
+ "keywords_used": used_keywords,
127
+ "guidance": guidance[:500],
128
+ "user_chat": (user_chat or "")[:500],
129
+ "agent2_notes": (agent2_notes or "")[:500],
130
+ "inspiration_url": inspiration_url or "",
131
+ "draft": draft,
132
+ "career_change_hint": career_change_hint,
133
+ }, job_id=job.id)
134
+
135
+ draft = clamp_to_char_limit(draft, self.max_chars)
136
+ memory_store.save(user_id, self.name, {
137
+ "job_id": job.id,
138
+ "final": True,
139
+ "keywords_used": used_keywords,
140
+ "draft": draft,
141
+ }, job_id=job.id)
142
+
143
+ return CoverLetterDraft(job_id=job.id, text=draft, keywords_used=used_keywords[:12])
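The career-pivot detection in `create_cover_letter` boils down to a keyword-overlap ratio. A self-contained sketch (function name is hypothetical; the 0.25 threshold and the cap of the denominator at the first 15 keywords are copied from the agent above):

```python
from typing import List, Set

def is_career_change(jd_keywords: List[str], allowed: Set[str],
                     threshold: float = 0.25) -> bool:
    # Count JD keywords the profile can truthfully claim...
    overlap = sum(1 for k in jd_keywords if k.lower() in allowed)
    # ...then, as in the agent, normalise against at most 15 keywords.
    ratio = overlap / max(1, len(jd_keywords[:15]))
    return ratio < threshold
```

Note the asymmetry inherited from the original: the numerator counts matches across all JD keywords while the denominator is capped at 15, so very long keyword lists can push the ratio above 1.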
agents/cv_owner.py ADDED
@@ -0,0 +1,441 @@
1
+ from __future__ import annotations
2
+ from typing import List, Optional
3
+ import logging
4
+ import re
5
+ import textwrap
6
+ from datetime import datetime
7
+
8
+ from models.schemas import UserProfile, JobPosting, ResumeDraft
9
+ from memory.store import memory_store
10
+ from utils.text import extract_keywords_from_text, clamp_to_char_limit
11
+ from utils.ats import (
12
+ format_resume_header,
13
+ format_experience_section,
14
+ format_skills_section,
15
+ basic_resume_template,
16
+ ensure_keywords,
17
+ ACTION_VERBS,
18
+ strengthen_action_verbs,
19
+ )
20
+ from utils.consistency import allowed_keywords_from_profile, coverage_score, conciseness_score
21
+ from utils.config import AgentConfig, LLMConfig
22
+ from services.web_research import get_role_guidelines
23
+ from services.llm import llm
24
+ from utils.langextractor import distill_text
25
+ try:
26
+ from utils.langextractor_enhanced import extract_structured_info, extract_ats_keywords
27
+ ENHANCED_EXTRACTION = True
28
+ except ImportError:
29
+ ENHANCED_EXTRACTION = False
30
+
31
+ logger = logging.getLogger(__name__)
32
+
33
+
34
+ def _clamp_words(text: str, max_words: int) -> str:
35
+ if not text:
36
+ return ""
37
+ words = text.strip().split()
38
+ if len(words) <= max_words:
39
+ return text.strip()
40
+ return " ".join(words[:max_words]).strip()
41
+
42
+
43
+ def _extract_year(s: Optional[str]) -> Optional[int]:
44
+ if not s:
45
+ return None
46
+ m = re.search(r"(19|20)\d{2}", s)
47
+ return int(m.group(0)) if m else None
48
+
49
+
50
+ def _uk_month_name(m: int) -> str:
51
+ return ["", "Jan", "Feb", "Mar", "Apr", "May", "Jun", "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"][max(0, min(12, m))]
52
+
53
+
54
+ def _uk_date_str(s: Optional[str]) -> Optional[str]:
55
+ if not s:
56
+ return None
57
+ ss = s.strip()
58
+ if ss.lower() == "present":
59
+ return "Present"
60
+ # YYYY-MM or YYYY/M or YYYY/MM
61
+ m = re.match(r"^(\d{4})[-/](\d{1,2})$", ss)
62
+ if m:
63
+ y = int(m.group(1)); mo = int(m.group(2))
64
+ return f"{_uk_month_name(mo)} {y}"
65
+ # MM/YYYY
66
+ m = re.match(r"^(\d{1,2})/(\d{4})$", ss)
67
+ if m:
68
+ mo = int(m.group(1)); y = int(m.group(2))
69
+ return f"{_uk_month_name(mo)} {y}"
70
+ # YYYY only
71
+ m = re.match(r"^(\d{4})$", ss)
72
+ if m:
73
+ return m.group(1)
74
+ return ss
75
+
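The date normalisation above handles `YYYY-MM`, `MM/YYYY`, bare `YYYY`, and the literal `Present`. The numeric cases can be exercised in isolation; this sketch re-implements them so the snippet stands alone (it deliberately omits the `Present` special case handled in `_uk_date_str`):

```python
import re

# UK-style month abbreviations, 1-indexed as in _uk_month_name above
MONTHS = ["", "Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]

def uk_date(s: str) -> str:
    m = re.match(r"^(\d{4})[-/](\d{1,2})$", s)   # YYYY-MM or YYYY/M
    if m:
        return f"{MONTHS[int(m.group(2))]} {m.group(1)}"
    m = re.match(r"^(\d{1,2})/(\d{4})$", s)      # MM/YYYY
    if m:
        return f"{MONTHS[int(m.group(1))]} {m.group(2)}"
    return s                                      # YYYY or anything else: pass through
```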
76
+
77
+ def _postprocess_bullets(text: str) -> str:
78
+ if not text:
79
+ return text
80
+ lines = []
81
+ for line in text.splitlines():
82
+ newline = line
83
+ if newline.lstrip().startswith("-"):
84
+ # Remove first-person pronouns at bullet start
85
+ newline = re.sub(r"^(\s*-\s*)(?:I|We|My)\s+", r"\1", newline, flags=re.IGNORECASE)
86
+ # Remove trailing period
87
+ newline = re.sub(r"\.(\s*)$", r"\1", newline)
88
+ # Normalise percent and GBP
89
+ newline = re.sub(r"\bper\s*cent\b", "%", newline, flags=re.IGNORECASE)
90
+ newline = re.sub(r"\bpercent\b", "%", newline, flags=re.IGNORECASE)
91
+ newline = newline.replace("GBP", "£")
92
+ lines.append(newline)
93
+ return "\n".join(lines)
94
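The pronoun clean-up in `_postprocess_bullets` relies on a single anchored regex; shown standalone below (helper name is hypothetical, pattern copied verbatim from the function above):

```python
import re

def strip_bullet_pronoun(line: str) -> str:
    # Drop a leading first-person pronoun (I/We/My) right after the
    # bullet dash; words merely starting with those letters are untouched.
    return re.sub(r"^(\s*-\s*)(?:I|We|My)\s+", r"\1", line,
                  flags=re.IGNORECASE)
```

Because the pronoun must be followed by whitespace, bullets such as "- Improved uptime" are left alone even though "Improved" begins with "I".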
+
95
+ def _strip_personal_info(text: str) -> str:
96
+ if not text:
97
+ return text
98
+ # Remove DOB lines and photo references
99
+ text = re.sub(r"^.*\b(date of birth|dob)\b.*$", "", text, flags=re.IGNORECASE | re.MULTILINE)
100
+ text = re.sub(r"^.*\b(photo|headshot)\b.*$", "", text, flags=re.IGNORECASE | re.MULTILINE)
101
+ # Clean extra blank lines
102
+ text = re.sub(r"\n{3,}", "\n\n", text)
103
+ return text.strip() + "\n"
104
+
105
+
106
+ class CVOwnerAgent:
107
+ def __init__(self) -> None:
108
+ self.name = "cv_owner"
109
+ self.max_chars = AgentConfig.RESUME_MAX_CHARS
110
+
111
+ def create_resume(
112
+ self,
113
+ profile: UserProfile,
114
+ job: JobPosting,
115
+ user_id: str = "default_user",
116
+ user_chat: Optional[str] = None,
117
+ seed_text: Optional[str] = None,
118
+ agent2_notes: Optional[str] = None,
119
+ layout_preset: Optional[str] = None,
120
+ ) -> ResumeDraft:
121
+ """Create an optimized resume for a specific job posting."""
122
+ jd_keywords: List[str] = extract_keywords_from_text(
123
+ job.description or "",
124
+ top_k=AgentConfig.JOB_KEYWORDS_COUNT
125
+ )
126
+ allowed = allowed_keywords_from_profile(profile.skills, profile.experiences)
127
+
128
+ # Format resume sections
129
+ header = format_resume_header(
130
+ full_name=profile.full_name,
131
+ headline=profile.headline or job.title,
132
+ email=profile.email,
133
+ phone=profile.phone,
134
+ location=profile.location,
135
+ links=profile.links,
136
+ )
137
+
138
+ # Sort experiences reverse-chronologically (Reed/Indeed best practice)
139
+ def _date_key(s: Optional[str]) -> str:
140
+ val = (s or "").strip()
141
+ if not val or val.lower() == "present":
142
+ return "9999-12-31"
143
+ return val
144
+ experiences_sorted = sorted(
145
+ profile.experiences,
146
+ key=lambda e: (_date_key(e.end_date), _date_key(e.start_date)),
147
+ reverse=True,
148
+ )
149
+ # Compute simple gap signal based on years between adjacent roles
150
+ gap_years_flag = False
151
+ for i in range(len(experiences_sorted) - 1):
152
+ end_y = _extract_year(experiences_sorted[i].end_date or "Present") or 9999
153
+ start_next_y = _extract_year(experiences_sorted[i + 1].start_date)
154
+ if start_next_y and end_y != 9999 and (start_next_y - end_y) >= 2:
155
+ gap_years_flag = True
156
+ break
157
+ # Limit achievements depth: recent roles get more bullets, older roles compressed
158
+ current_year = datetime.now().year
159
+ experience_payload = []
160
+ for idx, e in enumerate(experiences_sorted):
161
+ ach = e.achievements or []
162
+ # Compress if older than 15 years
163
+ start_y = _extract_year(e.start_date or "")
164
+ older = bool(start_y and (current_year - start_y > 15))
165
+ if idx < 2 and not older:
166
+ limited = ach[:6]
167
+ else:
168
+ limited = [] if older else ach[:1]
169
+ experience_payload.append({
170
+ "title": e.title,
171
+ "company": e.company,
172
+ "start_date": _uk_date_str(e.start_date) or e.start_date,
173
+ "end_date": _uk_date_str(e.end_date) or ("Present" if (e.end_date or "").lower()=="present" else (e.end_date or "")),
174
+ "achievements": limited,
175
+ })
176
+ experience = format_experience_section(experience_payload)
177
+ skills = format_skills_section(profile.skills)
178
+
179
+ # Personal statement (Summary) refinement (~150 words), tailored to job
180
+ summary_text = profile.summary or ""
181
+ if summary_text:
182
+ if llm.enabled:
183
+ sys_ps = (
184
+ "You write CV personal statements (Summary) for UK job applications. Keep to ~150 words (100–180). "
185
+ "Use active voice and clear, specific language; avoid clichés/buzzwords; no personal info. "
186
+ "Structure: 1) who you are/pro background; 2) key skills + 1–2 quantified achievements relevant to the role; "
187
+ "3) concise career goal aligned to the target role/company. Tailor to the job's keywords."
188
+ )
189
+ usr_ps = (
190
+ f"Target role: {job.title} at {job.company}\n"
191
+ f"Job keywords: {', '.join(jd_keywords[:15])}\n\n"
192
+ f"Existing summary (edit and improve):\n{summary_text}"
193
+ )
194
+ summary_text = llm.generate(sys_ps, usr_ps, max_tokens=220, agent="cv")
195
+ summary_text = _clamp_words(summary_text, 180)
196
+ # Ensure critical JD keywords appear in summary (top 3)
197
+ try:
198
+ needed = []
199
+ low = (summary_text or "").lower()
200
+ for k in jd_keywords[:6]:
201
+ if k and (k.lower() not in low) and len(needed) < 3:
202
+ needed.append(k)
203
+ if needed:
204
+ summary_text = (summary_text or "").strip() + " " + ("Key strengths: " + ", ".join(needed) + ".")
205
+ except Exception:
206
+ pass
207
+ else:
208
+ # No summary provided: keep empty to avoid adding new sections implicitly
209
+ summary_text = ""
210
+
211
+ education_text = "\n".join(
212
+ [f"{ed.degree or ''} {ed.field_of_study or ''} — {ed.school} ({ed.end_date or ''})"
213
+ for ed in profile.education]
214
+ ).strip()
215
+
216
+ # Process seed text if provided
217
+ base_text = seed_text.strip() if seed_text else None
218
+ if base_text and len(base_text) > 2000:
219
+ # Distill dense seed into key points to guide the draft
220
+ bullets = distill_text(base_text, max_points=AgentConfig.DISTILL_MAX_POINTS)
221
+ base_text = ("\n".join(f"- {b}" for b in bullets) + "\n\n") + base_text[:4000]
222
+
223
+ # Compose initial draft by layout preset (ATS-friendly, single column)
224
+ preset = (layout_preset or "").strip().lower()
225
+ preset = {
226
+ "traditional": "classic",
227
+ "classic": "classic",
228
+ "modern": "modern",
229
+ "minimalist": "minimalist",
230
+ "executive": "executive",
231
+ }.get(preset, "")
232
+ def sec_summary(s: str) -> str:
233
+ return ("\nSummary\n" + textwrap.fill(s, width=100)) if s else ""
234
+ def sec_skills(sk: str) -> str:
235
+ return ("\n" + sk) if sk else ""
236
+ def sec_experience(ex: str) -> str:
237
+ return ("\n\nExperience\n" + ex) if ex else ""
238
+ def sec_education(ed: str) -> str:
239
+ return ("\n\nEducation\n" + ed) if ed else ""
240
+ def sec_languages() -> str:
241
+ langs = getattr(profile, "languages", []) or []
242
+ pairs = []
243
+ for it in langs[:8]:
244
+ if isinstance(it, dict):
245
+ name = it.get("language") or it.get("name") or ""
246
+ lvl = it.get("level") or ""
247
+ if name:
248
+ pairs.append(f"{name}{' ('+lvl+')' if lvl else ''}")
249
+ return ("\n\nLanguages\n- " + "\n- ".join(pairs)) if pairs else ""
250
+ def sec_certs() -> str:
251
+ certs = getattr(profile, "certifications", []) or []
252
+ lines = []
253
+ for c in certs[:6]:
254
+ if isinstance(c, dict):
255
+ name = c.get("name") or ""
256
+ issuer = c.get("issuer") or ""
257
+ year = c.get("year") or ""
258
+ if name:
259
+ parts = [name]
260
+ if issuer: parts.append(issuer)
261
+ if year: parts.append(str(year))
262
+ lines.append(" β€” ".join(parts))
263
+ return ("\n\nCertifications\n- " + "\n- ".join(lines)) if lines else ""
264
+ def sec_projects() -> str:
265
+ projs = getattr(profile, "projects", []) or []
266
+ lines = []
267
+ for p in projs[:4]:
268
+ if isinstance(p, dict):
269
+ title = p.get("title") or ""
270
+ link = p.get("link") or ""
271
+ impact = p.get("impact") or ""
272
+ if title or impact:
273
+ line = title
274
+ if link: line += f" — {link}"
275
+ if impact: line += f" — {impact}"
276
+ lines.append(line)
277
+ return ("\n\nSelected Projects\n- " + "\n- ".join(lines)) if lines else ""
278
+ def sec_achievements() -> str:
279
+ bul = []
280
+ for e in experiences_sorted[:2]:
281
+ for a in (e.achievements or []):
282
+ if a and len(bul) < 5:
283
+ bul.append(a)
284
+ return ("\n\nSelected Achievements\n- " + "\n- ".join(bul)) if bul else ""
285
+
286
+ if base_text:
287
+ draft = base_text
288
+ elif preset == "classic":
289
+ parts: List[str] = [header, sec_summary(summary_text), sec_skills(skills), sec_experience(experience), sec_education(education_text), sec_certs(), sec_languages()]
290
+ draft = "".join(parts).strip() + "\n"
291
+ elif preset == "modern":
292
+ parts = [header, sec_summary(summary_text), sec_experience(experience), sec_skills(skills), sec_projects(), sec_certs(), sec_education(education_text)]
293
+ draft = "".join(parts).strip() + "\n"
294
+ elif preset == "minimalist":
295
+ parts = [header, sec_summary(summary_text), sec_skills(skills), sec_experience(experience), sec_education(education_text)]
296
+ draft = "".join(parts).strip() + "\n"
297
+ elif preset == "executive":
298
+ parts = [header, sec_summary(summary_text), sec_achievements(), sec_experience(experience), sec_skills(skills), sec_education(education_text), sec_certs()]
299
+ draft = "".join(parts).strip() + "\n"
300
+ else:
301
+ # Default formatting
302
+ draft = basic_resume_template(
303
+ header=header,
304
+ summary=(summary_text or None),
305
+ skills=skills,
306
+ experience=experience,
307
+ education=education_text,
308
+ )
309
+ # If profile.skill_proficiency exists, append a simple proficiency hint line under Skills (ATS-safe)
310
+ try:
311
+ # naive inject: if "Skills:" line exists, add a second line with proficiencies
314
+ if getattr(profile, "skills", None):
315
+ prof_map = getattr(profile, "skill_proficiency", {}) or {}
316
+ if prof_map:
317
+ profs = ", ".join([f"{k}: {v}" for k, v in list(prof_map.items())[:8]])
318
+ if "\nSkills:" in draft:
319
+ parts = draft.split("\nSkills:")
320
+ draft = parts[0] + "\nSkills:" + parts[1].split("\n", 1)[0] + ("\n" + profs) + "\n" + (parts[1].split("\n", 1)[1] if "\n" in parts[1] else "")
321
+ except Exception:
322
+ pass
323
+
324
+ guidance = get_role_guidelines(job.title, job.description)
325
+ used_keywords: List[str] = []
326
+
327
+ # Optimization cycles
328
+ for cycle in range(AgentConfig.OPTIMIZATION_CYCLES):
329
+ draft, used_cycle = ensure_keywords(
330
+ draft,
331
+ jd_keywords,
332
+ max_new=AgentConfig.MAX_NEW_KEYWORDS,
333
+ allowed_keywords=allowed
334
+ )
335
+ used_keywords = list({*used_keywords, *used_cycle})
336
+
337
+ if llm.enabled:
338
+ system = (
339
+ "You refine resumes. Preserve factual accuracy. Keep ATS-friendly text-only formatting. "
340
+ "Follow UK best practices (Indeed/Reed/StandOut/Novorésumé): keep concise (prefer 1 page; <= 2 pages for senior roles), use clear section headings. "
341
+ "Present work experience in reverse chronological order, highlight recent quantified achievements, and keep older roles brief. "
342
+ "Use bullet points for skimmability, maintain consistent spacing and layout, avoid irrelevant info. Do not add images/tables or unusual symbols. "
343
+ "Tailor to the job's keywords. Prefer quantification where truthful (%, £, time, team size); never fabricate metrics. "
344
+ "AVOID vague buzzwords (e.g., 'results-driven', 'team player', 'people person', 'perfectionist', 'multi-tasker'). Replace with specific, measurable achievements. "
345
+ "Use active voice and strong action verbs (e.g., Achieved, Led, Implemented, Improved, Generated, Managed, Completed, Designed). "
346
+ "Skills: when possible, separate Hard skills vs Soft skills (hard skills first, max ~10), then soft skills. Keep Education concise (highest/most recent first). "
347
+ "Contact hygiene: prefer professional email; include relevant links (e.g., LinkedIn/portfolio) if provided; never include DOB or photos. "
348
+ "If a 'Summary'/'Personal Statement' section exists, keep it ~150 words with the intro–skills/achievements–goal structure; do not add new sections. "
349
+ "UK English, UK date style (MMM YYYY). Use present tense for the current role and past tense for previous roles. Remove first-person pronouns in bullets. "
350
+ "Use digits for numbers (e.g., 7, 12%, Β£1,200). Include critical JD keywords verbatim inside bullets (not only in Skills). "
351
+ f"Apply latest guidance: {guidance}."
352
+ )
353
+ notes = (f"\nNotes from Agent 2: {agent2_notes}" if agent2_notes else "")
354
+ custom = f"\nUser instructions: {user_chat}" if user_chat else ""
355
+ user = (
356
+ f"Role: {job.title}. Company: {job.company}.\n"
357
+ f"Job keywords: {', '.join(jd_keywords[:AgentConfig.RESUME_KEYWORDS_COUNT])}.\n"
358
+ f"Allowed keywords (from user profile): {', '.join(sorted(list(allowed))[:40])}.\n"
359
+ f"Rewrite the following resume content to strengthen alignment without inventing new skills.{custom}{notes}\n"
360
+ f"Enforce reverse chronological experience ordering, bullet points, and consistent headings. Keep within {self.max_chars} characters.\n\n"
361
+ f"Resume content:\n{draft}"
362
+ )
363
+ draft = llm.generate(system, user, max_tokens=LLMConfig.RESUME_MAX_TOKENS, agent="cv")
364
+
365
+ # Simple buzzword scrub per Reed guidance
366
+ lower = draft.lower()
367
+ for bad in [
368
+ "results-driven", "team player", "works well alone", "people person",
369
+ "perfectionist", "multi-tasker", "multi tasker", "dynamic go-getter",
370
+ ]:
371
+ if bad in lower:
372
+ # Replace phrase occurrences with an empty string; rely on achievements to convey value
373
+ draft = draft.replace(bad, "")
374
+ lower = draft.lower()
375
+ # Strengthen weak bullet openers to action verbs (The Muse)
376
+ draft = strengthen_action_verbs(draft)
377
+ # ATS plain-text scrub: remove tabs and unusual symbols
378
+ draft = draft.replace("\t", " ")
379
+ # Pronoun/punctuation/currency/percent normalisation
380
+ draft = _postprocess_bullets(draft)
381
+ # Strip DOB/photo lines if present
382
+ draft = _strip_personal_info(draft)
383
+
384
+ cov = coverage_score(draft, jd_keywords)
385
+ conc = conciseness_score(draft, self.max_chars)
386
+
387
+ if conc < 1.0:
388
+ draft = clamp_to_char_limit(draft, self.max_chars)
389
+
390
+ # Signals for orchestrator/observability (StandOut CV + NovorΓ©sumΓ©)
391
+ bullet_lines = sum(1 for l in (draft or "").splitlines() if l.strip().startswith("-"))
392
+ line_count = max(1, len((draft or "").splitlines()))
393
+ bullet_density = round(bullet_lines / line_count, 3)
394
+ quant_count = sum(1 for ch in (draft or "") if ch.isdigit()) + (draft or "").count('%') + (draft or "").count('Β£')
395
+ email_ok = bool(re.match(r"^[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}$", profile.email or ""))
396
+ links_present = ("http://" in (draft or "").lower()) or ("https://" in (draft or "").lower()) or ("linkedin" in (draft or "").lower())
397
+ skills_split_hint = ("hard skills" in (draft or "").lower()) or ("soft skills" in (draft or "").lower())
398
+ languages_section = "\nlanguages" in (draft or "").lower()
399
+ action_verb_count = sum(1 for v in ACTION_VERBS if v.lower() in (draft or "").lower())
400
+ approx_pages = round(max(1, len(draft or "")) / 2400.0, 2)
401
+ approx_one_page = approx_pages <= 1.2
402
+
403
+ memory_store.save(user_id, self.name, {
404
+ "job_id": job.id,
405
+ "cycle": cycle + 1,
406
+ "coverage": cov,
407
+ "conciseness": conc,
408
+ "keywords_used": used_keywords,
409
+ "guidance": guidance[:500],
410
+ "user_chat": (user_chat or "")[:500],
411
+ "agent2_notes": (agent2_notes or "")[:500],
412
+ "draft": draft,
413
+ "signals": {
414
+ "bullet_density": bullet_density,
415
+ "quant_count": quant_count,
416
+ "email_ok": email_ok,
417
+ "gap_years_flag": gap_years_flag,
418
+ "skills_split_hint": skills_split_hint,
419
+ "languages_section": languages_section,
420
+ "links_present": links_present,
421
+ "action_verb_count": action_verb_count,
422
+ "approx_pages": approx_pages,
423
+ "approx_one_page": approx_one_page,
424
+ },
425
+ }, job_id=job.id)
426
+
427
+ logger.debug(f"Resume optimization cycle {cycle + 1}: coverage={cov:.2f}, conciseness={conc:.2f}")
428
+
429
+ # Final cleanup
430
+ draft = clamp_to_char_limit(draft, self.max_chars)
431
+
432
+ memory_store.save(user_id, self.name, {
433
+ "job_id": job.id,
434
+ "final": True,
435
+ "keywords_used": used_keywords,
436
+ "draft": draft,
437
+ }, job_id=job.id)
438
+
439
+ logger.info(f"Resume created for job {job.id} with {len(used_keywords)} keywords")
440
+
441
+ return ResumeDraft(job_id=job.id, text=draft, keywords_used=used_keywords)
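The per-cycle "signals" above are plain string statistics over the draft. A minimal self-contained sketch of that computation (the helper name `resume_signals` and the ~2400 chars/page heuristic are taken from the code above; this is not the repo's public API):

```python
def resume_signals(draft: str) -> dict:
    """Compute the bullet/quantification/page signals used for observability."""
    lines = draft.splitlines() or [""]
    bullet_lines = sum(1 for l in lines if l.strip().startswith("-"))
    # digits plus % and Β£ occurrences approximate "quantified achievement" density
    quant_count = sum(ch.isdigit() for ch in draft) + draft.count("%") + draft.count("Β£")
    return {
        "bullet_density": round(bullet_lines / max(1, len(lines)), 3),
        "quant_count": quant_count,
        "approx_pages": round(max(1, len(draft)) / 2400.0, 2),  # ~2400 chars per page
    }

sig = resume_signals("SUMMARY\n- Led team of 7\n- Cut costs 12%\n")
# e.g. bullet_density 0.667 (2 of 3 lines), quant_count 4 (digits 7, 1, 2 plus one %)
```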
agents/guidelines.py ADDED
@@ -0,0 +1,257 @@
+ from __future__ import annotations
+ from dataclasses import dataclass
+ from typing import Callable, Dict, Any, List, Tuple
+ import re
+
+
+ @dataclass
+ class Guideline:
+     id: str
+     description: str
+     condition: Callable[[Dict[str, Any]], bool]
+     validate: Callable[[str, Dict[str, Any]], Tuple[bool, str]]
+     enforce: Callable[[str, Dict[str, Any]], str]
+
+
+ class GuidelineEngine:
+     def __init__(self, rules: List[Guideline]) -> None:
+         self.rules = rules
+
+     def check_and_enforce(self, text: str, ctx: Dict[str, Any]) -> Tuple[str, List[str], List[str]]:
+         matched: List[str] = []
+         fixed: List[str] = []
+         out = text or ""
+         for g in self.rules:
+             try:
+                 if not g.condition(ctx):
+                     continue
+                 matched.append(g.id)
+                 ok, _ = g.validate(out, ctx)
+                 if not ok:
+                     out = g.enforce(out, ctx)
+                     fixed.append(g.id)
+             except Exception:
+                 # fail-safe: a broken rule must not block generation
+                 continue
+         return out, matched, fixed
+
+
+ # ---------- Helpers ----------
+
+ _BUZZWORDS = [
+     "results-driven", "team player", "people person", "perfectionist",
+     "multi-tasker", "multi tasker", "dynamic go-getter", "rockstar",
+     "guru", "ninja",
+ ]
+
+ _WEAK_OPENERS = [
+     (re.compile(r"^\s*[-β€’]\s*responsible for\s+", re.I), "- Led "),
+     (re.compile(r"^\s*[-β€’]\s*tasked with\s+", re.I), "- Executed "),
+     (re.compile(r"^\s*[-β€’]\s*worked on\s+", re.I), "- Delivered "),
+     (re.compile(r"^\s*[-β€’]\s*helped\s+", re.I), "- Supported "),
+     (re.compile(r"^\s*[-β€’]\s*assisted with\s+", re.I), "- Supported "),
+     (re.compile(r"^\s*[-β€’]\s*handled\s+", re.I), "- Managed "),
+ ]
+
+
+ def _enforce_exact_length(text: str, target_len: int) -> str:
+     if target_len <= 0:
+         return text or ""
+     txt = (text or "")
+     if len(txt) == target_len:
+         return txt
+     if len(txt) > target_len:
+         return txt[:target_len]
+     return txt + (" " * (target_len - len(txt)))
+
+
+ def _ensure_headings(text: str) -> str:
+     """Ensure key headings exist: SUMMARY, SKILLS, EXPERIENCE, EDUCATION."""
+     out = text or ""
+     low = out.lower()
+     for h in ["SUMMARY", "SKILLS", "EXPERIENCE", "EDUCATION"]:
+         if h.lower() not in low:
+             out = (out + f"\n\n{h}\n").strip()
+     return out
+
+
+ def _strip_tabs(text: str) -> str:
+     return (text or "").replace("\t", " ")
+
+
+ def _scrub_buzzwords(text: str) -> str:
+     out = text or ""
+     low = out.lower()
+     for bw in _BUZZWORDS:
+         if bw in low:
+             out = re.sub(re.escape(bw), "", out, flags=re.I)
+     return out
+
+
+ def _strengthen_action_verbs(text: str) -> str:
+     lines = (text or "").splitlines()
+     fixed: List[str] = []
+     for ln in lines:
+         new_ln = ln
+         for pat, repl in _WEAK_OPENERS:
+             if pat.search(new_ln):
+                 new_ln = pat.sub(repl, new_ln)
+                 break
+         fixed.append(new_ln)
+     return "\n".join(fixed)
+
+
+ def _remove_first_person(text: str) -> str:
+     # Remove leading "I " / "My " / "We " in bullets only
+     lines = (text or "").splitlines()
+     out: List[str] = []
+     for ln in lines:
+         m = re.match(r"^\s*[-β€’]\s*(i|my|we)\b", ln, flags=re.I)
+         if m:
+             ln = re.sub(r"^\s*([-β€’]\s*)(i|my|we)\b\s*", r"\1", ln, flags=re.I)
+         out.append(ln)
+     return "\n".join(out)
+
+
+ def _ats_plain_text(text: str) -> str:
+     # normalize bullets and strip odd symbols
+     out = _strip_tabs(text)
+     out = out.replace("β€’\t", "- ").replace("β€’ ", "- ")
+     out = re.sub(r"[β– β–ͺβ—¦β—β—‹βœ”βœ¦β™¦]", "-", out)
+     return out
+
+
+ def _enforce_uk_habits(text: str) -> str:
+     # normalize currency symbol spacing and percentages
+     out = re.sub(r"\s*Β£\s*", " Β£", text or "")
+     out = re.sub(r"\s*%\s*", "%", out)
+     return out
+
+
+ def _allowed_skills_from_profile(ctx: Dict[str, Any]) -> List[str]:
+     p = (ctx.get("profile_text") or "").lower()
+     # naive split of alphanumeric skill-like tokens, order-preserving dedup
+     cands = re.findall(r"[a-zA-Z][a-zA-Z0-9+_.#-]{2,}", p)
+     return list(dict.fromkeys(c.lower() for c in cands))
+
+
+ def _no_invented_skills(text: str, ctx: Dict[str, Any]) -> Tuple[bool, str]:
+     allowed = set(_allowed_skills_from_profile(ctx))
+     if not allowed:
+         return True, "no baseline"
+     skills_block = re.search(r"(?is)\n\s*(skills|core skills)[\s:]*\n(.+?)(\n\n|$)", text or "")
+     if not skills_block:
+         return True, "no skills block"
+     block = skills_block.group(0)
+     found = re.findall(r"[A-Za-z][A-Za-z0-9+_.#-]{2,}", block)
+     for f in found:
+         if f.lower() not in allowed:
+             return False, f
+     return True, "ok"
+
+
+ # ---------- Rule sets ----------
+
+ def build_resume_rules() -> List[Guideline]:
+     return [
+         Guideline(
+             id="exact_length",
+             description="Enforce exact target length when provided",
+             condition=lambda ctx: bool(ctx.get("target_len")),
+             validate=lambda txt, ctx: (len(txt or "") == int(ctx.get("target_len", 0)), "len"),
+             enforce=lambda txt, ctx: _enforce_exact_length(txt, int(ctx.get("target_len", 0))),
+         ),
+         Guideline(
+             id="headings_present",
+             description="Ensure key headings exist",
+             condition=lambda ctx: True,
+             validate=lambda txt, ctx: (all(h.lower() in (txt or "").lower() for h in ["summary", "experience", "education", "skills"]), "headings"),
+             enforce=lambda txt, ctx: _ensure_headings(txt),
+         ),
+         Guideline(
+             id="ats_plain_text",
+             description="Normalize bullets/tabs for ATS",
+             condition=lambda ctx: True,
+             validate=lambda txt, ctx: ("\t" not in (txt or ""), "tabs"),
+             enforce=lambda txt, ctx: _ats_plain_text(txt),
+         ),
+         Guideline(
+             id="buzzword_scrub",
+             description="Remove common buzzwords",
+             condition=lambda ctx: True,
+             validate=lambda txt, ctx: (not any(bw in (txt or "").lower() for bw in _BUZZWORDS), "buzz"),
+             enforce=lambda txt, ctx: _scrub_buzzwords(txt),
+         ),
+         Guideline(
+             id="verb_strengthen",
+             description="Strengthen weak bullet openers",
+             condition=lambda ctx: True,
+             validate=lambda txt, ctx: (True, "noop"),
+             enforce=lambda txt, ctx: _strengthen_action_verbs(txt),
+         ),
+         Guideline(
+             id="remove_first_person",
+             description="Remove first-person pronouns in bullets",
+             condition=lambda ctx: True,
+             validate=lambda txt, ctx: (not re.search(r"^\s*[-β€’]\s*(i|my|we)\b", txt or "", re.I | re.M), "pronouns"),
+             enforce=lambda txt, ctx: _remove_first_person(txt),
+         ),
+         Guideline(
+             id="uk_normalization",
+             description="Normalize UK currency/percent spacing",
+             condition=lambda ctx: True,
+             validate=lambda txt, ctx: (True, "noop"),
+             enforce=lambda txt, ctx: _enforce_uk_habits(txt),
+         ),
+         Guideline(
+             id="no_invented_skills",
+             description="Prevent skills not evidenced in profile",
+             condition=lambda ctx: True,
+             validate=_no_invented_skills,
+             enforce=lambda txt, ctx: txt,  # log-only to avoid false positives
+         ),
+     ]
+
+
+ def build_cover_rules() -> List[Guideline]:
+     return [
+         Guideline(
+             id="exact_length",
+             description="Enforce exact target length when provided",
+             condition=lambda ctx: bool(ctx.get("target_len")),
+             validate=lambda txt, ctx: (len(txt or "") == int(ctx.get("target_len", 0)), "len"),
+             enforce=lambda txt, ctx: _enforce_exact_length(txt, int(ctx.get("target_len", 0))),
+         ),
+         Guideline(
+             id="ats_plain_text",
+             description="Normalize bullets/tabs for ATS",
+             condition=lambda ctx: True,
+             validate=lambda txt, ctx: ("\t" not in (txt or ""), "tabs"),
+             enforce=lambda txt, ctx: _ats_plain_text(txt),
+         ),
+         Guideline(
+             id="buzzword_scrub",
+             description="Remove common buzzwords",
+             condition=lambda ctx: True,
+             validate=lambda txt, ctx: (not any(bw in (txt or "").lower() for bw in _BUZZWORDS), "buzz"),
+             enforce=lambda txt, ctx: _scrub_buzzwords(txt),
+         ),
+     ]
+
+
+ def apply_resume_guidelines(text: str, ctx: Dict[str, Any]) -> Tuple[str, List[str], List[str]]:
+     engine = GuidelineEngine(build_resume_rules())
+     return engine.check_and_enforce(text, ctx)
+
+
+ def apply_cover_guidelines(text: str, ctx: Dict[str, Any]) -> Tuple[str, List[str], List[str]]:
+     engine = GuidelineEngine(build_cover_rules())
+     return engine.check_and_enforce(text, ctx)
agents/job_agent.py ADDED
@@ -0,0 +1,29 @@
+ from __future__ import annotations
+ from typing import Dict, Any
+ from services.llm import llm
+ import json
+
+
+ class JobAgent:
+     """Analyzes a job posting to extract structured requirements."""
+
+     def analyze(self, job_posting_text: str) -> Dict[str, Any]:
+         if not job_posting_text:
+             return {}
+         if not llm.enabled:
+             return {
+                 "company": "",
+                 "role": "",
+                 "key_requirements": [],
+                 "nice_to_have": [],
+             }
+         system = (
+             "Analyze this job posting and output JSON with fields: company, role, key_requirements (list), "
+             "nice_to_have (list), industry, employment_type, location, ats_keywords (list of top 15 keywords), "
+             "top_skills_summary (short string)."
+         )
+         resp = llm.generate(system, job_posting_text, max_tokens=700, agent="match")
+         try:
+             return json.loads(resp)
+         except Exception:
+             # LLM output was not valid JSON; surface it raw for downstream handling
+             return {"raw": resp}
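The parse-or-wrap fallback at the end of `JobAgent.analyze` is the load-bearing detail: callers always get a dict, never an exception. A self-contained sketch (the helper name `parse_llm_json` is illustrative, not the repo's API):

```python
import json

def parse_llm_json(resp: str) -> dict:
    """Try strict JSON; on failure, wrap the raw text so callers still get a dict."""
    try:
        return json.loads(resp)
    except Exception:
        return {"raw": resp}

parse_llm_json('{"company": "Acme", "role": "Engineer"}')  # structured result
parse_llm_json("Sure! Here is the analysis...")            # {"raw": "..."} fallback
```

A common failure mode this guards against is the model wrapping its JSON in markdown fences or prose, which `json.loads` rejects.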
agents/linkedin_manager.py ADDED
@@ -0,0 +1,120 @@
+ from __future__ import annotations
+ from typing import List, Optional, Dict
+ import logging
+
+ from models.schemas import JobPosting, UserProfile
+ from services.linkedin_client import LinkedInClient
+ from services.mcp_linkedin_client import mcp_linkedin_client
+ from utils.salary import estimate_salary_range
+
+ logger = logging.getLogger(__name__)
+
+
+ class LinkedInManagerAgent:
+     def __init__(self) -> None:
+         self.client = LinkedInClient()
+         self.user_profile: Optional[UserProfile] = None
+
+     def get_login_url(self) -> str:
+         return self.client.get_authorize_url()
+
+     def handle_oauth_callback(self, code: str, state: Optional[str] = None) -> bool:
+         """Handle OAuth callback with state validation."""
+         ok = self.client.exchange_code_for_token(code, state)
+         if ok:
+             self.user_profile = self.client.get_profile()
+         return ok
+
+     def get_profile(self) -> UserProfile:
+         if not self.user_profile:
+             # Try MCP first if available
+             if mcp_linkedin_client.enabled:
+                 try:
+                     import asyncio
+                     prof = asyncio.run(mcp_linkedin_client.get_profile())
+                     if prof:
+                         self.user_profile = prof
+                 except Exception:
+                     self.user_profile = None
+             if not self.user_profile:
+                 self.user_profile = self.client.get_profile()
+         return self.user_profile
+
+     def set_profile(self, profile: UserProfile) -> None:
+         """Update the stored profile with new data."""
+         self.user_profile = profile
+         logger.info(f"Profile updated: {profile.full_name}")
+
+     def update_profile_fields(self, **kwargs) -> None:
+         """Update specific profile fields."""
+         if not self.user_profile:
+             self.user_profile = UserProfile()
+
+         for key, value in kwargs.items():
+             if hasattr(self.user_profile, key):
+                 setattr(self.user_profile, key, value)
+                 logger.debug(f"Updated profile.{key}")
+
+     def get_saved_jobs(self) -> List[JobPosting]:
+         all_jobs = []
+
+         # Try MCP client first
+         if mcp_linkedin_client.enabled:
+             try:
+                 import asyncio
+                 jobs = asyncio.run(mcp_linkedin_client.get_saved_jobs())
+                 if jobs:
+                     all_jobs.extend(jobs)
+             except Exception:
+                 pass
+
+         # Try the LinkedIn API
+         linkedin_jobs = self.client.get_saved_jobs()
+         all_jobs.extend(linkedin_jobs)
+
+         # If in mock mode or few real LinkedIn jobs, supplement with job aggregators
+         if self.client.mock_mode or len(all_jobs) < 5:
+             # Try the JobSpy MCP server first (most comprehensive)
+             try:
+                 from services.jobspy_client import JobSpyClient
+                 jobspy = JobSpyClient()
+                 jobspy_jobs = jobspy.search_jobs_sync(
+                     search_term="software engineer",
+                     location="Remote",
+                     site_names="indeed,linkedin,glassdoor",
+                     results_wanted=15,
+                 )
+                 all_jobs.extend(jobspy_jobs)
+             except Exception as e:
+                 logger.info(f"JobSpy not available: {e}")
+
+             # Fall back to the basic job aggregator
+             if len(all_jobs) < 5:
+                 try:
+                     from services.job_aggregator import JobAggregator
+                     aggregator = JobAggregator()
+                     aggregated_jobs = aggregator.search_all("software engineer", "Remote")
+                     all_jobs.extend(aggregated_jobs[:10])
+                 except Exception as e:
+                     logger.info(f"Job aggregator not available: {e}")
+
+         # Deduplicate jobs by (title, company)
+         seen = set()
+         unique_jobs = []
+         for job in all_jobs:
+             key = (job.title.lower(), job.company.lower())
+             if key not in seen:
+                 seen.add(key)
+                 unique_jobs.append(job)
+
+         return unique_jobs
+
+     def get_job(self, job_id: str) -> Optional[JobPosting]:
+         return self.client.get_job_details(job_id)
+
+     def estimate_salary(self, job: JobPosting) -> Dict[str, Dict[str, int]]:
+         profile = self.get_profile()
+         industry = None
+         return estimate_salary_range(job.title, job.location, industry, profile.skills)
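The multi-source aggregation above ends with a first-wins dedup keyed on case-folded `(title, company)`. A standalone sketch with tuples standing in for `JobPosting` objects:

```python
def dedupe_jobs(jobs):
    """Keep the first occurrence of each (title, company) pair, case-insensitively."""
    seen, unique = set(), []
    for title, company in jobs:
        key = (title.lower(), company.lower())
        if key not in seen:
            seen.add(key)
            unique.append((title, company))
    return unique

dedupe_jobs([("Engineer", "Acme"), ("engineer", "ACME"), ("Analyst", "Acme")])
# the second entry collapses into the first; order of first occurrences is preserved
```

First-wins ordering matters here: MCP and LinkedIn API results are appended before aggregator results, so they survive collisions with lower-priority sources.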
agents/observability.py ADDED
@@ -0,0 +1,431 @@
1
+ """
2
+ Agent Observability and Debugging
3
+ Provides transparency into agent interactions and decision-making
4
+ Based on the OpenAI Deep Research observability pattern
5
+ """
6
+
7
+ import json
8
+ import logging
9
+ import time
10
+ from typing import Dict, List, Any, Optional
11
+ from datetime import datetime
12
+ from dataclasses import dataclass, field
13
+ from pathlib import Path
14
+ import traceback
15
+
16
+ logger = logging.getLogger(__name__)
17
+
18
+
19
+ @dataclass
20
+ class AgentEvent:
21
+ """Single event in agent execution"""
22
+ timestamp: datetime
23
+ agent_name: str
24
+ event_type: str # 'start', 'tool_call', 'reasoning', 'output', 'error', 'handoff'
25
+ data: Dict[str, Any]
26
+ duration_ms: Optional[float] = None
27
+ parent_event: Optional[str] = None
28
+
29
+ def to_dict(self) -> Dict:
30
+ return {
31
+ 'timestamp': self.timestamp.isoformat(),
32
+ 'agent_name': self.agent_name,
33
+ 'event_type': self.event_type,
34
+ 'data': self.data,
35
+ 'duration_ms': self.duration_ms,
36
+ 'parent_event': self.parent_event
37
+ }
38
+
39
+
40
+ class AgentTracer:
41
+ """
42
+ Trace and log agent interactions for debugging and monitoring
43
+ Similar to OpenAI's print_agent_interaction function
44
+ """
45
+
46
+ def __init__(self, trace_file: Optional[str] = "agent_traces.jsonl"):
47
+ self.events: List[AgentEvent] = []
48
+ self.trace_file = Path(trace_file) if trace_file else None
49
+ self.active_agents: Dict[str, float] = {} # Track active agent start times
50
+
51
+ def start_agent(self, agent_name: str, input_data: Any) -> str:
52
+ """Log agent start"""
53
+ event_id = f"{agent_name}_{int(time.time() * 1000)}"
54
+ self.active_agents[agent_name] = time.time()
55
+
56
+ event = AgentEvent(
57
+ timestamp=datetime.now(),
58
+ agent_name=agent_name,
59
+ event_type='start',
60
+ data={
61
+ 'event_id': event_id,
62
+ 'input': str(input_data)[:500] # Truncate for readability
63
+ }
64
+ )
65
+
66
+ self._log_event(event)
67
+ return event_id
68
+
69
+ def tool_call(
70
+ self,
71
+ agent_name: str,
72
+ tool_name: str,
73
+ tool_args: Dict,
74
+ result: Any = None
75
+ ):
76
+ """Log tool call"""
77
+ event = AgentEvent(
78
+ timestamp=datetime.now(),
79
+ agent_name=agent_name,
80
+ event_type='tool_call',
81
+ data={
82
+ 'tool': tool_name,
83
+ 'args': tool_args,
84
+ 'result': str(result)[:500] if result else None
85
+ }
86
+ )
87
+
88
+ self._log_event(event)
89
+
90
+ def reasoning_step(self, agent_name: str, reasoning: str):
91
+ """Log reasoning or thought process"""
92
+ event = AgentEvent(
93
+ timestamp=datetime.now(),
94
+ agent_name=agent_name,
95
+ event_type='reasoning',
96
+ data={'reasoning': reasoning}
97
+ )
98
+
99
+ self._log_event(event)
100
+
101
+ def agent_output(self, agent_name: str, output: Any):
102
+ """Log agent output"""
103
+ duration = None
104
+ if agent_name in self.active_agents:
105
+ duration = (time.time() - self.active_agents[agent_name]) * 1000
106
+ del self.active_agents[agent_name]
107
+
108
+ event = AgentEvent(
109
+ timestamp=datetime.now(),
110
+ agent_name=agent_name,
111
+ event_type='output',
112
+ data={'output': str(output)[:1000]},
113
+ duration_ms=duration
114
+ )
115
+
116
+ self._log_event(event)
117
+
118
+ def agent_handoff(
119
+ self,
120
+ from_agent: str,
121
+ to_agent: str,
122
+ handoff_data: Any
123
+ ):
124
+ """Log handoff between agents"""
125
+ event = AgentEvent(
126
+ timestamp=datetime.now(),
127
+ agent_name=from_agent,
128
+ event_type='handoff',
129
+ data={
130
+ 'to_agent': to_agent,
131
+ 'handoff_data': str(handoff_data)[:500]
132
+ }
133
+ )
134
+
135
+ self._log_event(event)
136
+
137
+ def error(self, agent_name: str, error: Exception):
138
+ """Log error"""
139
+ event = AgentEvent(
140
+ timestamp=datetime.now(),
141
+ agent_name=agent_name,
142
+ event_type='error',
143
+ data={
144
+ 'error_type': type(error).__name__,
145
+ 'error_message': str(error),
146
+ 'traceback': traceback.format_exc()
147
+ }
148
+ )
149
+
150
+ self._log_event(event)
151
+
152
+ def _log_event(self, event: AgentEvent):
153
+ """Log event to memory and file"""
154
+ self.events.append(event)
155
+
156
+ # Log to file if configured
157
+ if self.trace_file:
158
+ with open(self.trace_file, 'a') as f:
159
+ f.write(json.dumps(event.to_dict()) + '\n')
160
+
161
+ # Also log to standard logger
162
+ logger.info(f"[{event.agent_name}] {event.event_type}: {event.data}")
163
+
164
+ def print_interaction_flow(self, start_time: Optional[datetime] = None):
165
+ """
166
+ Print human-readable interaction flow
167
+ Similar to OpenAI's print_agent_interaction
168
+ """
169
+ print("\n" + "="*60)
170
+ print("AGENT INTERACTION FLOW")
171
+ print("="*60 + "\n")
172
+
173
+ filtered_events = self.events
174
+ if start_time:
175
+ filtered_events = [e for e in self.events if e.timestamp >= start_time]
176
+
177
+ for i, event in enumerate(filtered_events, 1):
178
+ prefix = f"{i:3}. [{event.timestamp.strftime('%H:%M:%S')}] {event.agent_name}"
179
+
180
+ if event.event_type == 'start':
181
+ print(f"{prefix} β†’ STARTED")
182
+ print(f" Input: {event.data.get('input', '')[:100]}...")
183
+
184
+ elif event.event_type == 'tool_call':
185
+ tool = event.data.get('tool', 'unknown')
186
+ print(f"{prefix} β†’ TOOL: {tool}")
187
+ if event.data.get('args'):
188
+ print(f" Args: {event.data['args']}")
189
+
190
+ elif event.event_type == 'reasoning':
191
+ print(f"{prefix} β†’ THINKING:")
192
+ print(f" {event.data.get('reasoning', '')[:200]}...")
193
+
194
+ elif event.event_type == 'handoff':
195
+ to_agent = event.data.get('to_agent', 'unknown')
196
+ print(f"{prefix} β†’ HANDOFF to {to_agent}")
197
+
198
+ elif event.event_type == 'output':
199
+ print(f"{prefix} β†’ OUTPUT:")
200
+ print(f" {event.data.get('output', '')[:200]}...")
201
+ if event.duration_ms:
202
+ print(f" Duration: {event.duration_ms:.0f}ms")
203
+
204
+ elif event.event_type == 'error':
205
+ print(f"{prefix} β†’ ERROR: {event.data.get('error_type', 'unknown')}")
206
+ print(f" {event.data.get('error_message', '')}")
207
+
208
+ print()
209
+
210
+ print("="*60 + "\n")
211
+
212
+ def get_metrics(self) -> Dict[str, Any]:
213
+ """Get execution metrics"""
214
+ metrics = {
215
+ 'total_events': len(self.events),
216
+ 'agents_involved': len(set(e.agent_name for e in self.events)),
217
+ 'tool_calls': len([e for e in self.events if e.event_type == 'tool_call']),
218
+ 'errors': len([e for e in self.events if e.event_type == 'error']),
219
+ 'handoffs': len([e for e in self.events if e.event_type == 'handoff']),
220
+ 'avg_duration_ms': 0
221
+ }
222
+
223
+ durations = [e.duration_ms for e in self.events if e.duration_ms]
224
+ if durations:
225
+ metrics['avg_duration_ms'] = sum(durations) / len(durations)
226
+
227
+ return metrics
228
+
229
+
230
+ class TriageAgent:
231
+ """
232
+ Triage agent that routes requests to appropriate specialized agents
233
+ Based on OpenAI's Deep Research triage pattern
234
+ """
235
+
236
+ def __init__(self, tracer: Optional[AgentTracer] = None):
237
+ self.tracer = tracer or AgentTracer()
238
+
239
+ def triage_request(self, request: str) -> Dict[str, Any]:
240
+ """
241
+ Analyze request and determine routing
242
+ """
243
+ self.tracer.start_agent("TriageAgent", request)
244
+
245
+ # Analyze request type
246
+ request_lower = request.lower()
247
+
248
+ routing = {
249
+ 'needs_clarification': False,
250
+ 'route_to': None,
251
+ 'confidence': 0.0,
252
+ 'reasoning': '',
253
+ 'suggested_agents': []
254
+ }
255
+
256
+ # Check if clarification needed
257
+ if len(request.split()) < 5 or '?' in request:
258
+ routing['needs_clarification'] = True
259
+ routing['reasoning'] = "Request is too brief or unclear"
260
+ self.tracer.reasoning_step("TriageAgent", routing['reasoning'])
261
+
262
+ # Determine routing based on keywords
263
+ if 'research' in request_lower or 'analyze' in request_lower:
264
+ routing['route_to'] = 'ResearchAgent'
265
+ routing['suggested_agents'] = ['ResearchAgent', 'WebSearchAgent']
266
+ routing['confidence'] = 0.9
267
+
268
+ elif 'resume' in request_lower or 'cv' in request_lower:
269
+ routing['route_to'] = 'CVAgent'
270
+ routing['suggested_agents'] = ['CVAgent', 'ATSOptimizer']
271
+ routing['confidence'] = 0.95
272
+
273
+ elif 'cover' in request_lower or 'letter' in request_lower:
274
+ routing['route_to'] = 'CoverLetterAgent'
275
+ routing['suggested_agents'] = ['CoverLetterAgent']
276
+ routing['confidence'] = 0.95
277
+
278
+ elif 'job' in request_lower or 'application' in request_lower:
279
+ routing['route_to'] = 'OrchestratorAgent'
280
+ routing['suggested_agents'] = ['OrchestratorAgent', 'CVAgent', 'CoverLetterAgent']
281
+ routing['confidence'] = 0.85
282
+
283
+ else:
284
+ routing['route_to'] = 'GeneralAgent'
285
+ routing['confidence'] = 0.5
286
+
287
+ self.tracer.agent_output("TriageAgent", routing)
288
+
289
+ return routing
290
+
291
+
292
+ class AgentMonitor:
293
+ """
294
+ Monitor agent performance and health
295
+ """
296
+
297
+ def __init__(self):
298
+ self.performance_stats: Dict[str, Dict] = {}
299
+ self.error_counts: Dict[str, int] = {}
300
+ self.last_errors: Dict[str, str] = {}
301
+
302
+ def record_execution(
303
+ self,
304
+ agent_name: str,
+        duration_ms: float,
+        success: bool,
+        error: Optional[str] = None
+    ):
+        """Record agent execution stats."""
+        if agent_name not in self.performance_stats:
+            self.performance_stats[agent_name] = {
+                'total_runs': 0,
+                'successful_runs': 0,
+                'failed_runs': 0,
+                'total_duration_ms': 0,
+                'avg_duration_ms': 0,
+                'min_duration_ms': float('inf'),
+                'max_duration_ms': 0
+            }
+
+        stats = self.performance_stats[agent_name]
+        stats['total_runs'] += 1
+
+        if success:
+            stats['successful_runs'] += 1
+        else:
+            stats['failed_runs'] += 1
+            self.error_counts[agent_name] = self.error_counts.get(agent_name, 0) + 1
+            if error:
+                self.last_errors[agent_name] = error
+
+        stats['total_duration_ms'] += duration_ms
+        stats['avg_duration_ms'] = stats['total_duration_ms'] / stats['total_runs']
+        stats['min_duration_ms'] = min(stats['min_duration_ms'], duration_ms)
+        stats['max_duration_ms'] = max(stats['max_duration_ms'], duration_ms)
+
+    def get_health_status(self) -> Dict[str, Any]:
+        """Get overall system health."""
+        total_errors = sum(self.error_counts.values())
+        total_runs = sum(s['total_runs'] for s in self.performance_stats.values())
+
+        if total_runs == 0:
+            error_rate = 0
+        else:
+            error_rate = (total_errors / total_runs) * 100
+
+        # Map the error rate onto a coarse health status
+        if error_rate < 5:
+            status = "healthy"
+        elif error_rate < 15:
+            status = "degraded"
+        else:
+            status = "unhealthy"
+
+        return {
+            'status': status,
+            'error_rate': f"{error_rate:.1f}%",
+            'total_runs': total_runs,
+            'total_errors': total_errors,
+            'agent_stats': self.performance_stats,
+            'recent_errors': self.last_errors
+        }
+
+    def reset_stats(self):
+        """Reset all statistics."""
+        self.performance_stats.clear()
+        self.error_counts.clear()
+        self.last_errors.clear()
+
+
+# Global instances for easy access
+global_tracer = AgentTracer()
+global_monitor = AgentMonitor()
+
+
+# Decorator for automatic tracing
+def trace_agent(agent_name: str):
+    """Decorator to automatically trace agent execution."""
+    def decorator(func):
+        def wrapper(*args, **kwargs):
+            event_id = global_tracer.start_agent(agent_name, args)
+            start_time = time.time()
+
+            try:
+                result = func(*args, **kwargs)
+                duration = (time.time() - start_time) * 1000
+
+                global_tracer.agent_output(agent_name, result)
+                global_monitor.record_execution(agent_name, duration, True)
+
+                return result
+
+            except Exception as e:
+                duration = (time.time() - start_time) * 1000
+
+                global_tracer.error(agent_name, e)
+                global_monitor.record_execution(agent_name, duration, False, str(e))
+
+                raise
+
+        return wrapper
+    return decorator
+
+
+# Demo usage
+def demo_observability():
+    """Demonstrate observability features."""
+    tracer = AgentTracer()
+    monitor = AgentMonitor()
+    triage = TriageAgent(tracer)
+
+    # Simulate agent interactions
+    routing = triage.triage_request("Help me write a resume for a software engineering position")
+
+    # Simulate tool calls
+    tracer.tool_call("CVAgent", "extract_keywords", {"text": "software engineering"})
+    tracer.tool_call("CVAgent", "optimize_ats", {"resume": "..."})
+
+    # Simulate a handoff between agents
+    tracer.agent_handoff("CVAgent", "ATSOptimizer", {"resume_draft": "..."})
+
+    # Print interaction flow
+    tracer.print_interaction_flow()
+
+    # Show metrics
+    print("Metrics:", tracer.get_metrics())
+
+
+if __name__ == "__main__":
+    demo_observability()
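The `trace_agent` decorator above can be reduced to a self-contained sketch. `MiniMonitor` here is a hypothetical, simplified stand-in for the file's `AgentTracer`/`AgentMonitor` pair; only the timing-and-record pattern is taken from the source:

```python
import time

class MiniMonitor:
    """Simplified stand-in for AgentMonitor: counts runs and failures."""
    def __init__(self):
        self.stats = {}

    def record(self, name, duration_ms, success):
        s = self.stats.setdefault(name, {'runs': 0, 'failures': 0, 'total_ms': 0.0})
        s['runs'] += 1
        s['total_ms'] += duration_ms
        if not success:
            s['failures'] += 1

monitor = MiniMonitor()

def trace(name):
    """Mirrors trace_agent: time the call, record success or failure, re-raise errors."""
    def decorator(func):
        def wrapper(*args, **kwargs):
            start = time.time()
            try:
                result = func(*args, **kwargs)
                monitor.record(name, (time.time() - start) * 1000, True)
                return result
            except Exception:
                monitor.record(name, (time.time() - start) * 1000, False)
                raise
        return wrapper
    return decorator

@trace("CVAgent")
def extract_keywords(text):
    return text.split()

keywords = extract_keywords("software engineering")
```

The decorated function behaves exactly as before; the monitor accumulates per-agent run counts and durations on the side.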
agents/orchestrator.py ADDED
@@ -0,0 +1,232 @@
+from __future__ import annotations
+from typing import List, Tuple, Optional
+import logging
+import re
+
+from models.schemas import OrchestrationResult, JobPosting, UserProfile
+from utils.text import extract_keywords_from_text
+from utils.consistency import detect_contradictions, allowed_keywords_from_profile
+from utils.probability import resume_probability, cover_letter_probability
+from utils.config import AgentConfig, UIConfig
+from memory.store import memory_store
+from .linkedin_manager import LinkedInManagerAgent
+from .cv_owner import CVOwnerAgent
+from .cover_letter_agent import CoverLetterAgent
+
+logger = logging.getLogger(__name__)
+
+
+class OrchestratorAgent:
+    def __init__(self) -> None:
+        self.linkedin = LinkedInManagerAgent()
+        self.cv_owner = CVOwnerAgent()
+        self.cover_letter = CoverLetterAgent()
+        self.name = "orchestrator"
+
+    def login_url(self) -> str:
+        return self.linkedin.get_login_url()
+
+    def handle_login_code(self, code: str, state: Optional[str] = None) -> bool:
+        """Handle OAuth callback with state validation for CSRF protection."""
+        return self.linkedin.handle_oauth_callback(code, state)
+
+    def get_profile(self) -> UserProfile:
+        return self.linkedin.get_profile()
+
+    def get_saved_jobs(self) -> List[JobPosting]:
+        return self.linkedin.get_saved_jobs()
+
+    def get_tailored_jobs(self, limit: int = UIConfig.MAX_SUGGESTED_JOBS) -> List[Tuple[JobPosting, float]]:
+        """Get jobs tailored to the user's profile, scored by skill overlap."""
+        profile = self.get_profile()
+        jobs = self.get_saved_jobs()
+        scored: List[Tuple[JobPosting, float]] = []
+        profile_keywords = {s.lower() for s in profile.skills}
+
+        if not profile_keywords:
+            logger.warning("No profile keywords found for job matching")
+            return [(j, 0.0) for j in jobs[:limit]]
+
+        for j in jobs:
+            jd_keywords = {k.lower() for k in extract_keywords_from_text(
+                j.description or "",
+                top_k=AgentConfig.JOB_KEYWORDS_COUNT
+            )}
+            overlap = profile_keywords.intersection(jd_keywords)
+            score = len(overlap) / max(1, len(profile_keywords))
+            scored.append((j, score))
+
+        scored.sort(key=lambda t: t[1], reverse=True)
+        return scored[:limit]
+
+    def _smart_remove_keyword(self, text: str, keyword: str) -> str:
+        """Intelligently remove a keyword from text without breaking sentences."""
+        # Try progressively narrower patterns, from full sentence down to the bare word
+        patterns = [
+            rf'\b[^.]*\b{re.escape(keyword)}\b[^.]*\.',  # Full sentence
+            rf',\s*[^,]*\b{re.escape(keyword)}\b[^,]*(?=,|\.|$)',  # Clause
+            rf'\b{re.escape(keyword)}\b\s*(?:and|or|,)\s*',  # List item
+            rf'(?:and|or|,)\s*\b{re.escape(keyword)}\b',  # List item
+            rf'\b{re.escape(keyword)}\b',  # Just the word
+        ]
+
+        for pattern in patterns:
+            new_text = re.sub(pattern, '', text, flags=re.IGNORECASE)
+            # Clean up any double spaces or punctuation
+            new_text = re.sub(r'\s+', ' ', new_text)
+            new_text = re.sub(r',\s*,', ',', new_text)
+            new_text = re.sub(r'\.\s*\.', '.', new_text)
+
+            if new_text != text:
+                logger.debug(f"Removed keyword '{keyword}' using pattern: {pattern[:30]}...")
+                return new_text.strip()
+
+        return text
+
+    def run_for_jobs(
+        self,
+        jobs: List[JobPosting],
+        user_id: str = "default_user",
+        cv_chat: Optional[str] = None,
+        cover_chat: Optional[str] = None,
+        cv_seed: Optional[str] = None,
+        cover_seed: Optional[str] = None,
+        agent2_notes: Optional[str] = None,
+        inspiration_url: Optional[str] = None
+    ) -> List[OrchestrationResult]:
+        """Orchestrate resume and cover letter generation for multiple jobs."""
+        profile = self.get_profile()
+        results: List[OrchestrationResult] = []
+        allowed = allowed_keywords_from_profile(profile.skills, profile.experiences)
+
+        logger.info(f"Starting orchestration for {len(jobs)} jobs")
+
+        for job in jobs:
+            logger.info(f"Processing job: {job.title} at {job.company}")
+
+            # Initial generation
+            resume_draft = self.cv_owner.create_resume(
+                profile, job,
+                user_id=user_id,
+                user_chat=cv_chat,
+                seed_text=cv_seed,
+                agent2_notes=agent2_notes
+            )
+
+            cover_draft = self.cover_letter.create_cover_letter(
+                profile, job,
+                user_id=user_id,
+                user_chat=cover_chat,
+                seed_text=cover_seed,
+                agent2_notes=agent2_notes,
+                inspiration_url=inspiration_url
+            )
+
+            # Consistency checking and refinement
+            for cycle in range(AgentConfig.OPTIMIZATION_CYCLES):
+                issues = detect_contradictions(resume_draft.text, cover_draft.text, allowed)
+
+                memory_store.save(user_id, self.name, {
+                    "job_id": job.id,
+                    "cycle": cycle + 1,
+                    "issues": issues,
+                    "issues_count": len(issues),
+                }, job_id=job.id)
+
+                if not issues:
+                    logger.info(f"No consistency issues found in cycle {cycle + 1}")
+                    break
+
+                logger.warning(f"Found {len(issues)} consistency issues in cycle {cycle + 1}")
+
+                # Smart removal of contradictory keywords
+                issues_to_fix = issues[:AgentConfig.MAX_CONTRADICTION_FIXES]
+                for keyword in issues_to_fix:
+                    if keyword.lower() not in allowed:
+                        # Use smart removal instead of simple replace
+                        cover_draft.text = self._smart_remove_keyword(cover_draft.text, keyword)
+                        logger.debug(f"Removed unauthorized keyword: {keyword}")
+
+                # Regenerate cover letter with fixes
+                cover_draft = self.cover_letter.create_cover_letter(
+                    profile, job,
+                    user_id=user_id,
+                    user_chat=cover_chat,
+                    seed_text=cover_draft.text,  # Use modified text as seed
+                    agent2_notes=agent2_notes,
+                    inspiration_url=inspiration_url
+                )
+
+            # Calculate metrics
+            salary = self.linkedin.estimate_salary(job)
+            p_resume = resume_probability(resume_draft.text, job.description)
+            p_cover = cover_letter_probability(cover_draft.text, job.description)
+            overall_p = max(0.0, min(1.0, p_resume * p_cover))
+
+            # Validate salary estimates
+            reasoning_ok = (
+                overall_p >= 0.0 and
+                salary.get("GBP", {}).get("low", 0) < salary.get("GBP", {}).get("high", 999999)
+            )
+
+            # Save final metrics
+            memory_store.save(user_id, self.name, {
+                "job_id": job.id,
+                "final": True,
+                "resume_keywords": resume_draft.keywords_used,
+                "cover_keywords": cover_draft.keywords_used,
+                "metrics": {
+                    "salary": salary,
+                    "p_resume": p_resume,
+                    "p_cover": p_cover,
+                    "overall_p": overall_p,
+                    "reasoning_ok": reasoning_ok,
+                }
+            }, job_id=job.id)
+
+            result = OrchestrationResult(
+                job=job,
+                resume=resume_draft,
+                cover_letter=cover_draft,
+                metrics={
+                    "salary": salary,
+                    "p_resume": p_resume,
+                    "p_cover": p_cover,
+                    "overall_p": overall_p,
+                    "reasoning_ok": reasoning_ok,
+                }
+            )
+            results.append(result)
+
+            logger.info(
+                f"Completed job {job.id}: resume_p={p_resume:.2f}, "
+                f"cover_p={p_cover:.2f}, overall_p={overall_p:.2f}"
+            )
+
+        logger.info(f"Orchestration complete for {len(results)} jobs")
+        return results
+
+    def regenerate_for_job(
+        self,
+        job: JobPosting,
+        user_id: str,
+        cv_chat: Optional[str] = None,
+        cover_chat: Optional[str] = None,
+        cv_seed: Optional[str] = None,
+        cover_seed: Optional[str] = None,
+        agent2_notes: Optional[str] = None,
+        inspiration_url: Optional[str] = None
+    ) -> OrchestrationResult:
+        """Regenerate documents for a single job."""
+        logger.info(f"Regenerating documents for job: {job.title} at {job.company}")
+        results = self.run_for_jobs(
+            [job],
+            user_id=user_id,
+            cv_chat=cv_chat,
+            cover_chat=cover_chat,
+            cv_seed=cv_seed,
+            cover_seed=cover_seed,
+            agent2_notes=agent2_notes,
+            inspiration_url=inspiration_url
+        )
+        return results[0]
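The keyword-removal mechanics in `_smart_remove_keyword` can be shown standalone. This sketch uses only the list-item and bare-word patterns from the source (omitting the full-sentence and clause patterns, which would delete the whole surrounding sentence), so it demonstrates the narrower end of the cascade:

```python
import re

def smart_remove_keyword(text: str, keyword: str) -> str:
    # A subset of the orchestrator's pattern cascade: list-item removals, then the bare word
    patterns = [
        rf'\b{re.escape(keyword)}\b\s*(?:and|or|,)\s*',  # "X and ..." / "X, ..."
        rf'(?:and|or|,)\s*\b{re.escape(keyword)}\b',     # "... and X"
        rf'\b{re.escape(keyword)}\b',                    # just the word
    ]
    for pattern in patterns:
        new_text = re.sub(pattern, '', text, flags=re.IGNORECASE)
        new_text = re.sub(r'\s+', ' ', new_text).strip()  # collapse leftover whitespace
        if new_text != text:
            return new_text
    return text

cleaned = smart_remove_keyword("Skilled in Python, Kubernetes and Go.", "Kubernetes")
# β†’ "Skilled in Python, Go."
```

The first pattern that changes the text wins, so the grammatically cleanest removal is preferred over simply blanking the word.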
agents/parallel_executor.py ADDED
@@ -0,0 +1,425 @@
+"""
+Parallel Agent Executor
+Implements async parallel execution of agents for faster processing
+Based on the parallel agent pattern for improved performance
+"""
+
+import asyncio
+import functools
+import time
+import logging
+from typing import List, Dict, Any, Callable, Tuple, Optional
+from dataclasses import dataclass
+from datetime import datetime
+from concurrent.futures import ThreadPoolExecutor
+
+import nest_asyncio
+import matplotlib.pyplot as plt
+
+from models.schemas import JobPosting, ResumeDraft, CoverLetterDraft, OrchestrationResult
+
+# Apply nest_asyncio to allow nested event loops (useful in Jupyter/Gradio)
+try:
+    nest_asyncio.apply()
+except Exception:
+    pass
+
+logger = logging.getLogger(__name__)
+
+
+@dataclass
+class AgentResult:
+    """Result from an agent execution"""
+    agent_name: str
+    output: Any
+    start_time: float
+    end_time: float
+    duration: float
+    success: bool
+    error: Optional[str] = None
+
+
+class ParallelAgentExecutor:
+    """Execute multiple agents in parallel for faster processing"""
+
+    def __init__(self, max_workers: int = 4):
+        self.max_workers = max_workers
+        self.executor = ThreadPoolExecutor(max_workers=max_workers)
+        self.execution_history: List[Tuple[str, float, float]] = []
+
+    async def run_agent_async(
+        self,
+        agent_func: Callable,
+        agent_name: str,
+        *args,
+        **kwargs
+    ) -> AgentResult:
+        """Run a single agent asynchronously"""
+        start_time = time.time()
+
+        try:
+            logger.info(f"Starting {agent_name} at {datetime.now()}")
+
+            # Run the agent function
+            if asyncio.iscoroutinefunction(agent_func):
+                result = await agent_func(*args, **kwargs)
+            else:
+                # Run sync function in the thread pool; functools.partial preserves
+                # both args and kwargs (run_in_executor accepts no keyword arguments)
+                loop = asyncio.get_event_loop()
+                result = await loop.run_in_executor(
+                    self.executor,
+                    functools.partial(agent_func, *args, **kwargs)
+                )
+
+            end_time = time.time()
+            duration = end_time - start_time
+
+            # Track execution
+            self.execution_history.append((agent_name, start_time, end_time))
+
+            logger.info(f"Completed {agent_name} in {duration:.2f}s")
+
+            return AgentResult(
+                agent_name=agent_name,
+                output=result,
+                start_time=start_time,
+                end_time=end_time,
+                duration=duration,
+                success=True
+            )
+
+        except Exception as e:
+            end_time = time.time()
+            duration = end_time - start_time
+
+            logger.error(f"Error in {agent_name}: {e}")
+
+            return AgentResult(
+                agent_name=agent_name,
+                output=None,
+                start_time=start_time,
+                end_time=end_time,
+                duration=duration,
+                success=False,
+                error=str(e)
+            )
+
+    async def run_parallel_agents(
+        self,
+        agents: List[Dict[str, Any]]
+    ) -> Dict[str, AgentResult]:
+        """
+        Run multiple agents in parallel.
+
+        Args:
+            agents: List of dicts with 'name', 'func', 'args', 'kwargs'
+
+        Returns:
+            Dict mapping agent names to results
+        """
+        tasks = []
+
+        for agent in agents:
+            task = self.run_agent_async(
+                agent['func'],
+                agent['name'],
+                *agent.get('args', []),
+                **agent.get('kwargs', {})
+            )
+            tasks.append(task)
+
+        # Run all agents in parallel
+        results = await asyncio.gather(*tasks, return_exceptions=True)
+
+        # Map results by name
+        result_map = {}
+        for i, agent in enumerate(agents):
+            if isinstance(results[i], Exception):
+                result_map[agent['name']] = AgentResult(
+                    agent_name=agent['name'],
+                    output=None,
+                    start_time=time.time(),
+                    end_time=time.time(),
+                    duration=0,
+                    success=False,
+                    error=str(results[i])
+                )
+            else:
+                result_map[agent['name']] = results[i]
+
+        return result_map
+
+    def plot_timeline(self, save_path: Optional[str] = None):
+        """Plot execution timeline of agents"""
+        if not self.execution_history:
+            logger.warning("No execution history to plot")
+            return
+
+        # Normalize times to zero
+        base = min(start for _, start, _ in self.execution_history)
+
+        # Prepare data
+        labels = []
+        start_offsets = []
+        durations = []
+
+        for name, start, end in self.execution_history:
+            labels.append(name)
+            start_offsets.append(start - base)
+            durations.append(end - start)
+
+        # Create plot
+        plt.figure(figsize=(10, 6))
+        plt.barh(labels, durations, left=start_offsets, height=0.5)
+        plt.xlabel("Seconds since start")
+        plt.title("Agent Execution Timeline")
+        plt.grid(True, alpha=0.3)
+
+        # Add duration labels
+        for i, (offset, duration) in enumerate(zip(start_offsets, durations)):
+            plt.text(offset + duration / 2, i, f'{duration:.2f}s',
+                     ha='center', va='center', color='white', fontsize=8)
+
+        plt.tight_layout()
+
+        if save_path:
+            plt.savefig(save_path)
+            logger.info(f"Timeline saved to {save_path}")
+        else:
+            plt.show()
+
+        return plt.gcf()
+
+
+class ParallelJobProcessor:
+    """Process multiple jobs in parallel using agent parallelization"""
+
+    def __init__(self):
+        self.executor = ParallelAgentExecutor(max_workers=4)
+
+    async def process_jobs_parallel(
+        self,
+        jobs: List[JobPosting],
+        cv_agent_func: Callable,
+        cover_agent_func: Callable,
+        research_func: Optional[Callable] = None,
+        **kwargs
+    ) -> List[OrchestrationResult]:
+        """
+        Process multiple jobs in parallel.
+
+        Each job gets:
+        1. Resume generation
+        2. Cover letter generation
+        3. Optional web research
+        All running in parallel per job.
+        """
+        all_results = []
+
+        for job in jobs:
+            # Define agents for this job
+            agents = [
+                {
+                    'name': f'Resume_{job.company}',
+                    'func': cv_agent_func,
+                    'args': [job],
+                    'kwargs': kwargs
+                },
+                {
+                    'name': f'CoverLetter_{job.company}',
+                    'func': cover_agent_func,
+                    'args': [job],
+                    'kwargs': kwargs
+                }
+            ]
+
+            # Add research if available
+            if research_func:
+                agents.append({
+                    'name': f'Research_{job.company}',
+                    'func': research_func,
+                    'args': [job.company],
+                    'kwargs': {}
+                })
+
+            # Run agents in parallel for this job
+            results = await self.executor.run_parallel_agents(agents)
+
+            # Combine results
+            research_result = results[f'Research_{job.company}'].output if research_func else None
+            orchestration_result = OrchestrationResult(
+                job=job,
+                resume=results[f'Resume_{job.company}'].output,
+                cover_letter=results[f'CoverLetter_{job.company}'].output,
+                keywords=[],  # Would be extracted
+                research=research_result
+            )
+
+            all_results.append(orchestration_result)
+
+        # Generate timeline
+        self.executor.plot_timeline(save_path="parallel_execution_timeline.png")
+
+        return all_results
+
+
+class MetaAgent:
+    """
+    Meta-agent that combines outputs from multiple specialized agents,
+    similar to the parallel pattern of combining summaries.
+    """
+
+    def __init__(self):
+        self.executor = ParallelAgentExecutor()
+
+    async def analyze_job_fit(
+        self,
+        job: JobPosting,
+        resume: ResumeDraft
+    ) -> Dict[str, Any]:
+        """Run multiple analysis agents in parallel and combine results"""
+        # Define specialized analysis agents
+        agents = [
+            {
+                'name': 'SkillsMatcher',
+                'func': self._match_skills,
+                'args': [job, resume]
+            },
+            {
+                'name': 'ExperienceAnalyzer',
+                'func': self._analyze_experience,
+                'args': [job, resume]
+            },
+            {
+                'name': 'CultureFit',
+                'func': self._assess_culture_fit,
+                'args': [job, resume]
+            },
+            {
+                'name': 'SalaryEstimator',
+                'func': self._estimate_salary_fit,
+                'args': [job, resume]
+            }
+        ]
+
+        # Run all agents in parallel
+        results = await self.executor.run_parallel_agents(agents)
+
+        # Combine into executive summary
+        summary = self._combine_analyses(results)
+
+        return summary
+
+    def _match_skills(self, job: JobPosting, resume: ResumeDraft) -> Dict:
+        """Match skills between job and resume"""
+        job_skills = set(job.description.lower().split())
+        resume_skills = set(resume.text.lower().split())
+
+        matched = job_skills & resume_skills
+        missing = job_skills - resume_skills
+
+        return {
+            'matched_skills': len(matched),
+            'missing_skills': len(missing),
+            'match_percentage': len(matched) / len(job_skills) * 100 if job_skills else 0,
+            'top_matches': list(matched)[:10]
+        }
+
+    def _analyze_experience(self, job: JobPosting, resume: ResumeDraft) -> Dict:
+        """Analyze experience relevance"""
+        # Simplified analysis
+        return {
+            'years_experience': 5,  # Would extract from resume
+            'relevant_roles': 3,
+            'industry_match': True
+        }
+
+    def _assess_culture_fit(self, job: JobPosting, resume: ResumeDraft) -> Dict:
+        """Assess cultural fit"""
+        return {
+            'remote_preference': 'remote' in job.location.lower() if job.location else False,
+            'company_size_fit': True,
+            'values_alignment': 0.8
+        }
+
+    def _estimate_salary_fit(self, job: JobPosting, resume: ResumeDraft) -> Dict:
+        """Estimate salary fit"""
+        return {
+            'estimated_range': '$100k-$150k',
+            'market_rate': True,
+            'negotiation_room': 'moderate'
+        }
+
+    def _combine_analyses(self, results: Dict[str, AgentResult]) -> Dict:
+        """Combine all analyses into an executive summary"""
+        summary = {
+            'overall_fit_score': 0,
+            'strengths': [],
+            'gaps': [],
+            'recommendations': [],
+            'detailed_analysis': {}
+        }
+
+        # Extract successful results
+        for name, result in results.items():
+            if result.success and result.output:
+                summary['detailed_analysis'][name] = result.output
+
+        # Calculate overall score
+        if 'SkillsMatcher' in summary['detailed_analysis']:
+            skills_score = summary['detailed_analysis']['SkillsMatcher'].get('match_percentage', 0)
+            summary['overall_fit_score'] = skills_score
+
+        # Generate recommendations
+        if summary['overall_fit_score'] > 70:
+            summary['recommendations'].append("Strong candidate - proceed with application")
+        elif summary['overall_fit_score'] > 50:
+            summary['recommendations'].append("Moderate fit - customize resume for better match")
+        else:
+            summary['recommendations'].append("Low fit - consider if this role aligns with goals")
+
+        return summary
+
+
+# Usage example
+async def demo_parallel_execution():
+    """Demonstrate parallel agent execution"""
+    executor = ParallelAgentExecutor()
+
+    # Define sample agents
+    async def agent1():
+        await asyncio.sleep(2)
+        return "Agent 1 result"
+
+    async def agent2():
+        await asyncio.sleep(1)
+        return "Agent 2 result"
+
+    async def agent3():
+        await asyncio.sleep(3)
+        return "Agent 3 result"
+
+    agents = [
+        {'name': 'FastAgent', 'func': agent2},
+        {'name': 'MediumAgent', 'func': agent1},
+        {'name': 'SlowAgent', 'func': agent3}
+    ]
+
+    # Run in parallel
+    results = await executor.run_parallel_agents(agents)
+
+    # Show results
+    for name, result in results.items():
+        print(f"{name}: {result.output} (took {result.duration:.2f}s)")
+
+    # Plot timeline
+    executor.plot_timeline()
+
+
+if __name__ == "__main__":
+    # Run demo
+    asyncio.run(demo_parallel_execution())
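The core of `run_parallel_agents` is `asyncio.gather` over independent coroutines. A minimal self-contained sketch of that pattern, without the repo's executor or result dataclass:

```python
import asyncio
import time

async def timed_agent(name: str, delay: float) -> str:
    # Stand-in for an agent call: sleep, then return a result
    await asyncio.sleep(delay)
    return name

async def main():
    start = time.time()
    # gather schedules both coroutines concurrently, so total wall time
    # tracks the slowest agent rather than the sum of all delays
    results = await asyncio.gather(
        timed_agent("fast", 0.05),
        timed_agent("slow", 0.15),
    )
    return results, time.time() - start

results, elapsed = asyncio.run(main())
```

Results come back in the order the awaitables were passed, regardless of which finished first; `return_exceptions=True` (used in the source) would additionally convert per-agent failures into returned exception objects instead of aborting the batch.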
agents/pipeline.py ADDED
@@ -0,0 +1,205 @@
1
+ from __future__ import annotations
2
+ from typing import Dict, Any
3
+ import os
4
+ import json
5
+ from datetime import datetime
6
+ from models.schemas import JobPosting, UserProfile, ResumeDraft, CoverLetterDraft, OrchestrationResult
7
+ from .router_agent import RouterAgent
8
+ from .profile_agent import ProfileAgent
9
+ from .job_agent import JobAgent
10
+ from .cv_owner import CVOwnerAgent
11
+ from .cover_letter_agent import CoverLetterAgent
12
+ from utils.consistency import detect_contradictions, allowed_keywords_from_profile, coverage_score
13
+ from memory.store import memory_store
14
+ from .temporal_tracker import TemporalApplicationTracker
15
+ from utils.text import extract_keywords_from_text
16
+
17
+
18
+ class ApplicationPipeline:
19
+ """User -> Router -> Profile -> Job -> Resume -> Cover -> Orchestrator Review -> User"""
20
+
21
+ def __init__(self) -> None:
22
+ self.router = RouterAgent()
23
+ self.profile_agent = ProfileAgent()
24
+ self.job_agent = JobAgent()
25
+ self.resume_agent = CVOwnerAgent()
26
+ self.cover_agent = CoverLetterAgent()
27
+ self.temporal_tracker = TemporalApplicationTracker()
28
+ self._events_path = os.path.join(str(memory_store.base_dir), "events.jsonl")
29
+
30
+ def _log_event(self, agent: str, event: str, payload: Dict[str, Any]) -> None:
31
+ try:
32
+ os.makedirs(os.path.dirname(self._events_path), exist_ok=True)
33
+ entry = {
34
+ "ts": datetime.now().isoformat(),
35
+ "agent": agent,
36
+ "event": event,
37
+ "payload": payload or {},
38
+ }
39
+ with open(self._events_path, "a", encoding="utf-8") as f:
40
+ f.write(json.dumps(entry, ensure_ascii=False) + "\n")
41
+ except Exception:
42
+ pass
43
+
44
+ def run(self, payload: Dict[str, Any], user_id: str = "default_user") -> Dict[str, Any]:
45
+ state = dict(payload)
46
+ while True:
47
+ next_step = self.router.route(state)
48
+ # Router decision summary (safe, no chain-of-thought)
49
+ try:
50
+ self._log_event(
51
+ "RouterAgent",
52
+ "route_decision",
53
+ {
54
+ "cv_present": bool(state.get("cv_text")),
55
+ "job_present": bool(state.get("job_posting")),
56
+ "profile_ready": bool(state.get("profile")),
57
+ "job_analyzed": bool(state.get("job_analysis")),
58
+ "resume_ready": bool(state.get("resume_draft")),
59
+ "cover_ready": bool(state.get("cover_letter_draft")),
60
+ "next": next_step,
61
+ },
62
+ )
63
+ except Exception:
64
+ pass
65
+
66
+ if next_step == "profile":
67
+ parsed = self.profile_agent.parse(state.get("cv_text", ""))
68
+ state["profile"] = parsed
69
+ # Profile summary
70
+ try:
71
+ prof = parsed or {}
72
+ self._log_event(
73
+ "ProfileAgent",
74
+ "parsed_profile",
75
+ {
76
+ "has_full_name": bool(prof.get("full_name")),
77
+ "skills_count": len(prof.get("skills", [])) if isinstance(prof, dict) else 0,
78
+ },
79
+ )
80
+ except Exception:
81
+ pass
82
+
83
+ elif next_step == "job":
84
+ analysis = self.job_agent.analyze(state.get("job_posting", ""))
85
+ state["job_analysis"] = analysis
86
+ # Job analysis summary
87
+ try:
88
+ ja = analysis or {}
89
+ self._log_event(
90
+ "JobAgent",
91
+ "job_analyzed",
92
+ {
93
+ "has_company": bool(ja.get("company")),
94
+ "has_role": bool(ja.get("role")),
95
+ "key_req_count": len(ja.get("key_requirements", [])) if isinstance(ja, dict) else 0,
96
+ },
97
+ )
98
+ except Exception:
99
+ pass
100
+
101
+ elif next_step == "resume":
102
+ profile_model = self._to_profile_model(state["profile"]) if not isinstance(state["profile"], UserProfile) else state["profile"]
103
+ job_model = self._to_job_model(state)
104
+ resume = self.resume_agent.create_resume(profile_model, job_model, user_id=user_id)
105
+ state["resume_draft"] = resume
106
+ # Optional summary
107
+ try:
108
+ job_k = extract_keywords_from_text(job_model.description or "", top_k=20)
109
+ cov = coverage_score(getattr(resume, "text", "") or "", job_k)
110
+ self._log_event("CVOwnerAgent", "resume_generated", {"job_id": job_model.id, "chars": len(getattr(resume, "text", "") or ""), "coverage": round(cov, 3)})
111
+ except Exception:
112
+ pass
113
+
114
+ elif next_step == "cover":
115
+ profile_model = self._to_profile_model(state["profile"]) if not isinstance(state["profile"], UserProfile) else state["profile"]
116
+ job_model = self._to_job_model(state)
117
+ cover = self.cover_agent.create_cover_letter(profile_model, job_model, user_id=user_id)
118
+ state["cover_letter_draft"] = cover
119
+ # Optional summary
120
+ try:
121
+ job_k = extract_keywords_from_text(job_model.description or "", top_k=20)
122
+ cov = coverage_score(getattr(cover, "text", "") or "", job_k)
123
+ self._log_event("CoverLetterAgent", "cover_generated", {"job_id": job_model.id, "chars": len(getattr(cover, "text", "") or ""), "coverage": round(cov, 3)})
124
+ except Exception:
125
+ pass
126
+
127
+ elif next_step == "review":
128
+ self._review(state, user_id)
129
+ break
130
+ return state
131
+
132
+ def _to_job_model(self, state: Dict[str, Any]) -> JobPosting:
133
+ return JobPosting(
134
+ id=state.get("job_id", "job_1"),
135
+ title=state.get("job_title") or state.get("job_analysis", {}).get("role", "Role"),
136
+ company=state.get("job_company") or state.get("job_analysis", {}).get("company", "Company"),
137
+ description=state.get("job_posting", ""),
138
+ location=state.get("job_analysis", {}).get("location"),
139
+ employment_type=state.get("job_analysis", {}).get("employment_type"),
140
+ )
141
+
142
+ def _to_profile_model(self, profile_dict: Dict[str, Any]) -> UserProfile:
143
+ # Best-effort mapping from parsed dict to model
144
+ return UserProfile(
145
+ full_name=profile_dict.get("full_name", ""),
146
+ headline=profile_dict.get("headline"),
147
+ summary=profile_dict.get("summary"),
148
+ email=profile_dict.get("email"),
149
+ phone=profile_dict.get("phone"),
150
+ location=profile_dict.get("location"),
151
+ skills=profile_dict.get("skills", []),
152
+ experiences=[
153
+ # Minimal mapping; agents rely on text and keywords anyway
154
+ ]
155
+ )
156
+
157
+ def _review(self, state: Dict[str, Any], user_id: str) -> None:
158
+ # Orchestrator-style review: detect contradictions and persist
159
+ resume_text = state.get("resume_draft").text if isinstance(state.get("resume_draft"), ResumeDraft) else ""
160
+        cover_text = state.get("cover_letter_draft").text if isinstance(state.get("cover_letter_draft"), CoverLetterDraft) else ""
+        profile = state.get("profile") or {}
+        job_desc = state.get("job_posting", "")
+        job_k = extract_keywords_from_text(job_desc or "", top_k=30)
+        base_allowed = allowed_keywords_from_profile(profile.get("skills", []), profile.get("experiences", [])) if isinstance(profile, dict) else allowed_keywords_from_profile(profile.skills, profile.experiences)
+        # Broaden allowed keywords with those present in the generated documents to reduce false positives
+        resume_k = set(k.lower() for k in extract_keywords_from_text(resume_text or "", top_k=150))
+        cover_k = set(k.lower() for k in extract_keywords_from_text(cover_text or "", top_k=150))
+        allowed = set(base_allowed) | resume_k | cover_k | set(k.lower() for k in job_k)
+        issues = detect_contradictions(resume_text, cover_text, allowed)
+        # Coverage metrics
+        resume_cov = coverage_score(resume_text or "", job_k)
+        cover_cov = coverage_score(cover_text or "", job_k)
+        # Simple recommendation score and decision
+        score = 0.45 * resume_cov + 0.45 * cover_cov - min(0.3, len(issues) / 100.0)
+        decision = "interview" if score >= 0.45 else "review"
+        memory_store.save(user_id, "orchestrator_review", {
+            "issues": issues,
+            "issues_count": len(issues),
+            "resume_coverage": round(resume_cov, 3),
+            "cover_coverage": round(cover_cov, 3),
+            "score": round(score, 3),
+            "decision": decision,
+        })
+        # Emit review event
+        try:
+            self._log_event(
+                "Orchestrator",
+                "review_summary",
+                {
+                    "issues_count": len(issues),
+                    "resume_cov": round(resume_cov, 3),
+                    "cover_cov": round(cover_cov, 3),
+                    "decision": decision,
+                },
+            )
+        except Exception:
+            pass
+
+        # Temporal tracking: record a drafted status with issues metadata
+        try:
+            job_model = self._to_job_model(state)
+            self.temporal_tracker.track_application(job_model, status="drafted", metadata={"issues_count": len(issues)})
+        except Exception:
+            # Non-fatal; continue even if temporal tracking fails
+            pass
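The review hunk above reduces to a small scoring rule: weight resume and cover-letter keyword coverage equally, subtract a contradiction penalty capped at 0.3, and recommend "interview" at or above 0.45. A standalone sketch of that rule (`review_score` and `review_decision` are illustrative names, not part of the diff):

```python
# Illustrative re-statement of the review scoring rule from the hunk above.
# These function names are hypothetical; the repo computes this inline.
def review_score(resume_cov: float, cover_cov: float, issues: int) -> float:
    """Equal-weight coverage blend minus a contradiction penalty capped at 0.3."""
    return 0.45 * resume_cov + 0.45 * cover_cov - min(0.3, issues / 100.0)


def review_decision(score: float, threshold: float = 0.45) -> str:
    """Mirror the orchestrator's interview/review cut-off."""
    return "interview" if score >= threshold else "review"
```

Note the penalty cap: with perfect coverage the score tops out at 0.9, so even 200 detected issues (penalty clamped at 0.3) cannot drag a perfect pair below the 0.45 threshold.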
agents/profile_agent.py ADDED
@@ -0,0 +1,39 @@
+ from __future__ import annotations
+ from typing import Dict, Any
+ from services.llm import llm
+ import json
+
+
+ class ProfileAgent:
+     """Parses raw CV text into a structured profile using LLM with fallback."""
+
+     def parse(self, cv_text: str) -> Dict[str, Any]:
+         if not cv_text:
+             return {}
+         if not llm.enabled:
+             return {
+                 "full_name": "Unknown",
+                 "email": "",
+                 "skills": [],
+                 "experiences": [],
+                 "links": {},
+                 "languages": [],
+                 "certifications": [],
+                 "projects": [],
+                 "work_mode": "",
+                 "skill_proficiency": {},
+             }
+         system = (
+             "You are a CV parser. Extract JSON with fields: full_name, email, phone, location, "
+             "skills (list), experiences (list of {title, company, start_date, end_date, achievements, technologies}), "
+             "education (list of {school, degree, field_of_study, start_date, end_date}), links (map with linkedin/portfolio/website if present). "
+             "Also extract optional: languages (list of {language, level}), certifications (list of {name, issuer, year}), "
+             "projects (list of {title, link, impact}), work_mode (remote/hybrid/on-site if evident), skill_proficiency (map skill->level). "
+             "Keep values concise; do not invent information."
+         )
+         user = f"Parse this CV into JSON with the schema above. Be strict JSON.\n\n{cv_text}"
+         resp = llm.generate(system, user, max_tokens=900, agent="parser")
+         try:
+             return json.loads(resp)
+         except Exception:
+             return {"raw": resp}
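`ProfileAgent.parse` drops to the `{"raw": resp}` fallback as soon as `json.loads` fails, but models frequently wrap otherwise-valid JSON in Markdown code fences. A tolerant pre-parse step like this hypothetical helper (not part of the diff) recovers those replies before giving up:

```python
import json
from typing import Any, Dict


def parse_llm_json(resp: str) -> Dict[str, Any]:
    """Parse LLM output as JSON, stripping a surrounding Markdown code fence
    (``` or ```json) before falling back to the raw-text wrapper."""
    text = resp.strip()
    if text.startswith("```"):
        # Drop the opening fence line (with its optional language tag)...
        text = text.split("\n", 1)[1] if "\n" in text else ""
        # ...and the closing fence, if present.
        if text.rstrip().endswith("```"):
            text = text.rstrip()[:-3]
    try:
        return json.loads(text)
    except Exception:
        return {"raw": resp}  # same fallback shape as ProfileAgent.parse
```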
agents/router_agent.py ADDED
@@ -0,0 +1,18 @@
+ from __future__ import annotations
+ from typing import Literal, Optional, Dict, Any
+
+
+ class RouterAgent:
+     """Simple router that decides the next step in the pipeline."""
+
+     def route(self, payload: Dict[str, Any]) -> Literal["profile", "job", "resume", "cover", "review"]:
+         # Basic heuristics based on provided payload
+         if payload.get("cv_text") and not payload.get("profile"):
+             return "profile"
+         if payload.get("job_posting") and not payload.get("job_analysis"):
+             return "job"
+         if payload.get("profile") and payload.get("job_analysis") and not payload.get("resume_draft"):
+             return "resume"
+         if payload.get("resume_draft") and not payload.get("cover_letter_draft"):
+             return "cover"
+         return "review"
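Because the if-chain tests truthiness, an empty parse result (`profile == {}`) routes back to "profile" rather than forward, and earlier stages take strict precedence. A quick walk-through, with the router condensed from the hunk above for illustration:

```python
from typing import Any, Dict


class RouterAgent:
    """Condensed copy of the router above, for illustration only."""

    def route(self, payload: Dict[str, Any]) -> str:
        if payload.get("cv_text") and not payload.get("profile"):
            return "profile"
        if payload.get("job_posting") and not payload.get("job_analysis"):
            return "job"
        if payload.get("profile") and payload.get("job_analysis") and not payload.get("resume_draft"):
            return "resume"
        if payload.get("resume_draft") and not payload.get("cover_letter_draft"):
            return "cover"
        return "review"


router = RouterAgent()
# An empty profile dict is falsy, so parsing is retried:
assert router.route({"cv_text": "...", "profile": {}}) == "profile"
# Once each artifact exists, the router advances one stage at a time:
assert router.route({"profile": {"name": "A"}, "job_posting": "jd"}) == "job"
assert router.route({"profile": {"name": "A"}, "job_analysis": {"kw": 1}}) == "resume"
assert router.route({"resume_draft": "r"}) == "cover"
assert router.route({"resume_draft": "r", "cover_letter_draft": "c"}) == "review"
```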
agents/temporal_tracker.py ADDED
@@ -0,0 +1,464 @@
+ """
+ Temporal Application Tracker
+ Implements time-aware tracking of job applications with versioned history
+ Based on the Temporal AI Agents pattern for maintaining historical context
+ """
+
+ import json
+ import logging
+ from typing import Dict, List, Tuple, Optional, Any
+ from datetime import datetime, timedelta
+ from dataclasses import dataclass, field
+ from pathlib import Path
+ import hashlib
+
+ from models.schemas import JobPosting, OrchestrationResult
+
+ logger = logging.getLogger(__name__)
+
+
+ @dataclass
+ class Triplet:
+     """
+     A time-stamped fact in subject-predicate-object format
+     Example: (JobID123, status, applied, 2025-01-15)
+     """
+     subject: str
+     predicate: str
+     object: Any
+     valid_at: datetime
+     expired_at: Optional[datetime] = None
+     confidence: float = 1.0
+     source: str = "user"
+     metadata: Dict = field(default_factory=dict)
+
+     def to_dict(self) -> Dict:
+         return {
+             'subject': self.subject,
+             'predicate': self.predicate,
+             'object': str(self.object),
+             'valid_at': self.valid_at.isoformat(),
+             'expired_at': self.expired_at.isoformat() if self.expired_at else None,
+             'confidence': self.confidence,
+             'source': self.source,
+             'metadata': self.metadata
+         }
+
+     @classmethod
+     def from_dict(cls, data: Dict) -> 'Triplet':
+         return cls(
+             subject=data['subject'],
+             predicate=data['predicate'],
+             object=data['object'],
+             valid_at=datetime.fromisoformat(data['valid_at']),
+             expired_at=datetime.fromisoformat(data['expired_at']) if data.get('expired_at') else None,
+             confidence=data.get('confidence', 1.0),
+             source=data.get('source', 'user'),
+             metadata=data.get('metadata', {})
+         )
+
+
+ class TemporalKnowledgeGraph:
+     """
+     Knowledge graph that tracks changes over time
+     Maintains history of all application states and changes
+     """
+
+     def __init__(self, storage_path: str = "temporal_graph.json"):
+         self.storage_path = Path(storage_path)
+         self.triplets: List[Triplet] = []
+         self.load()
+
+     def add_triplet(self, triplet: Triplet) -> None:
+         """Add a new fact to the graph"""
+         # Check for contradictions
+         existing = self.find_current(triplet.subject, triplet.predicate)
+
+         if existing and existing.object != triplet.object:
+             # Invalidate old triplet
+             existing.expired_at = triplet.valid_at
+             logger.info(f"Invalidated old triplet: {existing.subject}-{existing.predicate}")
+
+         self.triplets.append(triplet)
+         self.save()
+
+     def find_current(
+         self,
+         subject: str,
+         predicate: str,
+         at_time: Optional[datetime] = None
+     ) -> Optional[Triplet]:
+         """Find the current valid triplet for a subject-predicate pair"""
+         at_time = at_time or datetime.now()
+
+         for triplet in reversed(self.triplets):  # Check most recent first
+             if (triplet.subject == subject and
+                     triplet.predicate == predicate and
+                     triplet.valid_at <= at_time and
+                     (triplet.expired_at is None or triplet.expired_at > at_time)):
+                 return triplet
+
+         return None
+
+     def get_history(
+         self,
+         subject: str,
+         predicate: Optional[str] = None
+     ) -> List[Triplet]:
+         """Get full history for a subject"""
+         history = []
+
+         for triplet in self.triplets:
+             if triplet.subject == subject:
+                 if predicate is None or triplet.predicate == predicate:
+                     history.append(triplet)
+
+         return sorted(history, key=lambda t: t.valid_at)
+
+     def query_timerange(
+         self,
+         start_date: datetime,
+         end_date: datetime,
+         predicate: Optional[str] = None
+     ) -> List[Triplet]:
+         """Query all triplets valid within a time range"""
+         results = []
+
+         for triplet in self.triplets:
+             if (triplet.valid_at >= start_date and
+                     triplet.valid_at <= end_date):
+                 if predicate is None or triplet.predicate == predicate:
+                     results.append(triplet)
+
+         return results
+
+     def save(self) -> None:
+         """Save graph to disk"""
+         data = {
+             'triplets': [t.to_dict() for t in self.triplets],
+             'last_updated': datetime.now().isoformat()
+         }
+
+         with open(self.storage_path, 'w') as f:
+             json.dump(data, f, indent=2)
+
+     def load(self) -> None:
+         """Load graph from disk"""
+         if not self.storage_path.exists():
+             return
+
+         try:
+             with open(self.storage_path, 'r') as f:
+                 data = json.load(f)
+
+             self.triplets = [
+                 Triplet.from_dict(t) for t in data.get('triplets', [])
+             ]
+
+             logger.info(f"Loaded {len(self.triplets)} triplets from storage")
+
+         except Exception as e:
+             logger.error(f"Error loading temporal graph: {e}")
+
+
+ class TemporalApplicationTracker:
+     """
+     Track job applications with full temporal history
+     Maintains versioned states and changes over time
+     """
+
+     def __init__(self):
+         self.graph = TemporalKnowledgeGraph("application_history.json")
+
+     def track_application(
+         self,
+         job: JobPosting,
+         status: str,
+         metadata: Optional[Dict] = None
+     ) -> None:
+         """Track a new application or status change"""
+         job_id = self._get_job_id(job)
+         now = datetime.now()
+
+         # Core application triplets
+         triplets = [
+             Triplet(job_id, "company", job.company, now),
+             Triplet(job_id, "position", job.title, now),
+             Triplet(job_id, "status", status, now),
+             Triplet(job_id, "applied_date", now.isoformat(), now),
+         ]
+
+         # Optional fields
+         if job.location:
+             triplets.append(Triplet(job_id, "location", job.location, now))
+
+         if job.salary:
+             triplets.append(Triplet(job_id, "salary", job.salary, now))
+
+         if job.url:
+             triplets.append(Triplet(job_id, "url", job.url, now))
+
+         # Add metadata as triplets
+         if metadata:
+             for key, value in metadata.items():
+                 triplets.append(
+                     Triplet(job_id, f"meta_{key}", value, now, metadata={'source': 'metadata'})
+                 )
+
+         # Add all triplets
+         for triplet in triplets:
+             self.graph.add_triplet(triplet)
+
+         logger.info(f"Tracked application for {job.company} - {job.title}")
+
+     def update_status(
+         self,
+         job_id: str,
+         new_status: str,
+         notes: Optional[str] = None
+     ) -> None:
+         """Update application status"""
+         now = datetime.now()
+
+         # Add new status triplet (old one auto-invalidated)
+         self.graph.add_triplet(
+             Triplet(job_id, "status", new_status, now)
+         )
+
+         # Add notes if provided
+         if notes:
+             self.graph.add_triplet(
+                 Triplet(job_id, "status_notes", notes, now, metadata={'type': 'note'})
+             )
+
+         # Track status change event
+         self.graph.add_triplet(
+             Triplet(
+                 job_id,
+                 "status_changed",
+                 f"Changed to {new_status}",
+                 now,
+                 metadata={'event_type': 'status_change'}
+             )
+         )
+
+     def add_interview(
+         self,
+         job_id: str,
+         interview_date: datetime,
+         interview_type: str,
+         notes: Optional[str] = None
+     ) -> None:
+         """Track interview scheduling"""
+         now = datetime.now()
+
+         self.graph.add_triplet(
+             Triplet(
+                 job_id,
+                 "interview_scheduled",
+                 interview_date.isoformat(),
+                 now,
+                 metadata={'type': interview_type}
+             )
+         )
+
+         if notes:
+             self.graph.add_triplet(
+                 Triplet(job_id, "interview_notes", notes, now)
+             )
+
+         # Auto-update status
+         self.update_status(job_id, "interview_scheduled")
+
+     def get_application_timeline(self, job_id: str) -> List[Dict]:
+         """Get complete timeline for an application"""
+         history = self.graph.get_history(job_id)
+
+         timeline = []
+         for triplet in history:
+             timeline.append({
+                 'date': triplet.valid_at.isoformat(),
+                 'event': f"{triplet.predicate}: {triplet.object}",
+                 'expired': triplet.expired_at is not None
+             })
+
+         return timeline
+
+     def get_active_applications(self) -> List[Dict]:
+         """Get all currently active applications"""
+         # Find all unique job IDs
+         job_ids = set()
+         for triplet in self.graph.triplets:
+             if triplet.subject.startswith('JOB_'):
+                 job_ids.add(triplet.subject)
+
+         active = []
+         for job_id in job_ids:
+             status = self.graph.find_current(job_id, "status")
+
+             if status and status.object not in ['rejected', 'withdrawn', 'archived']:
+                 company = self.graph.find_current(job_id, "company")
+                 position = self.graph.find_current(job_id, "position")
+
+                 active.append({
+                     'job_id': job_id,
+                     'company': company.object if company else 'Unknown',
+                     'position': position.object if position else 'Unknown',
+                     'status': status.object,
+                     'last_updated': status.valid_at.isoformat()
+                 })
+
+         return active
+
+     def analyze_patterns(self) -> Dict[str, Any]:
+         """Analyze application patterns over time"""
+         now = datetime.now()
+
+         # Applications per week
+         week_ago = now - timedelta(days=7)
+         month_ago = now - timedelta(days=30)
+
+         week_apps = self.graph.query_timerange(week_ago, now, "status")
+         month_apps = self.graph.query_timerange(month_ago, now, "status")
+
+         # Status distribution
+         status_counts = {}
+         for triplet in self.graph.triplets:
+             if triplet.predicate == "status" and triplet.expired_at is None:
+                 status = triplet.object
+                 status_counts[status] = status_counts.get(status, 0) + 1
+
+         # Response rate
+         total_apps = len([t for t in self.graph.triplets if t.predicate == "status" and t.object == "applied"])
+         responses = len([t for t in self.graph.triplets if t.predicate == "status" and t.object in ["interview_scheduled", "rejected", "offer"]])
+
+         response_rate = (responses / total_apps * 100) if total_apps > 0 else 0
+
+         return {
+             'applications_this_week': len(week_apps),
+             'applications_this_month': len(month_apps),
+             'status_distribution': status_counts,
+             'response_rate': f"{response_rate:.1f}%",
+             'total_applications': total_apps
+         }
+
+     def _get_job_id(self, job: JobPosting) -> str:
+         """Generate consistent job ID"""
+         if job.id:
+             return job.id
+
+         # Generate ID from company and title
+         key = f"{job.company}_{job.title}".lower().replace(' ', '_')
+         hash_val = hashlib.md5(key.encode()).hexdigest()[:8]
+         return f"JOB_{hash_val}"
+
+
+ class TemporalInvalidationAgent:
+     """
+     Agent that checks for and invalidates outdated information
+     Based on the invalidation pattern from the article
+     """
+
+     def __init__(self, graph: TemporalKnowledgeGraph):
+         self.graph = graph
+
+     def check_contradictions(
+         self,
+         new_triplet: Triplet,
+         threshold: float = 0.8
+     ) -> Optional[Triplet]:
+         """Check if new triplet contradicts existing ones"""
+
+         # Find existing triplets with same subject-predicate
+         existing = self.graph.find_current(
+             new_triplet.subject,
+             new_triplet.predicate
+         )
+
+         if not existing:
+             return None
+
+         # Check for contradiction
+         if existing.object != new_triplet.object:
+             # Calculate confidence in contradiction
+             time_diff = (new_triplet.valid_at - existing.valid_at).total_seconds()
+
+             # More recent info is more likely to be correct
+             if time_diff > 0:  # New triplet is more recent
+                 confidence = min(1.0, time_diff / (24 * 3600))  # Max confidence after 1 day
+
+                 if confidence > threshold:
+                     return existing  # Return triplet to invalidate
+
+         return None
+
+     def cleanup_expired(self, days_old: int = 90) -> int:
+         """Archive triplets older than specified days"""
+         cutoff = datetime.now() - timedelta(days=days_old)
+         archived = 0
+
+         for triplet in self.graph.triplets:
+             if triplet.expired_at and triplet.expired_at < cutoff:
+                 # Move to archive (in real implementation)
+                 triplet.metadata['archived'] = True
+                 archived += 1
+
+         if archived > 0:
+             self.graph.save()
+             logger.info(f"Archived {archived} expired triplets")
+
+         return archived
+
+
+ # Usage example
+ def demo_temporal_tracking():
+     """Demonstrate temporal tracking"""
+
+     tracker = TemporalApplicationTracker()
+
+     # Create sample job
+     job = JobPosting(
+         id="JOB_001",
+         title="Senior Software Engineer",
+         company="TechCorp",
+         location="San Francisco",
+         salary="$150k-$200k",
+         url="https://techcorp.com/jobs/123"
+     )
+
+     # Track initial application
+     tracker.track_application(job, "applied", {
+         'cover_letter_version': 'v1',
+         'resume_version': 'v2'
+     })
+
+     # Simulate status updates over time
+     import time
+     time.sleep(1)
+     tracker.update_status("JOB_001", "screening", "Passed initial ATS scan")
+
+     time.sleep(1)
+     tracker.add_interview(
+         "JOB_001",
+         datetime.now() + timedelta(days=7),
+         "phone_screen",
+         "30 min call with hiring manager"
+     )
+
+     # Get timeline
+     timeline = tracker.get_application_timeline("JOB_001")
+     print("Application Timeline:")
+     for event in timeline:
+         print(f"  {event['date']}: {event['event']}")
+
+     # Get active applications
+     active = tracker.get_active_applications()
+     print(f"\nActive Applications: {len(active)}")
+
+     # Analyze patterns
+     patterns = tracker.analyze_patterns()
+     print(f"\nPatterns: {patterns}")
+
+
+ if __name__ == "__main__":
+     demo_temporal_tracking()
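`TemporalKnowledgeGraph` calls `save()` on every `add_triplet`, so a convenient way to reason about its invalidation and point-in-time lookup semantics is a disk-free reduction (an illustrative sketch, not the shipped class):

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Any, List, Optional


@dataclass
class Fact:
    # Same bitemporal shape as Triplet, minus the persistence fields.
    subject: str
    predicate: str
    object: Any
    valid_at: datetime
    expired_at: Optional[datetime] = None


class InMemoryTemporalGraph:
    def __init__(self) -> None:
        self.facts: List[Fact] = []

    def add(self, fact: Fact) -> None:
        # A contradicting newer fact expires the current one (as in add_triplet).
        current = self.find_current(fact.subject, fact.predicate, fact.valid_at)
        if current and current.object != fact.object:
            current.expired_at = fact.valid_at
        self.facts.append(fact)

    def find_current(self, subject: str, predicate: str,
                     at_time: Optional[datetime] = None) -> Optional[Fact]:
        at_time = at_time or datetime.now()
        for fact in reversed(self.facts):  # most recent first
            if (fact.subject == subject and fact.predicate == predicate
                    and fact.valid_at <= at_time
                    and (fact.expired_at is None or fact.expired_at > at_time)):
                return fact
        return None


g = InMemoryTemporalGraph()
t0, t1 = datetime(2025, 1, 1), datetime(2025, 1, 10)
g.add(Fact("JOB_1", "status", "applied", t0))
g.add(Fact("JOB_1", "status", "screening", t1))
# Latest wins now, but the graph can still answer "what was true on Jan 2?"
assert g.find_current("JOB_1", "status").object == "screening"
assert g.find_current("JOB_1", "status", t0 + timedelta(days=1)).object == "applied"
```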
app.py ADDED
@@ -0,0 +1,65 @@
+ #!/usr/bin/env python3
+ """
+ Multi-Agent Job Application Assistant - HuggingFace Spaces Deployment
+ Production-ready system with Gemini 2.5 Flash, A2A Protocol, and MCP Integration
+ Features: Resume/Cover Letter Generation, Job Matching, Document Export, Advanced AI Agents
+ """
+
+ # Use the hf_app.py as the main app for HuggingFace Spaces
+ from hf_app import *
+
+ if __name__ == "__main__":
+     # Configure for HuggingFace Spaces deployment
+     import os
+
+     # Set up HF-specific configurations
+     os.environ.setdefault("GRADIO_SERVER_NAME", "0.0.0.0")
+     os.environ.setdefault("GRADIO_SERVER_PORT", str(os.getenv("PORT", "7860")))
+
+     print("🚀 Starting Multi-Agent Job Application Assistant on HuggingFace Spaces")
+     print("=" * 70)
+     print("Features:")
+     print("✅ Gemini 2.5 Flash AI Generation")
+     print("✅ Advanced Multi-Agent System (A2A Protocol)")
+     print("✅ Resume & Cover Letter Generation")
+     print("✅ Job Matching & Research")
+     print("✅ Document Export (Word/PowerPoint/Excel)")
+     print("✅ MCP Server Integration")
+     print("=" * 70)
+
+     try:
+         app = build_app()
+         app.launch(
+             server_name="0.0.0.0",
+             server_port=int(os.getenv("PORT", 7860)),
+             share=False,
+             show_error=True,
+             mcp_server=True  # Enable MCP server for HuggingFace Spaces
+         )
+     except Exception as e:
+         print(f"❌ Startup Error: {e}")
+         print("\n🔧 Troubleshooting:")
+         print("1. Check environment variables in Space settings")
+         print("2. Verify all dependencies in requirements.txt")
+         print("3. Check logs for detailed error information")
+
+         # Fallback: Simple demo interface
+         print("\n🔄 Starting simplified interface...")
+         import gradio as gr
+
+         def simple_demo(message: str = "") -> str:
+             # Accept the Textbox input so gr.Interface can call this fn
+             return "Multi-Agent Job Application Assistant is initializing. Please check back in a moment."
+
+         demo = gr.Interface(
+             fn=simple_demo,
+             inputs=gr.Textbox(label="Status Check"),
+             outputs=gr.Textbox(label="System Status"),
+             title="🚀 Job Application Assistant",
+             description="Production-ready multi-agent system for job applications"
+         )
+
+         demo.launch(
+             server_name="0.0.0.0",
+             server_port=int(os.getenv("PORT", 7860)),
+             share=False
+         )
hf_app.py ADDED
@@ -0,0 +1,1613 @@
+ #!/usr/bin/env python3
+ """
+ Multi-Agent Job Application Assistant - HuggingFace Spaces Deployment
+ Production-ready system with Gemini 2.5 Flash, A2A Protocol, and MCP Integration
+ Features: Resume/Cover Letter Generation, Job Matching, Document Export, Advanced AI Agents
+ """
+
+ import os
+ import uuid
+ import time
+ import logging
+ import asyncio
+ from typing import List, Optional, Dict, Any
+ from dataclasses import dataclass, field
+ import webbrowser
+ from datetime import datetime, timedelta
+ import json
+ from pathlib import Path
+
+ import gradio as gr
+ from dotenv import load_dotenv
+ import nest_asyncio
+
+ # Apply nest_asyncio for async support in Gradio
+ try:
+     nest_asyncio.apply()
+ except Exception:  # nest_asyncio is optional; continue without it
+     pass
+
+ # Load environment variables
+ load_dotenv(override=True)
+
+ # Configure logging
+ logging.basicConfig(level=logging.INFO)
+ logger = logging.getLogger(__name__)
+
+ # =======================
+ # Try to import from system, fall back to standalone mode if not available
+ # =======================
+
+ USE_SYSTEM_AGENTS = True
+ ADVANCED_FEATURES = False
+ LANGEXTRACT_AVAILABLE = False
+
+ try:
+     from agents.orchestrator import OrchestratorAgent
+     from models.schemas import JobPosting, OrchestrationResult
+     logger.info("System agents loaded - full functionality available")
+
+     # Try to import LangExtract service
+     try:
+         from services.langextract_service import (
+             extract_job_info,
+             extract_ats_keywords,
+             optimize_for_ats,
+             create_extraction_summary,
+             create_ats_report
+         )
+         LANGEXTRACT_AVAILABLE = True
+         logger.info("📊 LangExtract service loaded for enhanced extraction")
+     except ImportError:
+         LANGEXTRACT_AVAILABLE = False
+
+     # Try to import advanced AI agent features
+     try:
+         from agents.parallel_executor import ParallelAgentExecutor, ParallelJobProcessor, MetaAgent
+         from agents.temporal_tracker import TemporalApplicationTracker, TemporalKnowledgeGraph
+         from agents.observability import AgentTracer, AgentMonitor, TriageAgent, global_tracer
+         from agents.context_engineer import ContextEngineer, DataFlywheel
+         from agents.context_scaler import ContextScalingOrchestrator
+         ADVANCED_FEATURES = True
+         logger.info("✨ Advanced AI agent features loaded successfully!")
+     except ImportError as e:
+         logger.info(f"Advanced features not available: {e}")
+
+     # Try to import knowledge graph service
+     try:
+         from services.knowledge_graph_service import get_knowledge_graph_service
+         kg_service = get_knowledge_graph_service()
+         KG_AVAILABLE = kg_service.is_enabled()
+         if KG_AVAILABLE:
+             logger.info("📊 Knowledge Graph service initialized - tracking enabled")
+     except ImportError:
+         KG_AVAILABLE = False
+         kg_service = None
+         logger.info("Knowledge graph service not available")
+
+     USE_SYSTEM_AGENTS = True
+
+ except ImportError:
+     logger.info("Running in standalone mode - using simplified agents")
+     USE_SYSTEM_AGENTS = False
+
+     # Define minimal data structures for standalone operation
+     @dataclass
+     class JobPosting:
+         id: str
+         title: str
+         company: str
+         description: str
+         location: Optional[str] = None
+         url: Optional[str] = None
+         source: Optional[str] = None
+         saved_by_user: bool = False
+
+     @dataclass
+     class ResumeDraft:
+         job_id: str
+         text: str
+         keywords_used: List[str] = field(default_factory=list)
+
+     @dataclass
+     class CoverLetterDraft:
+         job_id: str
+         text: str
+         keywords_used: List[str] = field(default_factory=list)
+
+     @dataclass
+     class OrchestrationResult:
+         job: JobPosting
+         resume: ResumeDraft
+         cover_letter: CoverLetterDraft
+         metrics: Optional[Dict[str, Any]] = None
+
+     # Simplified orchestrator for standalone operation
+     class OrchestratorAgent:
+         def __init__(self):
+             self.mock_jobs = [
+                 JobPosting(
+                     id="example_1",
+                     title="Senior Software Engineer",
+                     company="Tech Corp",
+                     location="Remote",
+                     description="We need a Senior Software Engineer with Python, AWS, Docker experience.",
+                     saved_by_user=True
+                 )
+             ]
+
+         def get_saved_jobs(self):
+             return self.mock_jobs
+
+         def run_for_jobs(self, jobs, **kwargs):
+             results = []
+             for job in jobs:
+                 resume = ResumeDraft(
+                     job_id=job.id,
+                     text=f"Professional Resume for {job.title}\n\nExperienced professional with skills matching {job.company} requirements.",
+                     keywords_used=["Python", "AWS", "Docker"]
+                 )
+                 cover = CoverLetterDraft(
+                     job_id=job.id,
+                     text=f"Dear Hiring Manager,\n\nI am excited to apply for the {job.title} position at {job.company}.",
+                     keywords_used=["leadership", "innovation"]
+                 )
+                 results.append(OrchestrationResult(
+                     job=job,
+                     resume=resume,
+                     cover_letter=cover,
+                     metrics={
+                         "salary": {"USD": {"low": 100000, "high": 150000}},
+                         "p_resume": 0.75,
+                         "p_cover": 0.80,
+                         "overall_p": 0.60
+                     }
+                 ))
+             return results
+
+         def regenerate_for_job(self, job, **kwargs):
+             return self.run_for_jobs([job], **kwargs)[0]
+
+ # Initialize orchestrator and advanced features
+ try:
+     orch = OrchestratorAgent()
+     logger.info("Orchestrator initialized successfully")
+
+     # Initialize advanced features if available
+     if ADVANCED_FEATURES:
+         # Initialize parallel executor
+         parallel_executor = ParallelAgentExecutor(max_workers=4)
+         parallel_processor = ParallelJobProcessor()
+         meta_agent = MetaAgent()
+
+         # Initialize temporal tracker
+         temporal_tracker = TemporalApplicationTracker()
+
+         # Initialize observability
+         agent_tracer = AgentTracer()
+         agent_monitor = AgentMonitor()
+         triage_agent = TriageAgent(agent_tracer)
+
+         # Initialize context engineering
+         context_engineer = ContextEngineer()
+         context_scaler = ContextScalingOrchestrator()
+
+         logger.info("✅ All advanced AI agent features initialized")
+     else:
+         parallel_executor = None
+         temporal_tracker = None
+         agent_tracer = None
+         context_engineer = None
+
+ except Exception as e:
+     logger.error(f"Failed to initialize orchestrator: {e}")
+     raise
+
+ # Session state
+ STATE = {
+     "user_id": "default_user",
+     "cv_seed": None,
+     "cover_seed": None,
+     "agent2_notes": "",
+     "custom_jobs": [],
+     "cv_chat": "",
+     "cover_chat": "",
+     "results": [],
+     "inspiration_url": "https://www.careeraddict.com/7-funniest-cover-letters",
+     "use_inspiration": False,
+     "linkedin_authenticated": False,
+     "linkedin_profile": None,
+     "parallel_mode": False,
+     "track_applications": True,
+     "enable_observability": True,
+     "use_context_engineering": True,
+     "execution_timeline": None,
+     "application_history": [],
+ }
+
+ # Check LinkedIn OAuth configuration
+ LINKEDIN_CLIENT_ID = os.getenv("LINKEDIN_CLIENT_ID")
+ LINKEDIN_CLIENT_SECRET = os.getenv("LINKEDIN_CLIENT_SECRET")
+ MOCK_MODE = os.getenv("MOCK_MODE", "true").lower() == "true"
+
+ # Check Adzuna configuration
+ ADZUNA_APP_ID = os.getenv("ADZUNA_APP_ID")
+ ADZUNA_APP_KEY = os.getenv("ADZUNA_APP_KEY")
+
+
238
+ def add_custom_job(title: str, company: str, location: str, url: str, desc: str):
239
+ """Add a custom job with validation"""
240
+ try:
241
+ if not title or not company or not desc:
242
+ return gr.update(value="❌ Title, Company, and Description are required"), None
243
+
244
+ job = JobPosting(
245
+ id=f"custom_{uuid.uuid4().hex[:8]}",
246
+ title=title.strip(),
247
+ company=company.strip(),
248
+ location=location.strip() if location else None,
249
+ description=desc.strip(),
250
+ url=url.strip() if url else None,
251
+ source="custom",
252
+ saved_by_user=True,
253
+ )
254
+ STATE["custom_jobs"].append(job)
255
+ logger.info(f"Added custom job: {job.title} at {job.company}")
256
+ return gr.update(value=f"βœ… Added: {job.title} at {job.company}"), ""
257
+ except Exception as e:
258
+ logger.error(f"Error adding job: {e}")
259
+ return gr.update(value=f"❌ Error: {str(e)}"), None
260
+
261
+
262
+ def get_linkedin_auth_url():
+ """Get LinkedIn OAuth URL"""
+ if USE_SYSTEM_AGENTS and not MOCK_MODE and LINKEDIN_CLIENT_ID:
+ try:
+ from services.linkedin_client import LinkedInClient
+ client = LinkedInClient()
+ return client.get_authorize_url()
+ except Exception as e:
+ logger.error(f"LinkedIn OAuth error: {e}")
+ return None
+
+
+ def linkedin_login():
+ """Handle LinkedIn login"""
+ auth_url = get_linkedin_auth_url()
+ if auth_url:
+ webbrowser.open(auth_url)
+ return "✅ Opening LinkedIn login in browser...", True
+ else:
+ return "⚠️ LinkedIn OAuth not configured or in mock mode", False
+
+
+ def search_adzuna_jobs(query: str = "Software Engineer", location: str = "London"):
+ """Search jobs using the Adzuna API"""
+ if ADZUNA_APP_ID and ADZUNA_APP_KEY:
+ try:
+ from services.job_aggregator import JobAggregator
+ aggregator = JobAggregator()
+
+ # Work around SSL interception on corporate networks: skip
+ # certificate verification for Adzuna calls only, and restore
+ # the original requests.get afterwards so the patch does not
+ # leak into unrelated requests.
+ import requests
+ import urllib3
+ old_get = requests.get
+ def patched_get(*args, **kwargs):
+ if 'adzuna' in str(args[0]):
+ kwargs['verify'] = False
+ urllib3.disable_warnings()
+ return old_get(*args, **kwargs)
+ requests.get = patched_get
+
+ try:
+ jobs = aggregator.search_adzuna(query, location)
+ finally:
+ requests.get = old_get
+ return jobs, f"✅ Found {len(jobs)} jobs from Adzuna"
+ except Exception as e:
+ logger.error(f"Adzuna search error: {e}")
+ return [], f"❌ Adzuna search failed: {str(e)}"
+ return [], "⚠️ Adzuna API not configured"
+
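The SSL workaround above monkeypatches `requests.get` for the whole process; a context-manager sketch (a hypothetical helper, not part of the app) makes the same patch self-reverting:

```python
import contextlib
import requests

@contextlib.contextmanager
def unverified_requests(host_fragment: str):
    """Temporarily disable TLS verification for URLs containing host_fragment.

    Illustrative only: wraps the same requests.get patch used for Adzuna
    calls, but guarantees the original function is restored on exit.
    """
    original_get = requests.get

    def patched_get(url, *args, **kwargs):
        # Only relax verification for the targeted host
        if host_fragment in str(url):
            kwargs["verify"] = False
        return original_get(url, *args, **kwargs)

    requests.get = patched_get
    try:
        yield
    finally:
        # Always undo the patch, even if the request raised
        requests.get = original_get
```

Usage would be `with unverified_requests("adzuna"): aggregator.search_adzuna(...)`, leaving `requests.get` untouched for every other caller.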
+
+ def list_jobs_options():
+ """Get list of available jobs with enhanced sources"""
+ try:
+ all_jobs = []
+
+ # Get LinkedIn/mock jobs
+ saved_jobs = orch.get_saved_jobs()
+ all_jobs.extend(saved_jobs)
+
+ # Add custom jobs
+ custom_jobs = STATE.get("custom_jobs", [])
+ all_jobs.extend(custom_jobs)
+
+ # Try to add Adzuna jobs if configured
+ if ADZUNA_APP_ID and ADZUNA_APP_KEY:
+ adzuna_jobs, _ = search_adzuna_jobs("Software Engineer", "Remote")
+ all_jobs.extend(adzuna_jobs[:10])  # Add top 10 Adzuna jobs
+
+ labels = [f"{j.title} — {j.company} ({j.location or 'N/A'}) [{j.source or 'custom'}]" for j in all_jobs]
+ return labels
+ except Exception as e:
+ logger.error(f"Error listing jobs: {e}")
+ return []
+
+
+ def generate(selected_labels: List[str]):
+ """Generate documents with advanced AI features"""
+ try:
+ if not selected_labels:
+ return "⚠️ Please select at least one job to process", None, gr.update(visible=False), gr.update(visible=False), gr.update(visible=False)
+
+ # Triage the request if observability is enabled
+ if ADVANCED_FEATURES and STATE.get("enable_observability") and agent_tracer:
+ routing = triage_agent.triage_request(f"Generate documents for {len(selected_labels)} jobs")
+ logger.info(f"Triage routing: {routing}")
+
+ # Map labels to job objects
+ all_jobs = orch.get_saved_jobs() + STATE.get("custom_jobs", [])
+
+ # Map both the plain and the source-tagged label to each job
+ label_to_job = {}
+ for j in all_jobs:
+ label = f"{j.title} — {j.company} ({j.location or 'N/A'})"
+ label_with_source = f"{label} [{j.source or 'custom'}]"
+ # Map both versions
+ label_to_job[label] = j
+ label_to_job[label_with_source] = j
+
+ jobs = [label_to_job[l] for l in selected_labels if l in label_to_job]
+
+ if not jobs:
+ # Match the five-value signature used by the other return paths
+ return "❌ No valid jobs found", None, gr.update(visible=False), gr.update(visible=False), gr.update(visible=False)
+
+ logger.info(f"Generating documents for {len(jobs)} jobs")
+
+ # Use context engineering if enabled
+ if ADVANCED_FEATURES and STATE.get("use_context_engineering") and context_engineer:
+ for job in jobs:
+ # Engineer optimal context for each job
+ context = context_engineer.engineer_context(
+ query=f"Generate resume and cover letter for {job.title} at {job.company}",
+ raw_sources=[
+ ("job_description", job.description),
+ ("cv_seed", STATE.get("cv_seed") or ""),
+ ("notes", STATE.get("agent2_notes") or "")
+ ]
+ )
+ # Store engineered context
+ job.metadata = job.metadata or {}
+ job.metadata['engineered_context'] = context
+
+ # Run generation (parallel or sequential)
+ start = time.time()
+
+ if ADVANCED_FEATURES and STATE.get("parallel_mode") and parallel_executor:
+ # Use parallel processing (parallel_executor is the module-level helper)
+ logger.info("Using parallel processing for document generation")
+ results = asyncio.run(parallel_executor.process_jobs_parallel(
+ jobs=jobs,
+ cv_agent_func=lambda j: orch.cv_agent.get_draft(j, STATE.get("cv_seed")),
+ cover_agent_func=lambda j: orch.cover_letter_agent.get_draft(j, STATE.get("cover_seed"))
+ ))
+ else:
+ # Standard sequential processing
+ results = orch.run_for_jobs(
+ jobs,
+ user_id=STATE.get("user_id", "default_user"),
+ cv_chat=STATE.get("cv_chat"),
+ cover_chat=STATE.get("cover_chat"),
+ cv_seed=STATE.get("cv_seed"),
+ cover_seed=STATE.get("cover_seed"),
+ agent2_notes=STATE.get("agent2_notes"),
+ inspiration_url=(STATE.get("inspiration_url") if STATE.get("use_inspiration") else None),
+ )
+
+ total_time = time.time() - start
+ STATE["results"] = results
+
+ # Track applications temporally if enabled
+ if ADVANCED_FEATURES and STATE.get("track_applications") and temporal_tracker:
+ for result in results:
+ temporal_tracker.track_application(result.job, "generated", {
+ 'generation_time': total_time,
+ 'parallel_mode': STATE.get("parallel_mode", False)
+ })
+
+ # Track in knowledge graph if available
+ if 'kg_service' in globals() and kg_service and kg_service.is_enabled():
+ for result in results:
+ try:
+ # Extract skills from job description
+ skills = []
+ if hasattr(result, 'matched_keywords'):
+ skills = result.matched_keywords
+ elif hasattr(result.job, 'description'):
+ # Simple skill extraction from job description
+ common_skills = ['python', 'java', 'javascript', 'react', 'node',
+ 'aws', 'azure', 'docker', 'kubernetes', 'sql',
+ 'machine learning', 'ai', 'data science']
+ job_desc_lower = result.job.description.lower()
+ skills = [s for s in common_skills if s in job_desc_lower]
+
+ # Track the application
+ kg_service.track_application(
+ user_name=STATE.get("user_name", "User"),
+ company=result.job.company,
+ job_title=result.job.title,
+ job_description=result.job.description,
+ cv_text=result.resume.text,
+ cover_letter=result.cover_letter.text,
+ skills_matched=skills,
+ score=getattr(result, 'match_score', 0.0)
+ )
+ logger.info(f"Tracked application in knowledge graph: {result.job.title} @ {result.job.company}")
+ except Exception as e:
+ logger.warning(f"Failed to track in knowledge graph: {e}")
+
+ # Record to context engineering flywheel
+ if ADVANCED_FEATURES and context_engineer:
+ for result in results:
+ if hasattr(result.job, 'metadata') and 'engineered_context' in result.job.metadata:
+ context_engineer.record_feedback(
+ result.job.metadata['engineered_context'],
+ result.resume.text[:500],  # Sample output
+ 0.8  # Success score (could be calculated)
+ )
+
+ # Build preview
+ blocks = [f"✅ Generated {len(results)} documents in {total_time:.2f}s\n"]
+ pptx_buttons = []
+
+ for i, res in enumerate(results):
+ blocks.append(f"### 📄 {res.job.title} — {res.job.company}")
+ blocks.append("**Resume Preview:**")
+ blocks.append("```")
+ blocks.append(res.resume.text[:1500] + "...")
+ blocks.append("```")
+ blocks.append("\n**Cover Letter Preview:**")
+ blocks.append("```")
+ blocks.append(res.cover_letter.text[:1000] + "...")
+ blocks.append("```")
+
+ # Add PowerPoint export option
+ blocks.append(f"\n**[📊 Export as PowerPoint CV - Job #{i+1}]**")
+ pptx_buttons.append((res.resume, res.job))
+
+ STATE["pptx_candidates"] = pptx_buttons
+ return "\n".join(blocks), total_time, gr.update(visible=True), gr.update(visible=True), gr.update(visible=True)
+
+ except Exception as e:
+ logger.error(f"Error generating documents: {e}")
+ return f"❌ Error: {str(e)}", None, gr.update(visible=False), gr.update(visible=False), gr.update(visible=False)
+
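`generate` maps both the plain and the source-tagged label back to each job so selections made before or after a refresh still resolve; the mapping can be sketched in isolation (the `Job` dataclass below is a minimal stand-in for the app's `JobPosting`):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Job:
    """Minimal stand-in for JobPosting, for illustration only."""
    title: str
    company: str
    location: Optional[str] = None
    source: Optional[str] = None

def build_label_map(jobs):
    """Map both the plain and the [source]-suffixed label to the same job."""
    label_to_job = {}
    for j in jobs:
        base = f"{j.title} — {j.company} ({j.location or 'N/A'})"
        label_to_job[base] = j
        label_to_job[f"{base} [{j.source or 'custom'}]"] = j
    return label_to_job
```

Both `"Dev — Acme (Remote)"` and `"Dev — Acme (Remote) [adzuna]"` then resolve to the same `Job`, which is why the UI can show source tags without breaking lookups.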
+
+ def regenerate_one(job_label: str):
+ """Regenerate documents for a single job"""
+ try:
+ if not job_label:
+ return "⚠️ Please select a job to regenerate", None
+
+ all_jobs = orch.get_saved_jobs() + STATE.get("custom_jobs", [])
+ # Dropdown labels carry a [source] suffix, so map both variants
+ label_to_job = {}
+ for j in all_jobs:
+ label = f"{j.title} — {j.company} ({j.location or 'N/A'})"
+ label_to_job[label] = j
+ label_to_job[f"{label} [{j.source or 'custom'}]"] = j
+ job = label_to_job.get(job_label)
+
+ if not job:
+ return f"❌ Job not found: {job_label}", None
+
+ start = time.time()
+ result = orch.regenerate_for_job(
+ job,
+ user_id=STATE.get("user_id", "default_user"),
+ cv_chat=STATE.get("cv_chat"),
+ cover_chat=STATE.get("cover_chat"),
+ cv_seed=STATE.get("cv_seed"),
+ cover_seed=STATE.get("cover_seed"),
+ agent2_notes=STATE.get("agent2_notes"),
+ inspiration_url=(STATE.get("inspiration_url") if STATE.get("use_inspiration") else None),
+ )
+ elapsed = time.time() - start
+
+ # Update state
+ new_results = []
+ for r in STATE.get("results", []):
+ if r.job.id == job.id:
+ new_results.append(result)
+ else:
+ new_results.append(r)
+ STATE["results"] = new_results
+
+ preview = f"### 🔄 Regenerated: {result.job.title} — {result.job.company}\n\n"
+ preview += "**Resume:**\n```\n" + result.resume.text[:1500] + "\n...```\n\n"
+ preview += "**Cover Letter:**\n```\n" + result.cover_letter.text[:1000] + "\n...```"
+
+ return preview, elapsed
+
+ except Exception as e:
+ logger.error(f"Error regenerating: {e}")
+ return f"❌ Error: {str(e)}", None
+
+
+ def export_to_powerpoint(job_index: int, template: str = "modern_blue"):
+ """Export resume to PowerPoint CV"""
+ try:
+ candidates = STATE.get("pptx_candidates", [])
+ if not candidates or job_index >= len(candidates):
+ return "❌ No resume available for export", None
+
+ resume, job = candidates[job_index]
+
+ # Import the PowerPoint CV generator
+ try:
+ from services.powerpoint_cv import convert_resume_to_powerpoint
+ pptx_path = convert_resume_to_powerpoint(resume, job, template)
+ if pptx_path:
+ return f"✅ PowerPoint CV created: {pptx_path}", pptx_path
+ return "❌ PowerPoint generation failed", None
+ except ImportError:
+ # Fallback to local generation
+ from pptx import Presentation
+ from pptx.util import Inches, Pt
+
+ prs = Presentation()
+
+ # Title slide
+ slide = prs.slides.add_slide(prs.slide_layouts[0])
+ slide.shapes.title.text = resume.sections.get("name", "Professional CV")
+ slide.placeholders[1].text = f"{resume.sections.get('title', '')}\n{resume.sections.get('email', '')}"
+
+ # Summary slide
+ slide = prs.slides.add_slide(prs.slide_layouts[1])
+ slide.shapes.title.text = "Professional Summary"
+ slide.placeholders[1].text = resume.sections.get("summary", "")[:500]
+
+ # Experience slide
+ slide = prs.slides.add_slide(prs.slide_layouts[1])
+ slide.shapes.title.text = "Professional Experience"
+ exp_text = []
+ for exp in resume.sections.get("experience", [])[:3]:
+ exp_text.append(f"• {exp.get('title', '')} @ {exp.get('company', '')}")
+ exp_text.append(f" {exp.get('dates', '')}")
+ slide.placeholders[1].text = "\n".join(exp_text)
+
+ # Skills slide
+ slide = prs.slides.add_slide(prs.slide_layouts[1])
+ slide.shapes.title.text = "Core Skills"
+ skills_text = []
+ for category, items in resume.sections.get("skills", {}).items():
+ if isinstance(items, list):
+ skills_text.append(f"{category}: {', '.join(items[:5])}")
+ slide.placeholders[1].text = "\n".join(skills_text)
+
+ # Save
+ output_path = f"cv_{job.company.replace(' ', '_')}_{template}.pptx"
+ prs.save(output_path)
+ return f"✅ PowerPoint CV created: {output_path}", output_path
+
+ except Exception as e:
+ logger.error(f"PowerPoint export error: {e}")
+ return f"❌ Export failed: {str(e)}", None
+
+
+ def extract_from_powerpoint(file_path: str):
+ """Extract content from uploaded PowerPoint"""
+ try:
+ from pptx import Presentation
+
+ prs = Presentation(file_path)
+ extracted_text = []
+
+ for slide in prs.slides:
+ for shape in slide.shapes:
+ if hasattr(shape, "text"):
+ text = shape.text.strip()
+ if text:
+ extracted_text.append(text)
+
+ combined_text = "\n".join(extracted_text)
+
+ # Use as CV seed
+ STATE["cv_seed"] = combined_text
+
+ return f"✅ Extracted {len(extracted_text)} text blocks from PowerPoint\n\nPreview:\n{combined_text[:500]}..."
+
+ except Exception as e:
+ logger.error(f"PowerPoint extraction error: {e}")
+ return f"❌ Extraction failed: {str(e)}"
+
+
+ def summary_table():
+ """Generate summary table"""
+ try:
+ import pandas as pd
+ res = STATE.get("results", [])
+ if not res:
+ return pd.DataFrame({"Status": ["No results yet. Generate documents first."]})
+
+ rows = []
+ for r in res:
+ m = r.metrics or {}
+ sal = m.get("salary", {})
+
+ # Handle different salary formats
+ usd = sal.get("USD", {})
+ gbp = sal.get("GBP", {})
+
+ rows.append({
+ "Job": f"{r.job.title} — {r.job.company}",
+ "Location": r.job.location or "N/A",
+ "USD": f"${usd.get('low', 0):,}-${usd.get('high', 0):,}" if usd else "N/A",
+ "GBP": f"£{gbp.get('low', 0):,}-£{gbp.get('high', 0):,}" if gbp else "N/A",
+ "Resume Score": f"{m.get('p_resume', 0):.1%}",
+ "Cover Score": f"{m.get('p_cover', 0):.1%}",
+ "Overall": f"{m.get('overall_p', 0):.1%}",
+ })
+ return pd.DataFrame(rows)
+ except ImportError:
+ # If pandas not available, return simple dict
+ return {"Error": ["pandas not installed - table view unavailable"]}
+ except Exception as e:
+ logger.error(f"Error generating summary: {e}")
+ return {"Error": [str(e)]}
+
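The USD/GBP cells above format a `{'low': ..., 'high': ...}` band per currency; a small helper (illustrative only, the band shape is assumed from the f-strings in `summary_table`) isolates that formatting:

```python
def format_salary_range(band: dict, symbol: str) -> str:
    """Render a {'low': int, 'high': int} band as e.g. '$90,000-$120,000'.

    Hypothetical helper mirroring the per-currency cells in summary_table;
    an empty or missing band collapses to 'N/A', as in the table code.
    """
    if not band:
        return "N/A"
    # The ',' format spec inserts thousands separators
    return f"{symbol}{band.get('low', 0):,}-{symbol}{band.get('high', 0):,}"
```

Factoring this out would also let the two currency columns share one code path instead of duplicating the conditional f-string.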
+
+ def build_app():
+ """Build the Gradio interface with LinkedIn OAuth and Adzuna integration"""
+ with gr.Blocks(
+ title="Job Application Assistant",
+ theme=gr.themes.Soft(),
+ css="""
+ .gradio-container { max-width: 1400px; margin: auto; }
+ """
+ ) as demo:
+ gr.Markdown("""
+ # 🚀 Multi-Agent Job Application Assistant
+ ### AI-Powered Resume & Cover Letter Generation with ATS Optimization
+ ### Now with LinkedIn OAuth + Adzuna Job Search!
+ """)
+
+ # System Status
+ status_items = []
+ if USE_SYSTEM_AGENTS:
+ status_items.append("✅ **Full System Mode**")
+ else:
+ status_items.append("⚠️ **Standalone Mode**")
+
+ if ADVANCED_FEATURES:
+ status_items.append("🚀 **Advanced AI Features**")
+
+ if LANGEXTRACT_AVAILABLE:
+ status_items.append("📊 **LangExtract Enhanced**")
+
+ if not MOCK_MODE and LINKEDIN_CLIENT_ID:
+ status_items.append("✅ **LinkedIn OAuth Ready**")
+ else:
+ status_items.append("⚠️ **LinkedIn in Mock Mode**")
+
+ if ADZUNA_APP_ID and ADZUNA_APP_KEY:
+ status_items.append("✅ **Adzuna API Active** (5000 jobs/month)")
+ else:
+ status_items.append("⚠️ **Adzuna Not Configured**")
+
+ gr.Markdown(" | ".join(status_items))
+
+ # Show advanced features if available; these helpers are module-level,
+ # so check globals() rather than locals()
+ if ADVANCED_FEATURES:
+ advanced_features = []
+ if 'parallel_executor' in globals():
+ advanced_features.append("⚡ Parallel Processing")
+ if 'temporal_tracker' in globals():
+ advanced_features.append("📊 Temporal Tracking")
+ if 'agent_tracer' in globals():
+ advanced_features.append("🔍 Observability")
+ if 'context_engineer' in globals():
+ advanced_features.append("🧠 Context Engineering")
+
+ if advanced_features:
+ gr.Markdown(f"**Advanced Features Available:** {' | '.join(advanced_features)}")
+
+ # Import enhanced UI components
+ try:
+ from services.enhanced_ui import (
+ create_enhanced_ui_components,
+ handle_resume_upload,
+ handle_linkedin_import,
+ handle_job_matching,
+ handle_document_export,
+ populate_ui_from_data,
+ format_job_matches_for_display,
+ generate_recommendations_markdown,
+ generate_skills_gap_analysis
+ )
+ ENHANCED_UI_AVAILABLE = True
+ except ImportError:
+ ENHANCED_UI_AVAILABLE = False
+ logger.warning("Enhanced UI components not available")
+
+ with gr.Row():
+ # Left column - Configuration
+ with gr.Column(scale=2):
+ gr.Markdown("## ⚙️ Configuration")
+
+ # Enhanced Resume Upload Section (if available)
+ if ENHANCED_UI_AVAILABLE:
+ ui_components = create_enhanced_ui_components()
+
+ # Create a wrapper function that properly handles the response
+ def process_resume_and_populate(file_path):
+ """Process resume upload and return extracted data for UI fields"""
+ if not file_path:
+ return populate_ui_from_data({})
+
+ try:
+ # Call handle_resume_upload to extract data
+ response = handle_resume_upload(file_path)
+
+ # Extract the data from the response
+ if response and isinstance(response, dict):
+ data = response.get('data', {})
+ # Return the populated fields
+ return populate_ui_from_data(data)
+ else:
+ return populate_ui_from_data({})
+ except Exception as e:
+ logger.error(f"Error processing resume: {e}")
+ return populate_ui_from_data({})
+
+ # Wire up the handlers - single function call
+ ui_components['extract_btn'].click(
+ fn=process_resume_and_populate,
+ inputs=[ui_components['resume_upload']],
+ outputs=[
+ ui_components['contact_name'],
+ ui_components['contact_email'],
+ ui_components['contact_phone'],
+ ui_components['contact_linkedin'],
+ ui_components['contact_location'],
+ ui_components['summary_text'],
+ ui_components['experience_data'],
+ ui_components['skills_list'],
+ ui_components['education_data']
+ ]
+ )
+
+ ui_components['linkedin_auto_fill'].click(
+ fn=handle_linkedin_import,
+ inputs=[ui_components['linkedin_url'], gr.State()],
+ outputs=[gr.State()]
+ ).then(
+ fn=populate_ui_from_data,
+ inputs=[gr.State()],
+ outputs=[
+ ui_components['contact_name'],
+ ui_components['contact_email'],
+ ui_components['contact_phone'],
+ ui_components['contact_linkedin'],
+ ui_components['contact_location'],
+ ui_components['summary_text'],
+ ui_components['experience_data'],
+ ui_components['skills_list'],
+ ui_components['education_data']
+ ]
+ )
+
+ # LinkedIn OAuth Section (keep existing)
+ elif not MOCK_MODE and LINKEDIN_CLIENT_ID:
+ with gr.Accordion("🔐 LinkedIn Authentication", open=True):
+ linkedin_status = gr.Textbox(
+ label="Status",
+ value="Not authenticated",
+ interactive=False
+ )
+ linkedin_btn = gr.Button("🔗 Sign in with LinkedIn", variant="primary")
+ linkedin_btn.click(
+ fn=linkedin_login,
+ outputs=[linkedin_status, gr.State()]
+ )
+
+ # Advanced AI Features Section
+ if ADVANCED_FEATURES:
+ with gr.Accordion("🚀 Advanced AI Features", open=True):
+ gr.Markdown("### AI Agent Enhancements")
+
+ with gr.Row():
+ parallel_mode = gr.Checkbox(
+ label="⚡ Parallel Processing (3-5x faster)",
+ value=STATE.get("parallel_mode", False)
+ )
+ track_apps = gr.Checkbox(
+ label="📊 Temporal Tracking",
+ value=STATE.get("track_applications", True)
+ )
+
+ with gr.Row():
+ observability = gr.Checkbox(
+ label="🔍 Observability & Tracing",
+ value=STATE.get("enable_observability", True)
+ )
+ context_eng = gr.Checkbox(
+ label="🧠 Context Engineering",
+ value=STATE.get("use_context_engineering", True)
+ )
+
+ def update_features(parallel, track, observe, context):
+ STATE["parallel_mode"] = parallel
+ STATE["track_applications"] = track
+ STATE["enable_observability"] = observe
+ STATE["use_context_engineering"] = context
+
+ features = []
+ if parallel: features.append("Parallel")
+ if track: features.append("Tracking")
+ if observe: features.append("Observability")
+ if context: features.append("Context Engineering")
+
+ return f"✅ Features enabled: {', '.join(features) if features else 'None'}"
+
+ features_status = gr.Textbox(label="Features Status", interactive=False)
+
+ # Pass all four checkboxes as inputs: reading `.value` inside a
+ # lambda only sees the initial value, not the live one
+ feature_inputs = [parallel_mode, track_apps, observability, context_eng]
+ for checkbox in feature_inputs:
+ checkbox.change(
+ fn=update_features,
+ inputs=feature_inputs,
+ outputs=features_status
+ )
+
+ with gr.Accordion("📝 Profile & Notes", open=True):
+ agent2_notes = gr.Textbox(
+ label="Additional Context",
+ value=STATE["agent2_notes"],
+ lines=4,
+ placeholder="E.g., visa requirements, years of experience, preferred technologies..."
+ )
+ def set_notes(n):
+ STATE["agent2_notes"] = n or ""
+ return "✅ Notes saved"
+ notes_result = gr.Textbox(label="Status", interactive=False)
+ agent2_notes.change(set_notes, inputs=agent2_notes, outputs=notes_result)
+
+ with gr.Accordion("📄 Resume Settings", open=False):
+ cv_chat = gr.Textbox(
+ label="Resume Instructions",
+ value=STATE["cv_chat"],
+ lines=3,
+ placeholder="E.g., Emphasize leadership experience..."
+ )
+
+ # PowerPoint Upload
+ gr.Markdown("### 📊 Upload PowerPoint to Extract Content")
+ pptx_upload = gr.File(
+ label="Upload PowerPoint (.pptx)",
+ file_types=[".pptx"],
+ type="filepath"
+ )
+ pptx_extract_btn = gr.Button("📥 Extract from PowerPoint")
+ pptx_extract_status = gr.Textbox(label="Extraction Status", interactive=False)
+
+ cv_seed = gr.Textbox(
+ label="Resume Template (optional)",
+ value=STATE["cv_seed"] or "",
+ lines=10,
+ placeholder="Paste your existing resume here or extract from PowerPoint..."
+ )
+
+ def set_cv(c, s):
+ STATE["cv_chat"] = c or ""
+ STATE["cv_seed"] = s or None
+ return "✅ Resume settings updated"
+
+ def handle_pptx_upload(file):
+ if file:
+ status = extract_from_powerpoint(file)
+ return status, STATE.get("cv_seed", "")
+ return "No file uploaded", STATE.get("cv_seed", "")
+
+ pptx_extract_btn.click(
+ fn=handle_pptx_upload,
+ inputs=pptx_upload,
+ outputs=[pptx_extract_status, cv_seed]
+ )
+
+ cv_info = gr.Textbox(label="Status", interactive=False)
+ # Pass both components as inputs; `.value` inside a lambda is stale
+ cv_chat.change(set_cv, inputs=[cv_chat, cv_seed], outputs=cv_info)
+ cv_seed.change(set_cv, inputs=[cv_chat, cv_seed], outputs=cv_info)
+
+ with gr.Accordion("✉️ Cover Letter Settings", open=False):
+ cover_chat = gr.Textbox(
+ label="Cover Letter Instructions",
+ value=STATE["cover_chat"],
+ lines=3,
+ placeholder="E.g., Professional tone, mention relocation..."
+ )
+ cover_seed = gr.Textbox(
+ label="Cover Letter Template (optional)",
+ value=STATE["cover_seed"] or "",
+ lines=10,
+ placeholder="Paste your existing cover letter here..."
+ )
+ def set_cover(c, s):
+ STATE["cover_chat"] = c or ""
+ STATE["cover_seed"] = s or None
+ return "✅ Cover letter settings updated"
+ cover_info = gr.Textbox(label="Status", interactive=False)
+ # As above, read live values by passing both components as inputs
+ cover_chat.change(set_cover, inputs=[cover_chat, cover_seed], outputs=cover_info)
+ cover_seed.change(set_cover, inputs=[cover_chat, cover_seed], outputs=cover_info)
+
+ gr.Markdown("## 💼 Jobs")
+
+ # Adzuna Job Search
+ if ADZUNA_APP_ID and ADZUNA_APP_KEY:
+ with gr.Accordion("🔍 Search Adzuna Jobs", open=True):
+ with gr.Row():
+ adzuna_query = gr.Textbox(
+ label="Job Title",
+ value="Software Engineer",
+ placeholder="e.g., Python Developer"
+ )
+ adzuna_location = gr.Textbox(
+ label="Location",
+ value="London",
+ placeholder="e.g., New York, Remote"
+ )
+
+ adzuna_search_btn = gr.Button("🔍 Search Adzuna", variant="primary")
+ adzuna_results = gr.Textbox(
+ label="Search Results",
+ lines=3,
+ interactive=False
+ )
+
+ def search_and_display(query, location):
+ jobs, message = search_adzuna_jobs(query, location)
+ # Add jobs to state
+ if jobs:
+ STATE["custom_jobs"].extend(jobs[:5])  # Add top 5 to available jobs
+ return message
+
+ adzuna_search_btn.click(
+ fn=search_and_display,
+ inputs=[adzuna_query, adzuna_location],
+ outputs=adzuna_results
+ )
+
+ with gr.Accordion("➕ Add Custom Job", open=True):
+ c_title = gr.Textbox(label="Job Title*", placeholder="e.g., Senior Software Engineer")
+ c_company = gr.Textbox(label="Company*", placeholder="e.g., Google")
+ c_loc = gr.Textbox(label="Location", placeholder="e.g., Remote, New York")
+ c_url = gr.Textbox(label="Job URL", placeholder="https://...")
+ c_desc = gr.Textbox(
+ label="Job Description*",
+ lines=8,
+ placeholder="Paste the complete job description here..."
+ )
+
+ with gr.Row():
+ add_job_btn = gr.Button("➕ Add Job", variant="primary")
+ load_example_btn = gr.Button("📝 Load Example")
+
+ add_job_info = gr.Textbox(label="Status", interactive=False)
+
+ def load_example():
+ return (
+ "Senior Software Engineer",
+ "Tech Corp",
+ "Remote",
+ "",
+ "We are looking for a Senior Software Engineer with 5+ years of experience in Python, AWS, and Docker. You will lead technical initiatives and build scalable systems."
+ )
+
+ load_example_btn.click(
+ fn=load_example,
+ outputs=[c_title, c_company, c_loc, c_url, c_desc]
+ )
+
+ add_job_btn.click(
+ fn=add_custom_job,
+ inputs=[c_title, c_company, c_loc, c_url, c_desc],
+ outputs=[add_job_info, c_title]
+ )
+
+ job_select = gr.CheckboxGroup(
+ choices=list_jobs_options(),
+ label="📋 Select Jobs to Process"
+ )
+ refresh_jobs = gr.Button("🔄 Refresh Job List")
+ refresh_jobs.click(lambda: gr.update(choices=list_jobs_options()), outputs=job_select)
+
+ # Right column - Generation
+ with gr.Column(scale=3):
+ gr.Markdown("## 📄 Document Generation")
+
+ gen_btn = gr.Button("🚀 Generate Documents", variant="primary", size="lg")
+ out_preview = gr.Markdown("Ready to generate documents...")
+ out_time = gr.Number(label="Processing Time (seconds)")
+
+ # PowerPoint Export Section
+ with gr.Accordion("📊 Export to PowerPoint CV", open=False, visible=False) as pptx_section:
+ gr.Markdown("### Convert your resume to a professional PowerPoint presentation")
+ with gr.Row():
+ pptx_job_select = gr.Number(
+ label="Job Index (1, 2, 3...)",
+ value=1,
+ minimum=1,
+ step=1
+ )
+ pptx_template = gr.Dropdown(
+ choices=["modern_blue", "corporate_gray", "elegant_green", "warm_red"],
+ value="modern_blue",
+ label="Template Style"
+ )
+
+ export_pptx_btn = gr.Button("📊 Create PowerPoint CV", variant="primary")
+ pptx_status = gr.Textbox(label="Export Status", interactive=False)
+ pptx_file = gr.File(label="Download PowerPoint", visible=False)
+
+ def handle_pptx_export(job_idx, template):
+ status, file_path = export_to_powerpoint(int(job_idx) - 1, template)
+ if file_path:
+ return status, gr.update(visible=True, value=file_path)
+ return status, gr.update(visible=False)
+
+ export_pptx_btn.click(
+ fn=handle_pptx_export,
+ inputs=[pptx_job_select, pptx_template],
+ outputs=[pptx_status, pptx_file]
+ )
+
+ # Word Document Export Section
+ with gr.Accordion("📝 Export to Word Documents", open=False, visible=False) as word_section:
+ gr.Markdown("### Generate professional Word documents")
+ with gr.Row():
+ word_job_select = gr.Number(
+ label="Job Index (1, 2, 3...)",
+ value=1,
+ minimum=1,
+ step=1
+ )
+ word_template = gr.Dropdown(
+ choices=["modern", "executive", "creative", "minimal", "academic"],
+ value="modern",
+ label="Document Style"
+ )
+
+ with gr.Row():
+ export_word_resume_btn = gr.Button("📄 Export Resume as Word", variant="primary")
+ export_word_cover_btn = gr.Button("✉️ Export Cover Letter as Word", variant="primary")
+
+ word_status = gr.Textbox(label="Export Status", interactive=False)
+ word_files = gr.File(label="Download Word Documents", visible=False, file_count="multiple")
+
+ def handle_word_export(job_idx, template, doc_type="resume"):
+ try:
+ from services.word_cv import WordCVGenerator
+ generator = WordCVGenerator()
+
+ candidates = STATE.get("pptx_candidates", [])
+ if not candidates or job_idx > len(candidates):
+ return "❌ No documents available", gr.update(visible=False)
+
+ resume, job = candidates[int(job_idx) - 1]
+
+ files = []
+ if doc_type == "resume" or doc_type == "both":
+ resume_path = generator.create_resume_document(resume, job, template)
+ if resume_path:
+ files.append(resume_path)
+
+ if doc_type == "cover" or doc_type == "both":
+ # Get cover letter from results
+ results = STATE.get("results", [])
+ cover_letter = None
+ for r in results:
+ if r.job.id == job.id:
+ cover_letter = r.cover_letter
+ break
+
+ if cover_letter:
+ cover_path = generator.create_cover_letter_document(cover_letter, job, template)
+ if cover_path:
+ files.append(cover_path)
+
+ if files:
+ return f"✅ Created {len(files)} Word document(s)", gr.update(visible=True, value=files)
+ return "❌ Failed to create documents", gr.update(visible=False)
+
+ except Exception as e:
+ return f"❌ Error: {str(e)}", gr.update(visible=False)
+
+ export_word_resume_btn.click(
+ fn=lambda idx, tmpl: handle_word_export(idx, tmpl, "resume"),
+ inputs=[word_job_select, word_template],
+ outputs=[word_status, word_files]
+ )
+
+ export_word_cover_btn.click(
+ fn=lambda idx, tmpl: handle_word_export(idx, tmpl, "cover"),
+ inputs=[word_job_select, word_template],
+ outputs=[word_status, word_files]
+ )
+
1142
+ # Excel Tracker Export
1143
+ with gr.Accordion("πŸ“Š Export Excel Tracker", open=False, visible=False) as excel_section:
1144
+ gr.Markdown("### Create comprehensive job application tracker")
1145
+
1146
+ export_excel_btn = gr.Button("πŸ“ˆ Generate Excel Tracker", variant="primary")
1147
+ excel_status = gr.Textbox(label="Export Status", interactive=False)
1148
+ excel_file = gr.File(label="Download Excel Tracker", visible=False)
1149
+
1150
+ def handle_excel_export():
1151
+ try:
1152
+ from services.excel_tracker import ExcelTracker
1153
+ tracker = ExcelTracker()
1154
+
1155
+ results = STATE.get("results", [])
1156
+ if not results:
1157
+ return "❌ No results to track", gr.update(visible=False)
1158
+
1159
+ tracker_path = tracker.create_tracker(results)
1160
+ if tracker_path:
1161
+ return f"βœ… Excel tracker created with {len(results)} applications", gr.update(visible=True, value=tracker_path)
1162
+ return "❌ Failed to create tracker", gr.update(visible=False)
1163
+
1164
+ except Exception as e:
1165
+ return f"❌ Error: {str(e)}", gr.update(visible=False)
1166
+
1167
+ export_excel_btn.click(
1168
+ fn=handle_excel_export,
1169
+ outputs=[excel_status, excel_file]
1170
+ )
1171
+
1172
+ gen_btn.click(fn=generate, inputs=[job_select], outputs=[out_preview, out_time, pptx_section, word_section, excel_section])
1173
+
+ gr.Markdown("## 🔄 Regenerate Individual Job")
+
+ with gr.Row():
+ job_single = gr.Dropdown(choices=list_jobs_options(), label="Select Job")
+ refresh_single = gr.Button("🔄")
+
+ refresh_single.click(lambda: gr.update(choices=list_jobs_options()), outputs=job_single)
+
+ regen_btn = gr.Button("🔄 Regenerate Selected Job")
+ regen_preview = gr.Markdown()
+ regen_time = gr.Number(label="Regeneration Time (seconds)")
+ regen_btn.click(fn=regenerate_one, inputs=[job_single], outputs=[regen_preview, regen_time])
+
+ gr.Markdown("## 📊 Results Summary")
+
+ update_summary = gr.Button("📊 Update Summary")
+ table = gr.Dataframe(value=summary_table(), interactive=False)
+ update_summary.click(fn=summary_table, outputs=table)
+
+ if 'kg_service' in globals() and kg_service and kg_service.is_enabled():
1195
+ with gr.Accordion("πŸ“Š Knowledge Graph & Application Tracking", open=False):
1196
+ gr.Markdown("""
1197
+ ### 🧠 Application Knowledge Graph
1198
+ Track your job applications, skills, and patterns over time.
1199
+ """)
1200
+
1201
+ with gr.Row():
1202
+ with gr.Column(scale=1):
1203
+ kg_user_name = gr.Textbox(
1204
+ label="Your Name",
1205
+ value=STATE.get("user_name", "User"),
1206
+ placeholder="Enter your name for tracking"
1207
+ )
1208
+
1209
+ def update_user_name(name):
1210
+ STATE["user_name"] = name
1211
+ return f"Tracking as: {name}"
1212
+
1213
+ kg_user_status = gr.Markdown("Enter your name to start tracking")
1214
+ kg_user_name.change(update_user_name, inputs=[kg_user_name], outputs=[kg_user_status])
1215
+
1216
+ gr.Markdown("### πŸ“ˆ Quick Actions")
1217
+
1218
+ show_history_btn = gr.Button("πŸ“œ Show My History", variant="primary", size="sm")
1219
+ show_trends_btn = gr.Button("πŸ“Š Show Skill Trends", variant="secondary", size="sm")
1220
+ show_insights_btn = gr.Button("πŸ’‘ Company Insights", variant="secondary", size="sm")
1221
+
1222
+ with gr.Column(scale=2):
1223
+ kg_output = gr.JSON(label="Knowledge Graph Data", visible=True)
1224
+
1225
+ def show_user_history(user_name):
1226
+ if kg_service and kg_service.is_enabled():
1227
+ history = kg_service.get_user_history(user_name)
1228
+ return history
1229
+ return {"error": "Knowledge graph not available"}
1230
+
1231
+ def show_skill_trends():
1232
+ if kg_service and kg_service.is_enabled():
1233
+ trends = kg_service.get_skill_trends()
1234
+ return trends
1235
+ return {"error": "Knowledge graph not available"}
1236
+
1237
+ def show_company_insights():
1238
+ if kg_service and kg_service.is_enabled():
1239
+ # Get insights for all companies user applied to
1240
+ history = kg_service.get_user_history(STATE.get("user_name", "User"))
1241
+ companies = set()
1242
+ for app in history.get("applications", []):
1243
+ if isinstance(app, dict) and "properties" in app:
1244
+ company = app["properties"].get("company")
1245
+ if company:
1246
+ companies.add(company)
1247
+
1248
+ insights = {}
1249
+ for company in list(companies)[:5]: # Limit to 5 companies
1250
+ insights[company] = kg_service.get_company_insights(company)
1251
+ return insights if insights else {"message": "No companies found in history"}
1252
+ return {"error": "Knowledge graph not available"}
1253
+
1254
+ show_history_btn.click(
1255
+ show_user_history,
1256
+ inputs=[kg_user_name],
1257
+ outputs=[kg_output]
1258
+ )
1259
+
1260
+ show_trends_btn.click(
1261
+ show_skill_trends,
1262
+ inputs=[],
1263
+ outputs=[kg_output]
1264
+ )
1265
+
1266
+ show_insights_btn.click(
1267
+ show_company_insights,
1268
+ inputs=[],
1269
+ outputs=[kg_output]
1270
+ )
1271
+
1272
+ gr.Markdown("""
1273
+ ### πŸ“Š Features:
1274
+ - **Application History**: Track all your job applications
1275
+ - **Skill Analysis**: See which skills are in demand
1276
+ - **Company Insights**: Learn about companies you've applied to
1277
+ - **Pattern Recognition**: Identify successful application patterns
1278
+ - All data stored locally in SQLite - no external dependencies!
1279
+ """)
1280
+
1281
+ # Enhanced Extraction with LangExtract
+ if LANGEXTRACT_AVAILABLE:
+ with gr.Accordion("🔍 Enhanced Job Analysis (LangExtract)", open=False):
+ gr.Markdown("### AI-Powered Job & Resume Analysis")
+
+ with gr.Tabs():
+ # Job Analysis Tab
+ with gr.TabItem("📋 Job Analysis"):
+ job_analysis_text = gr.Textbox(
+ label="Paste Job Description",
+ lines=10,
+ placeholder="Paste the full job description here for analysis..."
+ )
+ analyze_job_btn = gr.Button("🔍 Analyze Job", variant="primary")
+ job_analysis_output = gr.Markdown()
+
+ def analyze_job(text):
+ if not text:
+ return "Please paste a job description"
+
+ job = extract_job_info(text)
+ keywords = extract_ats_keywords(text)
+
+ output = create_extraction_summary(job)
+ output += "\n\n### 🎯 ATS Keywords\n"
+ output += f"**High Priority:** {', '.join(keywords.high_priority[:10]) or 'None'}\n"
+ output += f"**Medium Priority:** {', '.join(keywords.medium_priority[:10]) or 'None'}\n"
+
+ return output
+
+ analyze_job_btn.click(
+ fn=analyze_job,
+ inputs=job_analysis_text,
+ outputs=job_analysis_output
+ )
+
+ # ATS Optimization Tab
+ with gr.TabItem("🎯 ATS Optimizer"):
+ gr.Markdown("Compare your resume against job requirements")
+ with gr.Row():
+ ats_resume = gr.Textbox(
+ label="Your Resume",
+ lines=10,
+ placeholder="Paste your resume text..."
+ )
+ ats_job = gr.Textbox(
+ label="Job Description",
+ lines=10,
+ placeholder="Paste the job description..."
+ )
+
+ optimize_btn = gr.Button("🎯 Optimize for ATS", variant="primary")
+ ats_report = gr.Markdown()
+
+ def run_ats_optimization(resume, job):
+ if not resume or not job:
+ return "Please provide both resume and job description"
+
+ result = optimize_for_ats(resume, job)
+ return create_ats_report(result)
+
+ optimize_btn.click(
+ fn=run_ats_optimization,
+ inputs=[ats_resume, ats_job],
+ outputs=ats_report
+ )
+
+ # Bulk Analysis Tab
+ with gr.TabItem("📊 Bulk Analysis"):
+ gr.Markdown("Analyze multiple jobs at once")
+ bulk_jobs_text = gr.Textbox(
+ label="Paste Multiple Job Descriptions (separated by ---)",
+ lines=15,
+ placeholder="Job 1...\n---\nJob 2...\n---\nJob 3..."
+ )
+ bulk_analyze_btn = gr.Button("📊 Analyze All Jobs", variant="primary")
+ bulk_output = gr.Markdown()
+
+ def analyze_bulk_jobs(text):
+ if not text:
+ return "Please paste job descriptions"
+
+ jobs = text.split("---")
+ results = []
+
+ for i, job_text in enumerate(jobs, 1):
+ if job_text.strip():
+ job = extract_job_info(job_text)
+ results.append(f"### Job {i}: {job.title or 'Unknown'}")
+ results.append(f"**Company:** {job.company or 'Unknown'}")
+ results.append(f"**Skills:** {', '.join(job.skills[:5]) or 'None detected'}")
+ results.append("")
+
+ return "\n".join(results) if results else "No valid jobs found"
+
+ bulk_analyze_btn.click(
+ fn=analyze_bulk_jobs,
+ inputs=bulk_jobs_text,
+ outputs=bulk_output
+ )
+
+ # Advanced Features Results
+ if ADVANCED_FEATURES:
+ with gr.Accordion("🎯 Advanced Analytics", open=False):
+ with gr.Tabs():
+ # Execution Timeline Tab
+ with gr.TabItem("⚡ Execution Timeline"):
+ show_timeline_btn = gr.Button("📊 Generate Timeline")
+ timeline_image = gr.Image(label="Parallel Execution Timeline", visible=False)
+
+ def show_execution_timeline():
+ if parallel_executor and hasattr(parallel_executor, 'execution_history'):
+ try:
+ import matplotlib.pyplot as plt
+ fig = parallel_executor.plot_timeline()
+ timeline_path = "execution_timeline.png"
+ fig.savefig(timeline_path)
+ plt.close(fig)
+ return gr.update(visible=True, value=timeline_path)
+ except Exception as e:
+ logger.error(f"Timeline generation error: {e}")
+ return gr.update(visible=False)
+
+ show_timeline_btn.click(fn=show_execution_timeline, outputs=timeline_image)
+
+ # Application History Tab
+ with gr.TabItem("📜 Application History"):
+ history_btn = gr.Button("📋 Show History")
+ history_text = gr.Textbox(label="Application Timeline", lines=10, interactive=False)
+
+ def show_application_history():
+ if temporal_tracker:
+ try:
+ active = temporal_tracker.get_active_applications()
+ patterns = temporal_tracker.analyze_patterns()
+
+ history = "📊 Application Patterns:\n"
+ history += f"• Total applications: {patterns.get('total_applications', 0)}\n"
+ history += f"• This week: {patterns.get('applications_this_week', 0)}\n"
+ history += f"• Response rate: {patterns.get('response_rate', '0%')}\n\n"
+
+ history += "📋 Active Applications:\n"
+ for app in active[:5]:
+ history += f"• {app['company']} - {app['position']} ({app['status']})\n"
+
+ return history
+ except Exception as e:
+ return f"Error retrieving history: {e}"
+ return "Temporal tracking not available"
+
+ history_btn.click(fn=show_application_history, outputs=history_text)
+
+ # Observability Tab
+ with gr.TabItem("🔍 Agent Tracing"):
+ trace_btn = gr.Button("📝 Show Agent Trace")
+ trace_text = gr.Textbox(label="Agent Interaction Flow", lines=15, interactive=False)
+
+ def show_agent_trace():
+ if agent_tracer:
+ try:
+ import io
+ from contextlib import redirect_stdout
+
+ f = io.StringIO()
+ with redirect_stdout(f):
+ agent_tracer.print_interaction_flow()
+
+ trace_output = f.getvalue()
+
+ # Also get metrics
+ metrics = agent_tracer.get_metrics()
+ trace_output += f"\n\n📊 Metrics:\n"
+ trace_output += f"• Total events: {metrics['total_events']}\n"
+ trace_output += f"• Agents involved: {metrics['agents_involved']}\n"
+ trace_output += f"• Tool calls: {metrics['tool_calls']}\n"
+ trace_output += f"• Errors: {metrics['errors']}\n"
+
+ return trace_output
+ except Exception as e:
+ return f"Error generating trace: {e}"
+ return "Observability not available"
+
+ trace_btn.click(fn=show_agent_trace, outputs=trace_text)
+
+ # Context Engineering Tab
+ with gr.TabItem("🧠 Context Insights"):
+ context_btn = gr.Button("📊 Show Context Stats")
+ context_text = gr.Textbox(label="Context Engineering Insights", lines=10, interactive=False)
+
+ def show_context_insights():
+ if context_engineer:
+ try:
+ # Get flywheel recommendations
+ sample_query = "Generate resume for software engineer"
+ recommended = context_engineer.flywheel.get_recommended_sources(sample_query)
+
+ insights = "🧠 Context Engineering Insights:\n\n"
+ insights += f"📊 Flywheel Learning:\n"
+ insights += f"• Successful contexts: {len(context_engineer.flywheel.successful_contexts)}\n"
+ insights += f"• Pattern cache size: {len(context_engineer.flywheel.pattern_cache)}\n\n"
+
+ if recommended:
+ insights += f"💡 Recommended sources for '{sample_query}':\n"
+ for source in recommended:
+ insights += f" • {source}\n"
+
+ # Memory hierarchy stats
+ insights += f"\n📚 Memory Hierarchy:\n"
+ insights += f"• L1 Cache: {len(context_engineer.memory.l1_cache)} items\n"
+ insights += f"• L2 Memory: {len(context_engineer.memory.l2_memory)} items\n"
+ insights += f"• L3 Storage: {len(context_engineer.memory.l3_index)} indexed\n"
+
+ return insights
+ except Exception as e:
+ return f"Error getting insights: {e}"
+ return "Context engineering not available"
+
+ context_btn.click(fn=show_context_insights, outputs=context_text)
+
+ # Configuration status
+ config_status = []
+
+ # LinkedIn OAuth
+ if not MOCK_MODE and LINKEDIN_CLIENT_ID:
+ config_status.append(f"✅ LinkedIn OAuth ({LINKEDIN_CLIENT_ID[:8]}...)")
+
+ # Adzuna
+ if ADZUNA_APP_ID and ADZUNA_APP_KEY:
+ config_status.append(f"✅ Adzuna API ({ADZUNA_APP_ID})")
+
+ # Gemini
+ if os.getenv("GEMINI_API_KEY"):
+ config_status.append("✅ Gemini AI")
+
+ # Tavily
+ if os.getenv("TAVILY_API_KEY"):
+ config_status.append("✅ Tavily Research")
+
+ if not config_status:
+ config_status.append("ℹ️ Add API keys to .env for full functionality")
+
+ gr.Markdown(f"""
+ ---
+ ### 🔧 Active Services: {' | '.join(config_status)}
+
+ ### 💡 Quick Start:
+ 1. **Sign in** with LinkedIn (if configured)
+ 2. **Search** for jobs on Adzuna or add custom jobs
+ 3. **Configure** advanced features (if available)
+ 4. **Select** jobs and click "Generate Documents"
+ 5. **Review** AI-generated resume and cover letter
+ 6. **Export** to Word/PowerPoint/Excel
+ 7. **Analyze** with advanced analytics (if enabled)
+
+ ### 📊 Current Capabilities:
+ - **Job Sources**: {
+ 'Adzuna (5000/month)' if ADZUNA_APP_ID else 'Mock Data'
+ }
+ - **Authentication**: {
+ 'LinkedIn OAuth' if not MOCK_MODE and LINKEDIN_CLIENT_ID else 'Mock Mode'
+ }
+ - **AI Generation**: {
+ 'Gemini' if os.getenv("GEMINI_API_KEY") else 'Template Mode'
+ }
+ - **Advanced AI**: {
+ 'Parallel + Temporal + Observability + Context' if ADVANCED_FEATURES else 'Not Available'
+ }
+
+ ### 🚀 Performance Enhancements:
+ - **Parallel Processing**: 3-5x faster document generation
+ - **Temporal Tracking**: Complete application history with versioning
+ - **Observability**: Full agent tracing and debugging
+ - **Context Engineering**: Continuous learning and optimization
+ - **Memory Hierarchy**: L1/L2/L3 caching for instant retrieval
+ - **Compression**: Handle 1M+ tokens with intelligent scaling
+ """)
+
+ return demo
+
+
+ if __name__ == "__main__":
+ print("=" * 60)
+ print("Job Application Assistant - Gradio Interface")
+ print("=" * 60)
+
+ # Check configuration
+ if USE_SYSTEM_AGENTS:
+ print("✅ Full system mode - all features available")
+ else:
+ print("⚠️ Standalone mode - basic features only")
+ print(" Place this file in the project directory for full features")
+
+ if ADVANCED_FEATURES:
+ print("🚀 Advanced AI Agent Features Loaded:")
+ print(" ⚡ Parallel Processing (3-5x faster)")
+ print(" 📊 Temporal Tracking (complete history)")
+ print(" 🔍 Observability (full tracing)")
+ print(" 🧠 Context Engineering (continuous learning)")
+ print(" 📈 Context Scaling (1M+ tokens)")
+
+ if os.getenv("GEMINI_API_KEY"):
+ print("✅ Gemini API configured")
+ else:
+ print("ℹ️ No Gemini API key - using fallback generation")
+
+ if os.getenv("TAVILY_API_KEY"):
+ print("✅ Tavily API configured for web research")
+
+ if ADZUNA_APP_ID:
+ print("✅ Adzuna API configured for job search")
+
+ if LINKEDIN_CLIENT_ID:
+ print("✅ LinkedIn OAuth configured")
+
+ print("\nStarting Gradio app...")
+ print("=" * 60)
+
+ try:
+ app = build_app()
+ app.launch(
+ server_name="0.0.0.0",
+ server_port=int(os.getenv("PORT", 7860)),
+ share=False,
+ show_error=True
+ )
+ except Exception as e:
+ logger.error(f"Failed to start app: {e}")
+ print(f"\n❌ Error: {e}")
+ print("\nTroubleshooting:")
+ print("1. Install required packages: pip install gradio pandas python-dotenv")
+ print("2. Check your .env file exists and is valid")
+ print("3. Ensure port 7860 is not in use")
+ raise
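The startup banner above gates each service message on an environment variable and collects the results into a status list. That gate-and-collect pattern can be sketched standalone (`service_status` is an illustrative helper written for this sketch, not part of the repo; the variable names mirror the app):

```python
import os

def service_status(env: dict) -> list:
    # Map env var -> human-readable label; fall back to a hint when nothing is set.
    checks = [
        ("GEMINI_API_KEY", "Gemini AI"),
        ("TAVILY_API_KEY", "Tavily Research"),
    ]
    status = [f"OK {label}" for var, label in checks if env.get(var)]
    return status or ["Add API keys to .env for full functionality"]

# Demo: merge a fake key over the real environment.
print(service_status({**dict(os.environ), "GEMINI_API_KEY": "demo"}))
```

Keeping the checks in a list of pairs makes adding a new service a one-line change, which is why the app builds `config_status` the same way before rendering it.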
mcp/__init__.py ADDED
@@ -0,0 +1 @@
+ # mcp servers package
mcp/__pycache__/__init__.cpython-313.pyc ADDED
Binary file (114 Bytes)
mcp/__pycache__/cover_letter_server.cpython-313.pyc ADDED
Binary file (1.41 kB)
mcp/__pycache__/cv_owner_server.cpython-313.pyc ADDED
Binary file (1.37 kB)
mcp/__pycache__/orchestrator_server.cpython-313.pyc ADDED
Binary file (1.92 kB)
mcp/__pycache__/server_common.cpython-313.pyc ADDED
Binary file (1.73 kB)
mcp/cover_letter_server.py ADDED
@@ -0,0 +1,27 @@
+ from __future__ import annotations
+ from mcp.server import Server
+
+ from mcp.server_common import create_common_tools, run_server
+ from agents.cover_letter_agent import CoverLetterAgent
+ from agents.linkedin_manager import LinkedInManagerAgent
+
+
+ def build_server() -> Server:
+ server = Server("cover_letter_mcp")
+ create_common_tools(server)
+
+ agent = CoverLetterAgent()
+ li = LinkedInManagerAgent()
+
+ @server.tool()
+ async def draft_cover_letter(job_id: str, user_id: str = "default_user") -> str:
+ job = li.get_job(job_id)
+ profile = li.get_profile()
+ draft = agent.create_cover_letter(profile, job, user_id=user_id)
+ return draft.text
+
+ return server
+
+
+ if __name__ == "__main__":
+ run_server(build_server())
mcp/cv_owner_server.py ADDED
@@ -0,0 +1,27 @@
+ from __future__ import annotations
+ from mcp.server import Server
+
+ from mcp.server_common import create_common_tools, run_server
+ from agents.cv_owner import CVOwnerAgent
+ from agents.linkedin_manager import LinkedInManagerAgent
+
+
+ def build_server() -> Server:
+ server = Server("cv_owner_mcp")
+ create_common_tools(server)
+
+ cv = CVOwnerAgent()
+ li = LinkedInManagerAgent()
+
+ @server.tool()
+ async def draft_resume(job_id: str, user_id: str = "default_user") -> str:
+ job = li.get_job(job_id)
+ profile = li.get_profile()
+ draft = cv.create_resume(profile, job, user_id=user_id)
+ return draft.text
+
+ return server
+
+
+ if __name__ == "__main__":
+ run_server(build_server())
mcp/orchestrator_server.py ADDED
@@ -0,0 +1,31 @@
+ from __future__ import annotations
+ from typing import List
+ from mcp.server import Server
+
+ from mcp.server_common import create_common_tools, run_server
+ from agents.orchestrator import OrchestratorAgent
+ from models.schemas import JobPosting
+
+
+ def build_server() -> Server:
+ server = Server("orchestrator_mcp")
+ create_common_tools(server)
+
+ orch = OrchestratorAgent()
+
+ @server.tool()
+ async def list_jobs() -> List[dict]:
+ jobs: List[JobPosting] = orch.get_saved_jobs()
+ return [job.model_dump() for job in jobs]
+
+ @server.tool()
+ async def run_for_jobs(job_ids: List[str], user_id: str = "default_user") -> List[dict]:
+ jobs = [j for j in orch.get_saved_jobs() if j.id in job_ids]
+ results = orch.run_for_jobs(jobs, user_id=user_id)
+ return [r.model_dump() for r in results]
+
+ return server
+
+
+ if __name__ == "__main__":
+ run_server(build_server())
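The orchestrator tools return plain dicts by calling pydantic's `model_dump()` on each `JobPosting`, so the payload is JSON-serializable over the MCP transport. A minimal stand-in for that filter-then-serialize shape using stdlib dataclasses (the `JobPosting` fields here are illustrative, not the real schema, and `asdict()` plays the role of `model_dump()`):

```python
from dataclasses import dataclass, field, asdict
from typing import List

# Illustrative stand-in for models.schemas.JobPosting (the real model is pydantic).
@dataclass
class JobPosting:
    id: str
    title: str
    skills: List[str] = field(default_factory=list)

def run_for_jobs_payload(jobs: List[JobPosting], job_ids: List[str]) -> List[dict]:
    # Filter saved jobs by id, then serialize each to a plain dict.
    return [asdict(j) for j in jobs if j.id in job_ids]

jobs = [JobPosting("a1", "ML Engineer", ["python"]), JobPosting("b2", "Data Analyst")]
print(run_for_jobs_payload(jobs, ["a1"]))
```

Returning dicts rather than model instances keeps the tool contract independent of the pydantic version on the client side.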
mcp/server_common.py ADDED
@@ -0,0 +1,25 @@
+ from __future__ import annotations
+ import asyncio
+
+ from mcp.server import Server
+
+ from services.web_research import get_role_guidelines
+ from services.llm import llm
+
+
+ def create_common_tools(server: Server) -> None:
+ @server.tool()
+ async def research_guidelines(role_title: str, job_description: str) -> str:
+ """Fetch latest best-practice guidance for a role (uses Tavily if configured)."""
+ return get_role_guidelines(role_title, job_description)
+
+ @server.tool()
+ async def llm_refine(system_prompt: str, user_prompt: str, max_tokens: int = 800) -> str:
+ """Refine a text snippet using the configured LLM provider (OpenAI/Anthropic/Gemini)."""
+ return llm.generate(system_prompt, user_prompt, max_tokens=max_tokens)
+
+
+ def run_server(server: Server, host: str = "127.0.0.1", port: int = 8765) -> None:
+ # Minimal stdio run loop for development embedding;
+ # host/port are reserved for future transports and unused by stdio.
+ asyncio.run(server.run_stdio_async())
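`create_common_tools` assumes `Server.tool()` acts as a decorator that registers an async function under its own name, so every server built from it shares the same tool set. That registry mechanism can be modeled in a few lines (this `Server` class is a hypothetical stand-in for the `mcp` SDK, which differs in detail; `research_guidelines` here returns a canned string instead of calling Tavily):

```python
import asyncio
from typing import Any, Callable, Coroutine, Dict

class Server:
    """Toy stand-in for mcp.server.Server: a name -> coroutine registry."""
    def __init__(self, name: str) -> None:
        self.name = name
        self.tools: Dict[str, Callable[..., Coroutine[Any, Any, Any]]] = {}

    def tool(self):
        def register(fn):
            self.tools[fn.__name__] = fn  # expose the coroutine under its name
            return fn
        return register

def create_common_tools(server: Server) -> None:
    # Same shape as the real helper: tools are closures over shared services.
    @server.tool()
    async def research_guidelines(role_title: str, job_description: str) -> str:
        return f"Guidelines for {role_title}"

server = Server("demo_mcp")
create_common_tools(server)
result = asyncio.run(server.tools["research_guidelines"]("Data Engineer", "..."))
print(result)  # Guidelines for Data Engineer
```

Because each server instance owns its own registry, calling `create_common_tools` on several servers (as the cover-letter, CV, and orchestrator servers do) never causes name collisions between them.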
memory/__init__.py ADDED
@@ -0,0 +1 @@
+ # memory package
memory/__pycache__/__init__.cpython-313.pyc ADDED
Binary file (186 Bytes)
memory/__pycache__/store.cpython-313.pyc ADDED
Binary file (7.15 kB)
memory/data/anthony_test__capco_lead_ai_2024__cover_letter.json ADDED
@@ -0,0 +1,9 @@
+ {
+ "job_id": "capco_lead_ai_2024",
+ "final": true,
+ "keywords_used": [
+ "architectures",
+ "agent"
+ ],
+ "draft": "With experience across Python, LLMs, GPT, Claude, Gemma, Multi-modal Models, RAG, Prompt Engineering, I can quickly contribute to your team. I value impact, ownership Relevant focus: mlops\n\nRelevant focus: agent, architectures"
+ }
memory/data/anthony_test__capco_lead_ai_2024__cv_owner.json ADDED
@@ -0,0 +1,45 @@
+ {
+ "job_id": "capco_lead_ai_2024",
+ "cycle": 1,
+ "coverage": 0.5384615384615384,
+ "conciseness": 1.0,
+ "keywords_used": [
+ "frameworks",
+ "architectures",
+ "agent",
+ "prompt engineering",
+ "financial",
+ "ai deployment",
+ "multi",
+ "advanced",
+ "rag",
+ "advanced prompt engineering",
+ "experience",
+ "model",
+ "prompt",
+ "deployment",
+ "solutions",
+ "production",
+ "advanced prompt",
+ "mlops",
+ "engineering",
+ "systems",
+ "agentic"
+ ],
+ "guidance": "Use concise, achievement-oriented bullets with metrics; prioritize recent, role-relevant skills; ensure ATS-friendly formatting; avoid images/tables; tailor keywords to the job posting; keep resume to 1-2 pages and cover letter to <= 1 page; reflect current tooling (e.g., modern cloud, MLOps/DevOps practices) only if you have real experience.",
+ "user_chat": "Emphasize multi-agent AI systems and production LLM deployment",
+ "agent2_notes": "British/Australian citizen, no visa required. CQF certified.",
+ "draft": "- CORE TECHNICAL COMPETENCIES\n━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\nβ€’ AI/ML Engineering: Python, LLMs (GPT, Claude, Gemma), Multi-modal Models, RAG, Prompt Engineering\nβ€’ Agentic Systems: Multi-agent AI Architectures, Autonomous Workflows, API Integration\nβ€’ MLOps & Deployment: Production AI Pipelines, Model Optimization, Cloud AI (AWS, GCP, Azure)\nβ€’ Scalable Systems: Full-stack Applications, API Development, Performance Optimization\nβ€’ Frameworks: Experience with LangChain/LlamaIndex patterns, Model Context Protocol\nβ€’ Financial Services: HSBC, AmEx, Quantitative Finance (CQF - 87%), Regulatory Compliance\n\nPROFESSIONAL EXPERIENCE\n━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\nCognizant, London, UK 2021 - Present\nAI Value Engineer - Associate Director | Lead GenAI Solution Architect\n\nProduction AI & MLOps Leadership:\nβ€’ Architected and deployed autonomous AI systems for Tier 1 financial institutions (HSBC, AmEx)\n implementing production-grade LLM solutions with 99.9% uptime\nβ€’ Built scalable MLOps pipelines processing Β£100k-Β£1M monthly transactions across government,\n healthcare, and financial services sectors\nβ€’ Pioneered multi-agent AI systems in August 2024, implementing agentic workflows before \n industry-wide adoption\n\nTechnical Innovation & Optimization:\nβ€’ Developed RAG architectures with advanced prompt engineering reducing response latency by 60%\nβ€’ Fine-tuned and optimized multi-modal models achieving 90% accuracy in specialized domains\nβ€’ Implemented Model Context Protocol for hallucination mitigation in production systems\nβ€’ Created full-stack AI applications integrating Claude, GPT, and custom models via APIs\n\nStrategic Partnership & Delivery:\nβ€’ Led cloud AI deployments across AWS, GCP, and Azure for enterprise financial services\nβ€’ Delivered AI programs consistently 4 weeks ahead of schedule through agile methodologies\nβ€’ Guided 
multidisciplinary teams of 8+ engineers through strategic AI architecture decisions\nβ€’ Published thought leadership on MCP vs RAG architectures and Federated Learning\n\nEDUCATION\n━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\nCertificate in Quantitative Finance (CQF) - 87% average 2020 - 2021\nFitch Learning / CQF Institute\n- ANTHONY LUI\nLead AI Engineer | GenAI Solution Architect\n\nTel: +44 7545 128 601 | Email: luianthony@yahoo.com\nLocation: London | Citizenship: British/Australian (no visa required)\n\nPROFESSIONAL SUMMARY\n━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\nLead AI Engineer and one of two primary GenAI Solution Architects at Cognizant, with 3+ years \ndeploying production-grade LLMs, multi-modal models, and agentic workflows for Tier 1 financial \ninstitutions including HSBC and AmEx.\n- Expert in architecting autonomous AI systems, implementing \nRAG architectures, and building scalable MLOps pipelines.\n- Proven track record of delivering \nenterprise GenAI solutions 4 weeks ahead of schedule with budgets ranging from Β£100k-Β£1M monthly.\n\nANTHONY LUI\nLead AI Engineer | GenAI Solution Architect\n\nTel: +44 7545 128 601 | Email: luianthony@yahoo.com\nLocation: London | Citizenship: British/Australian (no visa required)\n\nPROFESSIONAL SUMMARY\n━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\nLead AI Engineer and one of two primary GenAI Solution Architects at Cognizant, with 3+ years \ndeploying production-grade LLMs, multi-modal models, and agentic workflows for Tier 1 financial \ninstitutions including HSBC and AmEx. Expert in architecting autonomous AI systems, implementing \nRAG architectures, and building scalable MLOps pipelines. 
Proven track record of delivering \nenterprise GenAI solutions 4 weeks ahead of schedule with budgets ranging from Β£100k-Β£1M monthly.\n\nCORE TECHNICAL COMPETENCIES\n━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\nβ€’ AI/ML Engineering: Python, LLMs (GPT, Claude, Gemma), Multi-modal Models, RAG, Prompt Engineering\nβ€’ Agentic Systems: Multi-agent AI Architectures, Autonomous Workflows, API Integration\nβ€’ MLOps & Deployment: Production AI Pipelines, Model Optimization, Cloud AI (AWS, GCP, Azure)\nβ€’ Scalable Systems: Full-stack Applications, API Development, Performance Optimization\nβ€’ Frameworks: Experience with LangChain/LlamaIndex patterns, Model Context Protocol\nβ€’ Financial Services: HSBC, AmEx, Quantitative Finance (CQF - 87%), Regulatory Compliance\n\nPROFESSIONAL EXPERIENCE\n━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\nCognizant, London, UK 2021 - Present\nAI Value Engineer - Associate Director | Lead GenAI Solution Architect\n\nProduction AI & MLOps Leadership:\nβ€’ Architected and deployed autonomous AI systems for Tier 1 financial institutions (HSBC, AmEx)\n implementing production-grade LLM solutions with 99.9% uptime\nβ€’ Built scalable MLOps pipelines processing Β£100k-Β£1M monthly transactions across government,\n healthcare, and financial services sectors\nβ€’ Pioneered multi-agent AI systems in August 2024, implementing agentic workflows before \n industry-wide adoption\n\nTechnical Innovation & Optimization:\nβ€’ Developed RAG architectures with advanced prompt engineering reducing response latency by 60%\nβ€’ Fine-tuned and optimized multi-modal models achieving 90% accuracy in specialized domains\nβ€’ Implemented Model Context Protocol for hallucination mitigation in production systems\nβ€’ Created full-stack AI applications integrating Claude, GPT, and custom models via APIs\n\nStrategic Partnership & Delivery:\nβ€’ Led cloud AI deployments across AWS, GCP, and Azure for enterprise 
financial services\nβ€’ Delivered AI programs consistently 4 weeks ahead of schedule through agile methodologies\nβ€’ Guided multidisciplinary teams of 8+ engineers through strategic AI architecture decisions\nβ€’ Published thought leadership on MCP vs RAG architectures and Federated Learning\n\nEDUCATION\n━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\nCertificate in Quantitative Finance (CQF) - 87% average 2020 - 2021\nFitch Learning / CQF Institute\n",
+ "signals": {
+ "bullet_density": 0.038,
+ "quant_count": 124,
+ "email_ok": true,
+ "gap_years_flag": false,
+ "skills_split_hint": false,
+ "languages_section": false,
+ "links_present": false,
+ "action_verb_count": 7,
+ "approx_pages": 2.56,
+ "approx_one_page": false
+ }
+ }
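The memory files above persist per-job drafting state as flat JSON: `job_id`, `coverage`, `keywords_used`, the draft text, and quality `signals`. A sketch of loading and summarizing such a record, with a trimmed inline sample standing in for the file on disk (`summarize` is an illustrative helper, not part of the repo):

```python
import json

# Inline sample mirroring memory/data/*__cv_owner.json (trimmed for brevity).
record = json.loads("""
{
  "job_id": "capco_lead_ai_2024",
  "cycle": 1,
  "coverage": 0.5384615384615384,
  "keywords_used": ["rag", "mlops", "agentic", "frameworks"],
  "signals": {"approx_pages": 2.56, "approx_one_page": false}
}
""")

def summarize(rec: dict) -> str:
    # Compact one-line summary for dashboards or logs.
    kw = ", ".join(rec.get("keywords_used", [])[:3])
    return (f"{rec['job_id']} cycle {rec.get('cycle', '?')}: "
            f"coverage {rec.get('coverage', 0):.0%}, top keywords: {kw}")

print(summarize(record))  # capco_lead_ai_2024 cycle 1: coverage 54%, top keywords: rag, mlops, agentic
```

Because each record is a standalone JSON file keyed by user and job id, the store needs no database and the `signals` block (page count, bullet density) can drive the next revision cycle directly.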