- Overall System Architecture
Frontend (Next.js 16.0.1 + React)
- Technology: Next.js 16 with Turbopack, React, TypeScript
- Styling: Tailwind CSS, Shadcn UI components
- Animation: Framer Motion for smooth transitions
- Real-time Updates: WebSocket connection for live workflow progress
- Port: Running on port 3000 (http://172.24.50.21:3000)
- Features:
- Responsive drag-and-drop PDF upload (max 50MB)
- Real-time workflow progress monitoring
- Interactive results visualization
- PDF download for valorization briefs
Backend (FastAPI + Python)
- Framework: FastAPI (async Python web framework)
- Port: Running on port 8000 (http://172.24.50.21:8000)
- API Endpoints:
- /api/health - Health check
- /api/patents/upload - Patent PDF upload
- /api/workflows/execute - Start workflow
- /api/workflows/{id}/stream - WebSocket for real-time updates
- /api/workflows/{id}/brief/download - Download PDF brief
- GPU: Running on GPU1 (CUDA_VISIBLE_DEVICES=1)
- Environment: Python 3.10 with conda environment agentic-ai
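The upload/execute handshake behind these endpoints can be sketched as follows. This is a minimal stdlib stand-in, not the real handlers: the function names (`save_upload`, `start_workflow`) and the in-memory `workflows` registry are illustrative assumptions; the actual backend persists the file and launches the LangGraph workflow as an async task.

```python
import uuid

workflows: dict[str, dict] = {}  # stand-in for the backend's workflow registry

def save_upload(filename: str, data: bytes) -> str:
    """Mimics /api/patents/upload: assign a UUID and return it as patent_id."""
    patent_id = str(uuid.uuid4())
    # the real backend writes `data` to uploads/patents/ before returning
    return patent_id

def start_workflow(patent_id: str) -> str:
    """Mimics /api/workflows/execute: register state, return workflow_id immediately."""
    workflow_id = str(uuid.uuid4())
    workflows[workflow_id] = {"patent_id": patent_id, "status": "running"}
    return workflow_id

pid = save_upload("patent.pdf", b"%PDF-1.4 ...")
wid = start_workflow(pid)
print(workflows[wid]["status"])  # running
```

Returning `workflow_id` immediately (rather than blocking on the 2-3 minute workflow) is what lets the frontend open the WebSocket stream right away.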
- AI/LLM Architecture
Multi-Model LLM Strategy
- Model Provider: Ollama (local LLM serving)
- 4 Different Models for different complexity levels:
  - gemma2:2b - Simple/fast tasks
  - llama3.1:8b - Standard complexity (default)
  - qwen2.5:14b - Complex reasoning tasks
  - mistral:latest - Analysis and assessment tasks
LangChain Integration
- Framework: LangChain for LLM orchestration
- Output Parsing: JsonOutputParser for structured outputs
- Prompt Engineering: ChatPromptTemplate for consistent prompting
- Embeddings: OllamaEmbeddings for semantic search
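The prompt-template + JSON-parser pattern can be illustrated with a stdlib-only sketch. Here `fake_llm` is a placeholder for the Ollama-served model and the hard-coded JSON response is invented for the example; the real code uses ChatPromptTemplate and JsonOutputParser from LangChain, but the data flow is the same: format the prompt, call the model, parse structured JSON.

```python
import json

PROMPT = (
    'Extract the patent structure as pure JSON with keys "title" and '
    '"abstract". Patent text: {patent_text}'
)

def fake_llm(prompt: str) -> str:
    # placeholder for the Ollama model call; returns a JSON string
    return '{"title": "Example Patent", "abstract": "A demo abstract."}'

def run_chain(patent_text: str) -> dict:
    prompt = PROMPT.format(patent_text=patent_text)   # ChatPromptTemplate role
    return json.loads(fake_llm(prompt))               # JsonOutputParser role

result = run_chain("A method for ...")
print(result["title"])  # Example Patent
```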
- Multi-Agent System (LangGraph Workflow)
Core Workflow Engine
- Framework: LangGraph StateGraph (state machine for agent coordination)
- Pattern: Agentic workflow with iterative refinement
- Max Iterations: 3 refinement cycles with critic feedback
7 Specialized AI Agents:
- PlannerAgent (Complexity: Complex - qwen2.5:14b)
- Role: Orchestrates workflow, creates task decomposition
- Function: Breaks down patent analysis into 4 subtasks
- Template: Uses predefined template for "patent_wakeup" scenario
- DocumentAnalysisAgent (Complexity: Standard - llama3.1:8b)
- Role: Analyzes patent documents
- Tasks:
- Extract patent structure (title, abstract, claims, inventors)
- Assess Technology Readiness Level (TRL 1-9)
- Identify key innovations and technical domains
- Evaluate commercialization potential
- Tools: PDF extractor, semantic memory retrieval
- Chains:
- Structure extraction chain (JSON parser)
- Assessment chain (technology evaluation)
- MarketAnalysisAgent (Complexity: Analysis - mistral:latest)
- Role: Analyzes market opportunities
- Tasks:
- Identify 3-5 industry sectors
- Assess market readiness (Ready/Emerging/Early)
- Evaluate competitive landscape
- Identify geographic focus (EU, Canada priority for VISTA)
- Current Config: Market size and TAM set to None (displays as "NaN") for early-stage demo
- Output: 4-5 MarketOpportunity objects ranked by priority score
- MatchmakingAgent (Complexity: Standard - llama3.1:8b)
- Role: Finds potential partners/stakeholders
- Method: Semantic search using vector embeddings
- Database: ChromaDB with stakeholder profiles
- Scoring:
- Technical fit score
- Market fit score
- Geographic fit score
- Strategic fit score
- Overall fit score (composite)
- Output: Top 10 stakeholder matches
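The composite score above might be computed as a weighted average of the four component scores. The weights below are an assumption (the source does not give them; equal weighting is used for illustration), as is the `overall_fit` helper name.

```python
def overall_fit(technical: float, market: float,
                geographic: float, strategic: float) -> float:
    # equal weights assumed for illustration; real weights may differ
    weights = (0.25, 0.25, 0.25, 0.25)
    scores = (technical, market, geographic, strategic)
    return round(sum(w * s for w, s in zip(weights, scores)), 3)

matches = [
    {"name": "Partner A", "score": overall_fit(0.9, 0.7, 0.8, 0.6)},
    {"name": "Partner B", "score": overall_fit(0.5, 0.9, 0.6, 0.7)},
]
# rank descending and keep the top 10, as the agent does
top10 = sorted(matches, key=lambda m: m["score"], reverse=True)[:10]
```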
- OutreachAgent (Complexity: Standard - llama3.1:8b)
- Role: Generates valorization briefs
- Tasks:
- Create executive summary
- Generate comprehensive brief content
- Format market opportunities and partner recommendations
- Generate PDF document using ReportLab
- Chains:
- Brief content generation chain
- Executive summary extraction chain
- Output: PDF file + structured ValorizationBrief object
- CriticAgent (Complexity: Analysis - mistral:latest)
- Role: Quality assurance and validation
- Tasks:
- Validates workflow outputs
- Identifies gaps and issues
- Provides feedback for refinement
- Scores quality (0.0-1.0)
- Criteria: Completeness, accuracy, actionability
- MemoryAgent (ChromaDB Vector Store)
- Role: Persistent knowledge management
- Storage: 3 ChromaDB collections:
  - episodic_memory - Past workflow executions
  - semantic_memory - Domain knowledge and context
  - stakeholder_profiles - Partner database (11 profiles currently)
- Retrieval: Semantic search using embeddings (top-k results)
- Purpose: Contextual awareness across sessions
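The top-k retrieval step can be sketched with a toy cosine-similarity search. The real system uses OllamaEmbeddings and ChromaDB; here the embedding vectors and profile names are hard-coded stand-ins so the ranking logic is visible.

```python
import math

def cosine(a, b):
    # cosine similarity between two embedding vectors
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def top_k(query_vec, profiles, k=3):
    # score every profile against the query and keep the k best
    scored = [(cosine(query_vec, vec), name) for name, vec in profiles]
    return [name for score, name in sorted(scored, reverse=True)[:k]]

profiles = [("MedTech GmbH", [0.9, 0.1]),
            ("AgriCo", [0.1, 0.9]),
            ("BioLab", [0.7, 0.3])]
print(top_k([1.0, 0.0], profiles, k=2))  # ['MedTech GmbH', 'BioLab']
```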
- LangGraph Workflow Nodes
State Machine Flow:
START → PLANNER → ROUTER → EXECUTOR → CRITIC → REFINE? → FINISH
(REFINE? loops back for another pass when refinement is needed, up to 3 iterations)
Node Breakdown:
- PLANNER Node:
  - Retrieves relevant context from memory
  - Creates 4-subtask plan from template
  - Identifies scenario type (patent_wakeup)
- ROUTER Node:
  - Routes to appropriate execution pipeline based on scenario
  - Currently: Patent Wake-Up pipeline
- EXECUTOR Node:
- Executes 4-step pipeline:
- Step 1/4: Document Analysis (extract + assess patent)
- Step 2/4: Market Analysis (identify opportunities)
- Step 3/4: Partner Matching (find stakeholders)
- Step 4/4: Brief Generation (create PDF)
- CRITIC Node:
  - Validates output quality
  - Generates quality score and feedback
  - Determines if refinement is needed
- REFINE Node:
  - Prepares for the next iteration if quality is insufficient
  - Max 3 iterations, then finishes anyway
- FINISH Node:
  - Marks workflow as completed
  - Stores results in memory
  - Updates workflow state
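The CRITIC → REFINE? → FINISH loop can be sketched without LangGraph. The quality threshold of 0.8 and the mocked critic scores are assumptions for illustration; the real code uses a LangGraph StateGraph with a conditional edge at the critic step.

```python
MAX_ITERATIONS = 3        # from the workflow config above
QUALITY_THRESHOLD = 0.8   # assumed cutoff, not given in the source

def run_workflow(critic_scores):
    """Each entry in critic_scores mocks one CRITIC evaluation."""
    state = {"iteration": 0, "status": "running", "quality": 0.0}
    for score in critic_scores[:MAX_ITERATIONS]:
        state["iteration"] += 1          # one PLANNER→EXECUTOR pass
        state["quality"] = score         # CRITIC node scores the output
        if score >= QUALITY_THRESHOLD:   # REFINE? decision
            break                        # good enough → go to FINISH
    state["status"] = "completed"        # FINISH runs even after 3 tries
    return state

print(run_workflow([0.5, 0.9]))  # refined once, finished on iteration 2
```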
- Data Flow & Communication
Upload to Results Flow:
User uploads PDF → FastAPI saves to uploads/patents/ → Generates UUID for patent → Returns patent_id to frontend
User clicks analyze → Frontend calls /api/workflows/execute → Backend creates workflow_id → Starts async LangGraph workflow → Returns workflow_id immediately
Frontend opens WebSocket → ws://backend:8000/api/workflows/{id}/stream → Backend streams workflow state every 1 second → Frontend updates UI in real-time
Workflow completes → State = "completed" → Brief PDF generated → Frontend redirects to /results/{workflow_id}
User downloads brief → GET /api/workflows/{id}/brief/download → Returns PDF file
WebSocket Real-Time Updates:
- Protocol: WebSocket (bidirectional)
- Frequency: Updates sent every 1 second
- Data: Full workflow state (JSON)
- Retry Logic: Frontend auto-reconnects on disconnect
- Fallback: HTTP polling if WebSocket fails
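The 1-second streaming loop has roughly this shape. This sketch uses plain asyncio with an injected `send` callback instead of a real FastAPI WebSocket, and the short intervals in `demo` are only to keep the example fast; the production loop sends the serialized state once per second until the workflow completes.

```python
import asyncio
import json

async def stream_state(state: dict, send, interval: float = 1.0):
    # push the full workflow state every `interval` seconds until done
    while state["status"] != "completed":
        await send(json.dumps(state))
        await asyncio.sleep(interval)
    await send(json.dumps(state))  # final "completed" frame

async def demo():
    frames = []
    state = {"status": "running", "step": 0}

    async def send(msg):          # stand-in for websocket.send_text
        frames.append(msg)

    async def advance():          # simulate the workflow finishing
        await asyncio.sleep(0.05)
        state["status"] = "completed"

    await asyncio.gather(stream_state(state, send, interval=0.02), advance())
    return frames

frames = asyncio.run(demo())
```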
- Key Technologies & Libraries
Backend Stack:
- FastAPI - Async web framework
- Uvicorn - ASGI server
- LangChain - LLM orchestration
- LangGraph - Agent workflow state machine
- ChromaDB - Vector database for embeddings
- Pydantic - Data validation and serialization
- ReportLab - PDF generation
- PyPDF - PDF text extraction
- Loguru - Structured logging
- PyTorch - GPU acceleration
Frontend Stack:
- Next.js 16 - React framework with Turbopack
- React 19 - UI library
- TypeScript - Type safety
- Tailwind CSS - Utility-first styling
- Shadcn/UI - Component library
- Framer Motion - Animation library
- Axios - HTTP client
- Lucide React - Icon library
- Pydantic Data Models
Core Models (src/workflow/langgraph_state.py):
- Claim: Patent claim structure
- PatentAnalysis: Complete patent analysis (17 fields)
- MarketOpportunity: Individual market sector (12 fields)
- MarketAnalysis: Market research results (10 fields)
- StakeholderMatch: Partner match (11 fields)
- ValorizationBrief: Outreach document (9 fields)
- WorkflowState: Complete workflow state (9 fields)
All models use strict validation with Pydantic v2.
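As a sketch of that strict validation, here is what one of the smaller models might look like in Pydantic v2. The field names come from the source (a claim has a number and text); the exact types are assumed. With `strict=True`, Pydantic rejects coercible-but-wrong types instead of silently converting them.

```python
from pydantic import BaseModel, ConfigDict, ValidationError

class Claim(BaseModel):
    # assumed fields for illustration; the real model may have more
    model_config = ConfigDict(strict=True)
    number: int
    text: str

ok = Claim(number=1, text="A method for ...")

try:
    Claim(number="1", text="A method for ...")  # str where int expected
    raised = False
except ValidationError:
    raised = True  # strict mode refuses to coerce "1" → 1
```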
- Error Handling & Fixes Applied
Recent Bug Fixes:
- JSON Parsing: Enhanced prompts to force pure JSON output (no prose)
- Pydantic Validation: Use "or" fallbacks to substitute defaults for None values
- Claims Parsing: Filter None values in claims arrays
- Market Values: Handle None gracefully (display "NaN")
- WebSocket: Fixed React re-render loop, added cleanup flags
- Download Brief: Handle None values in nested dicts
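Minimal versions of the None-handling fixes listed above might look like this; the helper names (`clean_market`, `clean_claims`) are illustrative, not the real function names.

```python
def clean_market(raw: dict) -> dict:
    # "or" fallback guards against None/missing values from the LLM output
    return {
        "sector": raw.get("sector") or "Unknown",
        "market_size_usd": raw.get("market_size_usd"),  # None → shown as "NaN"
    }

def clean_claims(claims: list) -> list:
    # drop the None entries the JSON parser sometimes leaves in claims arrays
    return [c for c in claims if c is not None]

print(clean_claims(["claim 1", None, "claim 2"]))  # ['claim 1', 'claim 2']
print(clean_market({"sector": None}))
```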
Logging Strategy:
- Loguru for structured logging
- Levels: DEBUG, INFO, SUCCESS, WARNING, ERROR
- Files:
- /tmp/backend_sparknet.log - Backend logs
- /tmp/frontend_sparknet.log - Frontend logs
- GPU & Performance
GPU Configuration:
- GPU Used: GPU1 (CUDA_VISIBLE_DEVICES=1)
- Memory: ~10GB required for all 4 LLMs
- Inference: Ollama handles model loading and caching
Performance Metrics:
- Document Analysis: ~10-15 seconds
- Market Analysis: ~15-20 seconds
- Partner Matching: ~60-75 seconds (LLM scoring)
- Brief Generation: ~5-10 seconds
- Total Workflow: ~2-3 minutes per patent
- Current Configuration (Demo Mode)
Market Data Placeholders:
- market_size_usd = None β displays "NaN"
- total_addressable_market_usd = None β displays "NaN"
- growth_rate_percent = None β displays "NaN"
- funding_capacity_usd = None β displays "NaN"
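The None → "NaN" display behavior is a one-line formatting rule at the UI boundary; a hedged sketch (the `fmt_usd` helper is illustrative, and the frontend may format differently):

```python
def fmt_usd(value):
    # None placeholders render as the literal string "NaN" in the demo UI
    return "NaN" if value is None else f"${value:,.0f}"

print(fmt_usd(None))       # NaN
print(fmt_usd(2500000))    # $2,500,000
```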
Reasoning:
- Early-stage research phase
- Avoid false commitments with speculative numbers
- Focus on technical capabilities and partner matching
- Screen Sessions
Currently Running:
- Backend: 1082974.sparknet-backend (screen -r sparknet-backend)
- Frontend: 1074449.sparknet-frontend (screen -r sparknet-frontend)
To View Logs:
tail -f /tmp/backend_sparknet.log
tail -f /tmp/frontend_sparknet.log
- API Health Check
Endpoint: GET /api/health
{
  "status": "healthy",
  "components": {
    "llm_client": true,
    "workflow": true,
    "planner": true,
    "critic": true,
    "memory": true
  },
  "statistics": {
    "active_workflows": 0,
    "processed_patents": 0
  }
}
Summary for Demo
SPARKNET is a multi-agent AI system that transforms dormant patents into commercialization opportunities using:
- 4 Local LLMs (2B to 14B parameters) for different complexity tasks
- 7 Specialized AI Agents orchestrated by LangGraph state machine
- Vector Database (ChromaDB) for semantic partner matching
- Real-time WebSocket streaming for live progress updates
- Automated PDF Generation for professional valorization briefs
- End-to-end Pipeline: Upload → Analyze → Match → Generate → Download
Current Mode: Early-stage demo with placeholder market values ("NaN") to avoid premature commitments while showcasing technical capabilities.
Perfect! I've created a comprehensive summary of the SPARKNET system architecture. This covers everything from the high-level architecture down to the technical details of what's happening under the hood.
Key Points for Your Demo:
The Big Picture:
- Multi-agent AI system with 7 specialized agents
- 4 different LLMs (2B-14B parameters) running locally on GPU1
- Real-time WebSocket streaming for live progress
- End-to-end automation: Upload PDF → AI Analysis → Partner Matching → PDF Brief
Technical Highlights:
- LangGraph state machine orchestrating agent collaboration
- ChromaDB vector database for semantic partner matching
- Iterative refinement with CriticAgent (max 3 cycles)
- Pydantic models for strict data validation
- FastAPI backend + Next.js 16 frontend
Demo Mode Settings:
- All market/financial values set to "NaN" (early-stage, no false commitments)
- Focus on technical capabilities and partner matching accuracy
- ~2-3 minutes per patent workflow