Cybersecurity-Panel

Running

App Files Files Community

Cybersecurity-Panel / multi_llm_chatbot_backend /README.md

NeonCharlie-24

Fix/dup first message (#44)

e19e41d unverified 2 months ago

preview code

Raw

History Blame Contribute Delete

4.61 kB

Multi-LLM Chatbot Backend

A modular, extensible FastAPI backend for building an AI-powered research advisor chatbot that supports:

Multiple AI personas with configurable tone and behavior
Dynamic switching between Gemini (cloud) and Ollama (local) LLMs
Chat session persistence and context memory
Document upload, chunking, and retrieval using RAG
Rich export features (PDF, DOCX, TXT)
User authentication and JWT-based access control

Backend Architecture

User Input
   ↓
/chat-stream → Orchestrator
     ↓            ↙         ↘
  SessionManager   ContextManager   RAGManager
         ↓              ↓             ↓
     MongoDB        Token Trimming   ChromaDB
         ↓              ↓             ↓
        Persisted Chat & Doc Context → LLM (Gemini/Ollama)

Features

Persona-based multi-agent conversation (Theorist, Pragmatist, etc.)
Provider switching (Gemini ↔ Ollama)
Context-aware response routing + top-K advisor selection
PDF, DOCX, and TXT file upload and semantic retrieval
Developer tools: debug personas, test RAG, export sessions
Secure authentication and session scoping

Setup Instructions

1. Clone and Configure Environment

git clone https://github.com/yourorg/multi-llm-chatbot-backend
cd multi-llm-chatbot-backend
cp .env.example .env  # already provided

2. Python Environment Setup

python -m venv venv
source venv/bin/activate  # or venv\Scripts\activate on Windows

pip install -r requirements.txt

3. Run the Server

uvicorn app.main:app --reload

Server will be available at: http://localhost:8000

FastAPI Routing & Modules

Folder	Description
`app/api`	REST API endpoints for chat, auth, RAG, exports
`app/core`	Main orchestration, context windows, database logic
`app/llm`	Gemini + Ollama LLM wrappers
`app/models`	Persona and user schemas
`app/utils`	File parsing, summaries, exports, vector helpers

Key Files

`main.py`

Loads env vars, sets up FastAPI instance with CORS and routers
Calls connect_to_mongo() on startup and close_mongo_connection() on shutdown
Imports and registers all routers (auth, chat_sessions, etc.)

`.env` (Sample Vars)

# MongoDB
MONGODB_CONNECTION_STRING=mongodb://localhost:27017
MONGODB_DATABASE_NAME=neon_ai_backend

# Gemini API Key and model
GEMINI_API_KEY=...  # Replace with real key
GEMINI_MODEL=gemini-2.0-flash

# Default provider
DEFAULT_PROVIDER=gemini

`requirements.txt`

Includes:

FastAPI, Uvicorn: API framework and server
httpx: Async LLM request handler
motor, pymongo: MongoDB async access
chromadb, sentence-transformers: Vector database + embeddings
PyPDF2, docx2txt, reportlab: Document parsing and PDF generation
passlib, python-jose: Auth and security

Persona Design & Context Handling

Personas defined in app/models/default_personas.py
Rich system prompts, styles, and epistemologies
Responses routed through ImprovedChatOrchestrator
Context trimmed and weighted via ContextManager

Switching LLM Providers

You can hot-swap models via API:

POST /switch-provider
{ "provider": "gemini" } | { "provider": "ollama" }

Also supported: /switch-model, /current-model, /current-provider

Document Upload + RAG

Upload PDFs, DOCX, or TXT to sessions
Text is extracted → chunked → embedded → stored in ChromaDB
Queried during conversation by persona-aware EnhancedRAGManager

Export Options

Format	Export Endpoint
PDF	`/export-chat?format=pdf`
DOCX	`/export-chat?format=docx`
TXT	`/export-chat?format=txt`
Summary	`/chat-summary?format=pdf`

Developer & Debug Endpoints

Endpoint	Purpose
`/debug/personas`	See registered advisors and prompts
`/debug/ranked-personas`	View top-K advisors for context
`/debug/rag-status`	Run sample search to test document index

Status & Roadmap

Multi-LLM backend ready (Gemini + Ollama)
Document RAG + export system
Session-aware persona routing
JWT Auth + MongoDB user handling
UI enhancements and persona memory
Persona fine-tuning support (future)

For questions, contributions, or deployment help — feel free to reach out!