Spaces:

daniel-was-taken
/

CompifAI

Runtime error

App Files Files Community

CompifAI / README.md

Daniel Ferreira

Update README to remove introductory description

7ccfa59 13 days ago

preview code

raw

history blame contribute delete

5.75 kB

	---
	title: CompifAI
	emoji: 💻
	colorFrom: gray
	colorTo: green
	sdk: docker
	pinned: false
	license: apache-2.0
	---

	# Competence Standards RAG Chatbot

	## 📋 Table of Contents

	- [About the Project](#about-the-project)
	- [Features](#features)
	- [Prerequisites](#prerequisites)
	- [Installation](#installation)
	- [Configuration](#configuration)
	- [Starting the Application](#starting-the-application)
	- [Database Population](#database-population)
	- [Data Management](#data-management)
	- [Testing](#testing)

	## 🎯 About the Project

	This RAG chatbot is specifically designed to assist with competence standards in higher education settings. It provides intelligent responses based on a curated knowledge base of documents.

	The system uses advanced natural language processing to understand queries and retrieve relevant information from the document corpus, providing contextually appropriate responses.

	## ✨ Features

	- Interactive Chat Interface: Built with Chainlit for an intuitive user experience
	- Vector Search: Powered by Milvus for efficient similarity search
	- Advanced Embeddings: Uses Nebius AI Qwen3-Embedding-8B model
	- Document Processing: Supports multiple formats (PDF, DOCX, HTML)
	- Authentication: Integrated with Chainlit's authentication system
	- Persistent Storage: Database integration with PostgreSQL
	- Containerized Deployment: Docker Compose setup for easy deployment
	- Testing Suite: Comprehensive testing with RAGAS evaluation


	## 🚀 Installation

	1. Clone the repository
	```bash
	git clone https://github.com/daniel-was-taken/prod-rag-chat.git
	cd prod-rag-chat
	```

	2. Create a virtual environment
	```bash
	python -m venv venv

	# On Windows
	.\venv\Scripts\activate

	# On macOS/Linux
	source venv/bin/activate
	```

	3. Install dependencies
	```bash
	pip install -r requirements.txt
	```

	## ⚙️ Configuration

	1. Create environment file
	```bash
	cp .env.example .env # If available, or create manually
	```

	2. Configure environment variables
	Create a `.env` file in the root directory with the following variables:
	```env
	# Nebius AI Configuration
	NEBIUS_API_KEY=your_nebius_api_key_here
	OPENAI_API_KEY=your_openai_api_key_here # Fallback

	# Milvus Configuration
	MILVUS_URI=http://localhost:19530

	# Chainlit Configuration
	CHAINLIT_AUTH_SECRET=your_auth_secret_here

	# Database Configuration (if using PostgreSQL)
	DATABASE_URL=postgresql://user:password@localhost:5432/dbname
	```

	## 🎬 Starting the Application

	### Method 1: Using Docker Compose (Recommended)

	1. Start the infrastructure services
	```bash
	docker-compose up -d
	```
	This will start:
	- Milvus vector database
	- etcd (for Milvus coordination)
	- MinIO (for Milvus storage)

	2. Wait for services to be ready

	3. Start the Chainlit application
	```bash
	chainlit run app.py -w
	```

	### Method 2: Manual Setup

	1. Install and configure Milvus standalone
	Follow the [Milvus installation guide](https://milvus.io/docs/install_standalone-docker.md)

	2. Start the application
	```bash
	chainlit run app.py -w
	```

	The application will be available at `http://localhost:8000`

	## 📊 Database Population

	The system automatically populates the vector database on first startup. However, you can manually manage the data:

	### Automatic Population

	The application automatically checks if the Milvus collection exists and has data. If not, it runs the population script automatically.

	### Manual Population

	To manually populate or repopulate the database:

	```bash
	python populate_db.py
	```

	### Adding New Documents

	1. Add documents to the data directory
	```bash
	# Place your documents in the data/ folder
	cp your_new_document.pdf data/
	cp your_new_document.docx data/
	```

	2. Supported file formats:
	- PDF files (`.pdf`)
	- Microsoft Word documents (`.docx`)
	- HTML files (`.html`)

	3. Repopulate the database
	```bash
	# Delete existing collection
	python delete_collection.py

	# Repopulate with new documents
	python populate_db.py
	```

	### Database Configuration

	The population script uses the following configuration:

	- Embedding Model: Qwen/Qwen3-Embedding-8B (4096 dimensions)
	- Chunk Size: 1500 characters maximum
	- Combine Threshold: 200 characters minimum
	- Batch Size: 5 documents per batch
	- Collection Name: `my_rag_collection`

	### Document Processing Pipeline

	1. Loading: Documents are loaded using UnstructuredLoader
	2. Cleaning: Text is cleaned and normalized
	3. Chunking: Documents are split into manageable chunks
	4. Embedding: Chunks are converted to vector embeddings
	5. Storage: Embeddings are stored in Milvus with metadata

	## 🗃️ Data Management


	### Deleting the Collection

	```bash
	python delete_collection.py
	```


	### Updating Documents

	To update the document corpus:

	1. Add/remove documents in the `data/` directory
	2. Delete the existing collection: `python delete_collection.py`
	3. Restart the application (it will automatically repopulate)

	## 🧪 Testing

	The project includes comprehensive testing:

	### Running Unit Tests

	```bash

	# Run specific test files
	python -m unittest tests/test_chainlit.py -v
	```

	### RAGAS Evaluation

	Evaluate the RAG system performance:

	```bash
	# Run RAGAS evaluation
	python tests/test_ragas.py

	# Or use the Jupyter notebook
	jupyter notebook tests/test_ragas.ipynb
	```

	### Manual Testing

	Test individual components:

	```bash
	# Test vector search functionality
	python tests/test_vector_search.py

	```


	## 📄 License

	This project is licensed under the MIT license.