Codey-Bryant / README.md
pasxalisag's picture
Upload 6 files
26a7ce0 verified
---
title: Codey Bryant 3.0
emoji: πŸ€–
colorFrom: blue
colorTo: green
sdk: docker
app_file: app_hf.py
pinned: true
license: mit
---
# πŸ€– Codey Bryant 3.0 - SOTA RAG Coding Assistant
## Advanced RAG Architecture
**Codey Bryant 3.0** implements state-of-the-art Retrieval-Augmented Generation (RAG) with four key innovations:
### 1. **HyDE (Hypothetical Document Embeddings)**
Generates hypothetical answers to improve retrieval relevance for vague queries.
### 2. **Query Rewriting**
Transforms casual/vague questions into specific, searchable queries.
### 3. **Multi-Query Retrieval**
Searches multiple query variations to increase recall.
### 4. **Answer-Space Retrieval**
Retrieves from both question AND answer embeddings for better context.
## Technical Stack
- **LLM**: TinyLlama 1.1B (4-bit quantized)
- **Embeddings**: all-MiniLM-L6-v2
- **Retrieval**: FAISS + BM25 hybrid
- **Datasets**: OPC Educational + Evol-Instruct
- **Framework**: Gradio + Hugging Face Transformers
## Performance Features
- Handles vague queries like "it's not working"
- Streaming responses
- Context-aware generation
- Hybrid dense-sparse retrieval
- Persistent artifact storage
## Getting Started
1. Click **"Initialize Assistant"** (required once)
2. Ask Python coding questions
3. Get intelligent, context-aware responses
## Example Queries
- "How to read a CSV file in Python?"
- "Why am I getting 'list index out of range'?"
- "Make this function faster..."
- "Help, my code isn't working!"
- "Best way to sort a dictionary by value?"
## Why This Architecture?
1. **HyDE**: Addresses the "semantic gap" between queries and documents
2. **Query Rewriting**: Improves retrieval for conversational queries
3. **Multi-Query**: Increases recall for complex questions
4. **Answer-Space**: Provides better context for generation
## πŸ“ Repository Structure