Spaces:
Running
Running
File size: 4,845 Bytes
55953aa 3381167 55953aa a465955 55953aa 3381167 55953aa 3381167 55953aa 3381167 55953aa 3381167 55953aa 3381167 55953aa a465955 55953aa 3381167 55953aa a465955 55953aa a465955 55953aa a465955 55953aa 3381167 55953aa 3381167 a465955 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 | ---
title: Kerdos AI — Custom LLM Chat (Demo)
emoji: 🤖
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: "6.6.0"
app_file: app.py
pinned: false
license: mit
tags:
- rag
- document-qa
- enterprise
- llama
- faiss
- gradio
- nlp
- question-answering
- kerdos
- private-llm
- demo
---
# 🤖 Kerdos AI — Custom LLM Chat & Document Q&A
> **A Demo Product by [Kerdos Infrasoft Private Limited](https://kerdos.in)**
> Upload your company documents. Ask questions. Get answers — strictly from your data.
> ⚠️ **This is a Demo Version.** Features, model selection, and customisation are limited.
> The full product will support **private, on-premise LLM deployments** tailored to your organisation.
---
## 🚀 About Kerdos Infrasoft
**Kerdos Infrasoft Private Limited** is an Indian technology startup incorporated in December 2023, headquartered in **Bengaluru, Karnataka, India** (CIN: U62099KA2023PTC182869). We specialise in:
- 🧠 Artificial Intelligence & Machine Learning
- 🔗 Blockchain Development
- 💻 Custom Software Development
- ☁️ Infrastructure & Digital Consulting
🌐 **Website**: [https://kerdos.in](https://kerdos.in)
📞 **Contact**: [https://kerdos.in/contact](https://kerdos.in/contact)
✉️ **Email**: [partnership@kerdos.in](mailto:partnership@kerdos.in)
---
## 💰 Seeking Investment & Partnerships
We are actively **seeking investment and strategic partnerships** to build the **fully customisable enterprise edition**, which will include:
- 🔒 **Private LLM Hosting** — on-premise or private-cloud deployments
- 🎛️ **Custom Model Fine-tuning** — trained on your domain data
- 🛡️ **Data Privacy Guarantees** — your data never leaves your environment
- 🏷️ **White-label Deployments** — fully branded for your organisation
**Interested?** Reach out at [partnership@kerdos.in](mailto:partnership@kerdos.in) or visit [kerdos.in/contact](https://kerdos.in/contact)
---
## ✨ Features (Demo)
| Feature | Details |
| ----------------------------- | ----------------------------------------------------------------- |
| 📄 **Multi-format ingestion** | PDF, DOCX, TXT, MD, CSV |
| 🧠 **Open-source LLM** | `meta-llama/Llama-3.1-8B-Instruct` via HF Inference API |
| 🔒 **Strictly grounded** | Answers only from your uploaded documents |
| 📦 **Multi-document** | Upload and query across multiple files simultaneously |
| 💬 **Multi-turn chat** | Maintains conversation context across questions |
| ⚡ **Fast** | CPU-friendly embeddings (`BAAI/bge-small-en-v1.5` + FAISS cosine) |
| 🔑 **Secure** | Files processed in-session only — never stored permanently |
---
## 🚀 How to Use
### On Hugging Face Spaces
1. Upload your documents (PDF, DOCX, TXT) using the left panel
2. Click **Index Documents**
3. Enter your [Hugging Face API token](https://huggingface.co/settings/tokens) _(Write access required for Llama 3)_
4. Ask questions in the chat!
### Self-Hosted / Local
```bash
git clone https://huggingface.co/spaces/kerdosdotio/Custom-LLM-Chat
cd Custom-LLM-Chat
pip install -r requirements.txt
HF_TOKEN=hf_your_token python app.py
```
---
## 🏗️ Architecture
```
User Uploads Files
↓
Document Parser (PDF / DOCX / TXT)
↓
Text Chunking (512 chars, 64 overlap)
↓
Embeddings (BAAI/bge-small-en-v1.5)
↓
FAISS Vector Index (cosine similarity, in-memory)
↓
User Question → Similarity Search → Top-K Chunks
↓
Llama 3.1 8B — answers ONLY from retrieved chunks
↓
Response + Source Citations
```
---
## 🔧 Tech Stack
- **UI**: [Gradio](https://gradio.app)
- **LLM**: `meta-llama/Llama-3.1-8B-Instruct`
- **Embeddings**: `BAAI/bge-small-en-v1.5` (cosine similarity via FAISS)
- **Vector Store**: [FAISS](https://github.com/facebookresearch/faiss)
- **Document Parsing**: PyMuPDF, python-docx
---
## 💼 Use Cases
- **Customer Support**: Index your product manuals, FAQs, and policies
- **HR & Legal**: Query employee handbooks, contracts, and compliance docs
- **Sales Enablement**: Search product specs, case studies, and pricing docs
- **IT Helpdesk**: Query runbooks, troubleshooting guides, and SOPs
---
## 🔐 Privacy
- Uploaded documents are **processed in-memory** and **not stored** after your session ends
- For persistent storage or on-premise deployment with full privacy, [contact us](https://kerdos.in/contact)
---
## 📄 License
MIT License — free for commercial and private use.
---
_© 2024–2026 Kerdos Infrasoft Private Limited | Bengaluru, Karnataka, India_
|