Spaces:

kerdosdotio
/

Custom-LLM-Chat

Running

File size: 4,845 Bytes

---
title: Kerdos AI — Custom LLM Chat (Demo)
emoji: 🤖
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: "6.6.0"
app_file: app.py
pinned: false
license: mit
tags:
  - rag
  - document-qa
  - enterprise
  - llama
  - faiss
  - gradio
  - nlp
  - question-answering
  - kerdos
  - private-llm
  - demo
---

# 🤖 Kerdos AI — Custom LLM Chat & Document Q&A

> **A Demo Product by [Kerdos Infrasoft Private Limited](https://kerdos.in)**
> Upload your company documents. Ask questions. Get answers — strictly from your data.

> ⚠️ **This is a Demo Version.** Features, model selection, and customisation are limited.
> The full product will support **private, on-premise LLM deployments** tailored to your organisation.

---

## 🚀 About Kerdos Infrasoft

**Kerdos Infrasoft Private Limited** is an Indian technology startup incorporated in December 2023, headquartered in **Bengaluru, Karnataka, India** (CIN: U62099KA2023PTC182869). We specialise in:

- 🧠 Artificial Intelligence & Machine Learning
- 🔗 Blockchain Development
- 💻 Custom Software Development
- ☁️ Infrastructure & Digital Consulting

🌐 **Website**: [https://kerdos.in](https://kerdos.in)
📞 **Contact**: [https://kerdos.in/contact](https://kerdos.in/contact)
✉️ **Email**: [partnership@kerdos.in](mailto:partnership@kerdos.in)

---

## 💰 Seeking Investment & Partnerships

We are actively **seeking investment and strategic partnerships** to build the **fully customisable enterprise edition**, which will include:

- 🔒 **Private LLM Hosting** — on-premise or private-cloud deployments
- 🎛️ **Custom Model Fine-tuning** — trained on your domain data
- 🛡️ **Data Privacy Guarantees** — your data never leaves your environment
- 🏷️ **White-label Deployments** — fully branded for your organisation

**Interested?** Reach out at [partnership@kerdos.in](mailto:partnership@kerdos.in) or visit [kerdos.in/contact](https://kerdos.in/contact)

---

## ✨ Features (Demo)

| Feature                       | Details                                                           |
| ----------------------------- | ----------------------------------------------------------------- |
| 📄 **Multi-format ingestion** | PDF, DOCX, TXT, MD, CSV                                           |
| 🧠 **Open-source LLM**        | `meta-llama/Llama-3.1-8B-Instruct` via HF Inference API           |
| 🔒 **Strictly grounded**      | Answers only from your uploaded documents                         |
| 📦 **Multi-document**         | Upload and query across multiple files simultaneously             |
| 💬 **Multi-turn chat**        | Maintains conversation context across questions                   |
| ⚡ **Fast**                   | CPU-friendly embeddings (`BAAI/bge-small-en-v1.5` + FAISS cosine) |
| 🔑 **Secure**                 | Files processed in-session only — never stored permanently        |

---

## 🚀 How to Use

### On Hugging Face Spaces

1. Upload your documents (PDF, DOCX, TXT) using the left panel
2. Click **Index Documents**
3. Enter your [Hugging Face API token](https://huggingface.co/settings/tokens) _(Write access required for Llama 3)_
4. Ask questions in the chat!

### Self-Hosted / Local

```bash
git clone https://huggingface.co/spaces/kerdosdotio/Custom-LLM-Chat
cd Custom-LLM-Chat
pip install -r requirements.txt
HF_TOKEN=hf_your_token python app.py
```

---

## 🏗️ Architecture

```
User Uploads Files
      ↓
Document Parser (PDF / DOCX / TXT)
      ↓
Text Chunking (512 chars, 64 overlap)
      ↓
Embeddings (BAAI/bge-small-en-v1.5)
      ↓
FAISS Vector Index (cosine similarity, in-memory)
      ↓
User Question → Similarity Search → Top-K Chunks
      ↓
Llama 3.1 8B — answers ONLY from retrieved chunks
      ↓
Response + Source Citations
```

---

## 🔧 Tech Stack

- **UI**: [Gradio](https://gradio.app)
- **LLM**: `meta-llama/Llama-3.1-8B-Instruct`
- **Embeddings**: `BAAI/bge-small-en-v1.5` (cosine similarity via FAISS)
- **Vector Store**: [FAISS](https://github.com/facebookresearch/faiss)
- **Document Parsing**: PyMuPDF, python-docx

---

## 💼 Use Cases

- **Customer Support**: Index your product manuals, FAQs, and policies
- **HR & Legal**: Query employee handbooks, contracts, and compliance docs
- **Sales Enablement**: Search product specs, case studies, and pricing docs
- **IT Helpdesk**: Query runbooks, troubleshooting guides, and SOPs

---

## 🔐 Privacy

- Uploaded documents are **processed in-memory** and **not stored** after your session ends
- For persistent storage or on-premise deployment with full privacy, [contact us](https://kerdos.in/contact)

---

## 📄 License

MIT License — free for commercial and private use.

---

_© 2024–2026 Kerdos Infrasoft Private Limited | Bengaluru, Karnataka, India_