File size: 4,845 Bytes
55953aa
3381167
 
55953aa
a465955
55953aa
 
 
 
 
 
 
 
 
 
 
 
 
 
3381167
 
 
55953aa
 
3381167
55953aa
3381167
 
55953aa
3381167
 
55953aa
 
 
3381167
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
55953aa
a465955
 
 
 
 
 
 
 
 
55953aa
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3381167
55953aa
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a465955
55953aa
a465955
55953aa
 
 
 
 
 
 
 
 
 
 
 
 
 
a465955
55953aa
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3381167
55953aa
 
 
 
 
 
3381167
 
 
a465955
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
---
title: Kerdos AI  Custom LLM Chat (Demo)
emoji: 🤖
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: "6.6.0"
app_file: app.py
pinned: false
license: mit
tags:
  - rag
  - document-qa
  - enterprise
  - llama
  - faiss
  - gradio
  - nlp
  - question-answering
  - kerdos
  - private-llm
  - demo
---

# 🤖 Kerdos AI — Custom LLM Chat & Document Q&A

> **A Demo Product by [Kerdos Infrasoft Private Limited](https://kerdos.in)**
> Upload your company documents. Ask questions. Get answers — strictly from your data.

> ⚠️ **This is a Demo Version.** Features, model selection, and customisation are limited.
> The full product will support **private, on-premise LLM deployments** tailored to your organisation.

---

## 🚀 About Kerdos Infrasoft

**Kerdos Infrasoft Private Limited** is an Indian technology startup incorporated in December 2023, headquartered in **Bengaluru, Karnataka, India** (CIN: U62099KA2023PTC182869). We specialise in:

- 🧠 Artificial Intelligence & Machine Learning
- 🔗 Blockchain Development
- 💻 Custom Software Development
- ☁️ Infrastructure & Digital Consulting

🌐 **Website**: [https://kerdos.in](https://kerdos.in)
📞 **Contact**: [https://kerdos.in/contact](https://kerdos.in/contact)
✉️ **Email**: [partnership@kerdos.in](mailto:partnership@kerdos.in)

---

## 💰 Seeking Investment & Partnerships

We are actively **seeking investment and strategic partnerships** to build the **fully customisable enterprise edition**, which will include:

- 🔒 **Private LLM Hosting** — on-premise or private-cloud deployments
- 🎛️ **Custom Model Fine-tuning** — trained on your domain data
- 🛡️ **Data Privacy Guarantees** — your data never leaves your environment
- 🏷️ **White-label Deployments** — fully branded for your organisation

**Interested?** Reach out at [partnership@kerdos.in](mailto:partnership@kerdos.in) or visit [kerdos.in/contact](https://kerdos.in/contact)

---

## ✨ Features (Demo)

| Feature                       | Details                                                           |
| ----------------------------- | ----------------------------------------------------------------- |
| 📄 **Multi-format ingestion** | PDF, DOCX, TXT, MD, CSV                                           |
| 🧠 **Open-source LLM**        | `meta-llama/Llama-3.1-8B-Instruct` via HF Inference API           |
| 🔒 **Strictly grounded**      | Answers only from your uploaded documents                         |
| 📦 **Multi-document**         | Upload and query across multiple files simultaneously             |
| 💬 **Multi-turn chat**        | Maintains conversation context across questions                   |
| ⚡ **Fast**                   | CPU-friendly embeddings (`BAAI/bge-small-en-v1.5` + FAISS cosine) |
| 🔑 **Secure**                 | Files processed in-session only — never stored permanently        |

---

## 🚀 How to Use

### On Hugging Face Spaces

1. Upload your documents (PDF, DOCX, TXT) using the left panel
2. Click **Index Documents**
3. Enter your [Hugging Face API token](https://huggingface.co/settings/tokens) _(Write access required for Llama 3)_
4. Ask questions in the chat!

### Self-Hosted / Local

```bash
git clone https://huggingface.co/spaces/kerdosdotio/Custom-LLM-Chat
cd Custom-LLM-Chat
pip install -r requirements.txt
HF_TOKEN=hf_your_token python app.py
```

---

## 🏗️ Architecture

```
User Uploads Files

Document Parser (PDF / DOCX / TXT)

Text Chunking (512 chars, 64 overlap)

Embeddings (BAAI/bge-small-en-v1.5)

FAISS Vector Index (cosine similarity, in-memory)

User Question → Similarity Search → Top-K Chunks

Llama 3.1 8B — answers ONLY from retrieved chunks

Response + Source Citations
```

---

## 🔧 Tech Stack

- **UI**: [Gradio](https://gradio.app)
- **LLM**: `meta-llama/Llama-3.1-8B-Instruct`
- **Embeddings**: `BAAI/bge-small-en-v1.5` (cosine similarity via FAISS)
- **Vector Store**: [FAISS](https://github.com/facebookresearch/faiss)
- **Document Parsing**: PyMuPDF, python-docx

---

## 💼 Use Cases

- **Customer Support**: Index your product manuals, FAQs, and policies
- **HR & Legal**: Query employee handbooks, contracts, and compliance docs
- **Sales Enablement**: Search product specs, case studies, and pricing docs
- **IT Helpdesk**: Query runbooks, troubleshooting guides, and SOPs

---

## 🔐 Privacy

- Uploaded documents are **processed in-memory** and **not stored** after your session ends
- For persistent storage or on-premise deployment with full privacy, [contact us](https://kerdos.in/contact)

---

## 📄 License

MIT License — free for commercial and private use.

---

_© 2024–2026 Kerdos Infrasoft Private Limited | Bengaluru, Karnataka, India_