Custom-LLM-Chat / README.md
Bhaskar Ram
feat: apply all 15 upgrades — BGE embeddings, cosine FAISS, streaming LLM, tenacity retry, dotenv, Dockerfile, tests
a465955

A newer version of the Gradio SDK is available: 6.8.0

Upgrade
metadata
title: Kerdos AI  Custom LLM Chat (Demo)
emoji: 🤖
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 6.6.0
app_file: app.py
pinned: false
license: mit
tags:
  - rag
  - document-qa
  - enterprise
  - llama
  - faiss
  - gradio
  - nlp
  - question-answering
  - kerdos
  - private-llm
  - demo

🤖 Kerdos AI — Custom LLM Chat & Document Q&A

A Demo Product by Kerdos Infrasoft Private Limited Upload your company documents. Ask questions. Get answers — strictly from your data.

⚠️ This is a Demo Version. Features, model selection, and customisation are limited. The full product will support private, on-premise LLM deployments tailored to your organisation.


🚀 About Kerdos Infrasoft

Kerdos Infrasoft Private Limited is an Indian technology startup incorporated in December 2023, headquartered in Bengaluru, Karnataka, India (CIN: U62099KA2023PTC182869). We specialise in:

  • 🧠 Artificial Intelligence & Machine Learning
  • 🔗 Blockchain Development
  • 💻 Custom Software Development
  • ☁️ Infrastructure & Digital Consulting

🌐 Website: https://kerdos.in 📞 Contact: https://kerdos.in/contact ✉️ Email: partnership@kerdos.in


💰 Seeking Investment & Partnerships

We are actively seeking investment and strategic partnerships to build the fully customisable enterprise edition, which will include:

  • 🔒 Private LLM Hosting — on-premise or private-cloud deployments
  • 🎛️ Custom Model Fine-tuning — trained on your domain data
  • 🛡️ Data Privacy Guarantees — your data never leaves your environment
  • 🏷️ White-label Deployments — fully branded for your organisation

Interested? Reach out at partnership@kerdos.in or visit kerdos.in/contact


✨ Features (Demo)

Feature Details
📄 Multi-format ingestion PDF, DOCX, TXT, MD, CSV
🧠 Open-source LLM meta-llama/Llama-3.1-8B-Instruct via HF Inference API
🔒 Strictly grounded Answers only from your uploaded documents
📦 Multi-document Upload and query across multiple files simultaneously
💬 Multi-turn chat Maintains conversation context across questions
Fast CPU-friendly embeddings (BAAI/bge-small-en-v1.5 + FAISS cosine)
🔑 Secure Files processed in-session only — never stored permanently

🚀 How to Use

On Hugging Face Spaces

  1. Upload your documents (PDF, DOCX, TXT) using the left panel
  2. Click Index Documents
  3. Enter your Hugging Face API token (Write access required for Llama 3)
  4. Ask questions in the chat!

Self-Hosted / Local

git clone https://huggingface.co/spaces/kerdosdotio/Custom-LLM-Chat
cd Custom-LLM-Chat
pip install -r requirements.txt
HF_TOKEN=hf_your_token python app.py

🏗️ Architecture

User Uploads Files
      ↓
Document Parser (PDF / DOCX / TXT)
      ↓
Text Chunking (512 chars, 64 overlap)
      ↓
Embeddings (BAAI/bge-small-en-v1.5)
      ↓
FAISS Vector Index (cosine similarity, in-memory)
      ↓
User Question → Similarity Search → Top-K Chunks
      ↓
Llama 3.1 8B — answers ONLY from retrieved chunks
      ↓
Response + Source Citations

🔧 Tech Stack

  • UI: Gradio
  • LLM: meta-llama/Llama-3.1-8B-Instruct
  • Embeddings: BAAI/bge-small-en-v1.5 (cosine similarity via FAISS)
  • Vector Store: FAISS
  • Document Parsing: PyMuPDF, python-docx

💼 Use Cases

  • Customer Support: Index your product manuals, FAQs, and policies
  • HR & Legal: Query employee handbooks, contracts, and compliance docs
  • Sales Enablement: Search product specs, case studies, and pricing docs
  • IT Helpdesk: Query runbooks, troubleshooting guides, and SOPs

🔐 Privacy

  • Uploaded documents are processed in-memory and not stored after your session ends
  • For persistent storage or on-premise deployment with full privacy, contact us

📄 License

MIT License — free for commercial and private use.


© 2024–2026 Kerdos Infrasoft Private Limited | Bengaluru, Karnataka, India