Spaces:

LejobuildYT
/

AI_Chatbot

Sleeping

App Files Files Community

AI_Chatbot / README.md

LejobuildYT

Update README.md

7bfb101 verified 19 days ago

preview code

raw

history blame contribute delete

6.35 kB

metadata

title: AI Chatbot Site
emoji: 🤖
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false

🤖 Zephyr-7B Chatbot für Hugging Face Spaces

Ein vollständiges React-Frontend mit Python-Backend für KI-Inferenz auf Hugging Face Spaces. Mit integriertem Plugin-System für einfache Erweiterbarkeit.

🎯 Features

Zephyr-7B-Beta Modell - Schnell, optimiert, auf VRAM-Limited Spaces
React + Vite Frontend - Modernes, responsives UI
Gradio Backend - Einfaches Deployment auf HF Spaces
Plugin-System - Einfach neue JS-Dateien in /plugins packen und sie werden automatisch geladen
Live Streaming - Token-by-Token Responses (optional ausbaubar)
Mehrsprachig - DE/EN ready

🚀 Quick Start

Lokal testen

Requirements installieren:

pip install -r requirements.txt

Backend starten:

python app.py

Das Backend läuft dann auf http://localhost:7860

React Frontend (neuer Terminal):

npm install
npm run dev

Frontend läuft auf http://localhost:5173

📦 Build für HF Spaces

React Build:

npm run build

Erzeugt dist/ Ordner

Hochladen zu HF Spaces:

# Space erstellen auf huggingface.co/spaces
# Dann:
git clone https://huggingface.co/spaces/USERNAME/SPACE-NAME
cd SPACE-NAME

# Dateien kopieren:
cp app.py .
cp requirements.txt .
cp -r plugins/ .
cp -r dist/ .  # (optional, wenn du static files servieren willst)

Commit & Push:

git add .
git commit -m "Initial commit"
git push

Die Spaces wird automatisch deployt!

📂 Projektstruktur

AI_Chatbot_Site/
├── app.py                    # Gradio Backend (HF Spaces Einstiegspunkt)
├── requirements.txt          # Python Dependencies
├── package.json              # Node Dependencies
├── vite.config.js           # Vite Konfiguration
│
├── src/
│   ├── main.jsx             # React Entry Point
│   ├── App.jsx              # Main App Component
│   ├── App.css              # App Styles
│   ├── index.css            # Global Styles
│   │
│   ├── components/
│   │   ├── ChatBox.jsx      # Chat UI Component
│   │   ├── ChatBox.css
│   │   ├── PluginLoader.jsx # Plugin System
│   │   └── plugins/         # Plugins Ordner
│   │
│   └── utils/
│       ├── api.js           # API Client
│       └── pluginManager.js # Plugin Management
│
├── public/
│   └── index.html
│
└── plugins/
    └── example-logger.js    # Example Plugin

🔌 Plugin-System

Ein neues Plugin erstellen

Erstelle eine neue Datei in plugins/myplugin.js:

// plugins/myplugin.js

export function onPluginInit(context) {
  context.log('My plugin initialized!')
}

export function onMessageSent(context, { message, systemPrompt }) {
  context.log(`User sent: ${message}`)
  
  // Hier kannst du:
  // - Analytics tracken
  // - Daten speichern
  // - API calls machen
  // - Etc.
}

export function onResponseReceived(context, { content, stats }) {
  context.log(`Response: ${content.substring(0, 50)}...`)
  context.log(`Stats: ${JSON.stringify(stats)}`)
}

Das Plugin wird automatisch geladen wenn die App startet!

Plugin Hooks

Hook	Wann	Parameter
`onPluginInit`	Plugin wird geladen	`context`
`onMessageSent`	Benutzer sendet Nachricht	`context, { message, systemPrompt }`
`onResponseReceived`	Response vom Model	`context, { content, stats }`

Plugin Context API

context.log(msg)    // Konsolenlog mit Plugin-Name
context.warn(msg)   // Warning
context.error(msg)  // Error

⚙️ Konfiguration

Backend (app.py)

MODEL_NAME = "HuggingFaceH4/zephyr-7b-beta"  # Modell wechseln
MAX_TOKENS = 512                             # Max Output länge
TEMPERATURE = 0.7                            # Kreativität (0-2)
TOP_P = 0.9                                  # Nucleus sampling

Frontend Einstellungen

Im UI kannst du direkt:

System Prompt anpassen
Temperatur & Top-P live ändern
Antwortzeit und Token-Count sehen

📊 Performance Optimierungen

Für Hugging Face Spaces (limitierte VRAM):

Model Quantization (optional):

from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    MODEL_NAME,
    load_in_8bit=True,  # 8-bit quantization
    device_map="auto",
)

Kleineres Modell:

MODEL_NAME = "HuggingFaceH4/zephyr-7b-alpha"  # Schneller
# oder
MODEL_NAME = "mistralai/Mistral-7B-v0.1"     # Alternative

Caching:

# Model wird gecacht nach erstem Load

🎨 Styling

CSS Variables in src/index.css für einfache Anpassung
Responsive Design (Mobile, Tablet, Desktop)
Dunkel/Hell Mode Support (buildbar)

🔐 Security

Backend validiert alle Inputs
CORS ist für HF Spaces automatisch configured
Sensitive data (API Keys) sollten in .env gehen (nicht committed)

🐛 Troubleshooting

"Module not found" bei Plugins

Stelle sicher:

Datei ist im /plugins Ordner
Exports sind korrekt: export function onPluginInit(context) { }

Backend startet nicht

# Teste wenn Python funktioniert:
python -c "import torch; print(torch.cuda.is_available())"

# Stelle sicher transformers installiert:
pip install -r requirements.txt --upgrade

React startet nicht

rm -rf node_modules package-lock.json
npm install
npm run dev

📝 Lizenz

MIT - Du kannst das frei nutzen, modifizieren und deployen!

🤝 Contributing

Feel free to add more plugins, components, or optimizations!

🎓 Nächste Schritte

Streaming Responses (token-by-token)
Dark Mode Toggle
Chat History speichern
Export Chats als PDF
Multi-Model Support (einfach zu Model-Select erweitern)
Voice Input (Whisper Integration)
Analytics Dashboard

Happy Coding! 🚀