Fix: Robust range-resumable downloaders and FastAPI-level stop filters to eliminate prompt template leakage e732501 Running AjinkyaPagare commited on about 10 hours ago
Fix: Optimize configs, relax system prompts, and improve StarCoder2 prompt template stopping 0d0f467 AjinkyaPagare commited on about 11 hours ago
Update display title in README.md frontmatter to show correct name dc72282 AjinkyaPagare commited on 1 day ago
feat: configure StarCoder2 3B model deployment configurations and ports 3fbe92b AjinkyaPagare commited on 1 day ago
fix: resolve javascript scoping exceptions, adjust button accents, and integrate marked.js with responsive layouts 7175181 AjinkyaPagare commited on 2 days ago
feat: Add premium chat features including interactive copy buttons, timestamps and full bubble copy controls 23495d7 AjinkyaPagare commited on 2 days ago
Fix UI to permanently dock input row at the bottom and grow scrollable chat list above it 12b638c AjinkyaPagare commited on 2 days ago
Perfect playground layout positioning welcome message inside chat box and input below it 4a07db6 AjinkyaPagare commited on 2 days ago
Enable detailed background server logs for startup debugging 9f91c07 AjinkyaPagare commited on 2 days ago
Reposition user input box exactly below the default LLM welcome message 898e3c1 AjinkyaPagare commited on 2 days ago
Upgrade backend to support high-performance scaling & concurrency caching f9538e1 AjinkyaPagare commited on 2 days ago
Position input row exact below LLM message using premium chat-messages flex design 0b79961 AjinkyaPagare commited on 2 days ago
Implement Stop button, AbortController, OpenAI-compatible chat backend, and zero-scrollbar layout 2417b82 AjinkyaPagare commited on 2 days ago
Enable real-time interactive chat streaming in dashboard UI 2f69243 AjinkyaPagare commited on 2 days ago
Fix binary detection and setup logic to support cross-platform Linux/Docker environments 045ac3e AjinkyaPagare commited on 2 days ago
perf: limit LLAMA_THREADS to 2 on Linux to prevent cgroup throttling, and override max_tokens < 100 to a minimum of 100 tokens 22de5fc AjinkyaPagare commited on 2 days ago
feat: increase context size to 8192, add robust input validations, and enable cross-platform telemetry matching both llama-server.exe and llama-server 301bafe AjinkyaPagare commited on 2 days ago
feat: add OpenAI compatibility proxy and beautiful glassmorphism dashboard UI ce1cb13 AjinkyaPagare commited on 2 days ago
Fix Dockerfile: copy all compiled libraries and define LD_LIBRARY_PATH 3888eab AjinkyaPagare commited on 2 days ago
Deploy production-grade ultra-fast Qwen 7B Space and GitLab repo 072e013 AjinkyaPagare commited on 2 days ago