Update system model to Gemma 3 1B Instruct and humanize responses 9eed65c khubchand commited on 16 days ago
feat: optimize response time by offloading translation to deep-translator and reducing ctx/retrieval k size 147efdf khubchand commited on 17 days ago
Optimize Hugging Face Space: add eager model loading, reduce max tokens, fix stop tokens, limit CPU threads 09bc714 khubchand commited on 17 days ago
feat: implement complete RAG-based AI engine with Ollama fallback and vector search support b597dd6 khubchand commited on 17 days ago