ai-assistant-engine / llm / model_loader.py

Commit History

Update system model to Gemma 3 1B Instruct and humanize responses
9eed65c

khubchand committed on

feat: optimize response time by offloading translation to deep-translator and reducing ctx/retrieval k size
147efdf

khubchand committed on

Optimize Hugging Face Space: add eager model loading, reduce max tokens, fix stop tokens, limit CPU threads
09bc714

khubchand committed on

feat: implement complete RAG-based AI engine with Ollama fallback and vector search support
b597dd6

khubchand committed on

Initial clean release
0a96660

khubchand committed on