Pass HF_TOKEN to embedding model to avoid 429 rate limits 93c3adb vaishnav Claude Opus 4.6 (1M context) commited on Apr 9
Stream LLM tokens in real-time, trim history, and reduce retriever k 7abe457 vaishnav Claude Opus 4.6 (1M context) commited on Apr 9
Restore full URL list in configs/config.py 1f71c2d vaishnav Claude Opus 4.6 (1M context) commited on Apr 8
Raise HF max_new_tokens and load .env from any cwd 51f4eed vaishnav Claude Opus 4.6 (1M context) commited on Apr 8
Switch HF default to openai/gpt-oss-120b with auto provider routing 1c2dc4f vaishnav Claude Opus 4.6 (1M context) commited on Apr 8
Unpin gradio to match HF Spaces base image d19ec61 vaishnav Claude Opus 4.6 (1M context) commited on Apr 7
Fix HuggingFace provider crash and harden chain init 332bfab vaishnav Claude Opus 4.6 (1M context) commited on Apr 7
Merge branch 'main' of hf.co:spaces/vaishnaveswar/AIVIZ-BOT 59bfa04 vaishnav commited on Feb 25, 2025