Switch to Mistral 7B Instruct - compatible and ready to fine-tune bcc0a91 verified EGYADMIN commited on 24 days ago
Switch to Qwen 2.5 32B Instruct - powerful and compatible model 7439474 verified EGYADMIN commited on 24 days ago
Switch back to original moonshotai/Kimi-K2-Instruct model 3e1fa45 verified EGYADMIN commited on 24 days ago
Switch to quantized model RedHatAI/Kimi-K2-Instruct-quantized.w4a16 1e6a29d verified EGYADMIN commited on 24 days ago
Switch to Hugging Face Inference API for efficient model access cfa3f95 verified EGYADMIN commited on 24 days ago
Optimize app.py with better memory management and simplified code 24a1793 verified EGYADMIN commited on 24 days ago
Remove model pre-download from Dockerfile - model will be downloaded at runtime 469912b verified EGYADMIN commited on 24 days ago
Update Dockerfile to pre-download model during build a0f6119 verified EGYADMIN commited on 24 days ago
Switch to Kimi-K2-Instruct model for better compatibility 7501b6e verified EGYADMIN commited on 24 days ago
Fix Gradio 6.x compatibility - remove show_copy_button and theme from Blocks 653809e verified EGYADMIN commited on 24 days ago
Update Dockerfile with optimized settings for Kimi-K2 ea506a5 verified EGYADMIN commited on 24 days ago
Add compressed-tensors and update dependencies for Kimi-K2 fac206b verified EGYADMIN commited on 24 days ago
Apply patch BEFORE transformers import to fix is_torch_fx_available error ef80b0e verified EGYADMIN commited on 24 days ago
Fix import error by patching is_torch_fx_available function b76db2a verified EGYADMIN commited on 24 days ago
Add application file to load Kimi-K2-Thinking model 9d2d217 verified EGYADMIN commited on 24 days ago