Update app.py to use pipeline API for Inferless compatibility 93afb6b verified tigres2526 commited on Aug 9, 2025
Remove false quantization_config - model weights are not quantized 2c6d9cb verified tigres2526 commited on Aug 8, 2025
Fix imports to use absolute paths for standalone loading 4373c62 verified tigres2526 commited on Aug 8, 2025
Fix imports to use absolute paths for standalone loading a0b2e27 verified tigres2526 commited on Aug 8, 2025
Add auto_map to config for custom MoE architecture loading 86efc4b verified tigres2526 commited on Aug 8, 2025
Add modeling_gpt_oss.py for MoE architecture support 456e8a0 verified tigres2526 commited on Aug 8, 2025
Remove emojis to fix Inferless Unicode encoding error cb71c54 verified tigres2526 commited on Aug 7, 2025
Upload INFERLESS_DEPLOYMENT.md with huggingface_hub 4a2c5df verified tigres2526 commited on Aug 7, 2025
Upload inferless_config.yaml with huggingface_hub bbd369a verified tigres2526 commited on Aug 7, 2025
Upload requirements_inferless.txt with huggingface_hub 0b5a909 verified tigres2526 commited on Aug 7, 2025
Upload app_inferless_quantized.py with huggingface_hub 4aaafa4 verified tigres2526 commited on Aug 7, 2025
Upload cai_20b_utils-1.0.0.tar.gz with huggingface_hub 5c459c8 verified tigres2526 commited on Aug 7, 2025
Upload cai_20b_utils-1.0.0-py3-none-any.whl with huggingface_hub 6d0a2e2 verified tigres2526 commited on Aug 7, 2025
Update model card with production cleanup code and utils package f621767 verified tigres2526 commited on Aug 7, 2025