Pin MODEL_REVISION to specific commit hash for CDN cache busting d6129a4 mshz88 commited on 28 days ago
Rebuild with transformers.js v4.2.0 and properly exported ONNX model 3388c9d verified mshz88 commited on 28 days ago
Fix WebGPU hang: use q4 decoder (now int32-patched) for all backends 7733924 verified mshz88 commited on 28 days ago
Update MODEL_REVISION to main to include WebGPU-compatible model files f6015de verified mshz88 commited on 28 days ago
Add fp16→fp32 fallback for vision encoder WebGPU compatibility 01416c0 verified mshz88 commited on 28 days ago
Update MODEL_REVISION to latest decoder rebuild (asymmetric q4, no accuracy_level) ee395af verified mshz88 commited on May 5
Switch back to our model with fixed GatherBlockQuantized embed_tokens 3b1923d verified mshz88 commited on May 5
TEST: Use onnx-community model to verify infrastructure works 3693d17 verified mshz88 commited on May 5
Fix: enable chat input after model load, add WebGPU detection + inference logging 8f06f4b verified mshz88 commited on May 4
Pin model revision to bypass cache (fix duplicate node error) f9f64bf verified mshz88 commited on May 4