Swap Wan2.1 -> HunyuanVideo-1.5, 544x960 portrait 9:16, 49 frames @ 24fps b6e2fb3 Junchen Fu committed on Mar 9
Fix: use WanPipeline(Wan2.1-T2V-1.3B) directly, drop VACE GGUF to avoid layer mismatch 0432e61 Junchen Fu committed on Mar 9
Swap LTX-Video -> Wan2.1-VACE-1.3B GGUF for better video quality 454bf3e Junchen Fu committed on Mar 9
Increase LTX-Video steps 4->25 for better quality, extend GPU duration to 120s 4b8d70c Junchen Fu committed on Mar 9
Fix spaces import order, upgrade LLM to Qwen2.5-7B-Instruct on ZeroGPU 4aa806e Junchen Fu committed on Mar 9
Fix LLM: use local Qwen2.5-0.5B-Instruct via transformers pipeline, no external API 1e237c6 Junchen Fu committed on Mar 8
Fix LLM: switch to direct requests call on api-inference.huggingface.co, bypass router 223a7bd Junchen Fu committed on Mar 8
Fix: switch chat_completion to text_generation with Mistral [INST] template f003c98 Junchen Fu committed on Mar 8
Fix LLM: use Mistral-7B (available on hf-inference), drop response_format, add JSON fallback 328a82b Junchen Fu committed on Mar 8
Fix: use provider=hf-inference to bypass router, no third-party provider needed 0ca75f0 Junchen Fu committed on Mar 8
Switch LLM to Llama-3.2-3B-Instruct (available on HF serverless) 34aaf9a Junchen Fu committed on Mar 7
Swap to Qwen2.5-1.5B LLM and LTX-Video; output mp4; slim requirements ed307d2 Junchen Fu committed on Mar 7
Perf: background preload RAG, thread-safe lazy loading, declare models in README for HF caching baf02c1 Junchen Fu committed on Mar 7
Fix: lazy-load models and RAG dataset on first use to avoid init-phase network failures 8a7f9d9 Junchen Fu committed on Mar 7
Fix encoding: remove non-ASCII comments, ensure clean UTF-8 app.py with two tabs 711ed36 Junchen Fu committed on Mar 7
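
Note on baf02c1 and 8a7f9d9: those commits mention lazy loading on first use, thread safety, and a background preload, but the log does not show the code. The following is only a minimal sketch of one common way to combine the three (a lock-guarded, load-once wrapper). The class `LazyResource`, its methods, and the loader passed to it are hypothetical names for illustration and are not taken from the repository's app.py.

```python
import threading


class LazyResource:
    """Hypothetical helper: run `loader` once, on first use, safely across threads."""

    def __init__(self, loader):
        self._loader = loader          # zero-argument callable that does the expensive load
        self._lock = threading.Lock()
        self._value = None
        self._loaded = False

    def get(self):
        # Fast path: skip the lock once the value is available.
        if not self._loaded:
            with self._lock:
                # Re-check inside the lock so only one thread runs the loader.
                if not self._loaded:
                    self._value = self._loader()
                    self._loaded = True
        return self._value

    def preload_in_background(self):
        # Start loading without blocking the caller (for example, app start-up).
        thread = threading.Thread(target=self.get, daemon=True)
        thread.start()
        return thread
```

Under these assumptions, an app would build something like `rag = LazyResource(load_rag_dataset)` at import time (with `load_rag_dataset` standing in for whatever actually fetches the data), call `rag.preload_in_background()` once, and call `rag.get()` inside request handlers. If the loader raises, nothing is cached, so the next `get()` call retries the load.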