Commit History

fix: drop provider='hf-inference' (auto-route works for 72B)
2d57fba
Running
verified

2045max commited on

fix: bump sdk_version to 5.49.1 to match requirements
ede5b89
verified

2045max commited on

fix: gradio>=5.49.1 + huggingface_hub<1.0 (HfFolder compat)
6d845eb
verified

2045max commited on

fix: upgrade huggingface_hub + explicit hf-inference provider
d999892
verified

2045max commited on

fix: switch to Qwen2.5-72B-Instruct (7B not available on free inference API)
9eff4e9
verified

2045max commited on

fix: LLM timeout + no-token fallback
e9be0e7
verified

2045max commited on

fix: pin gradio 5.0.0 + hf_hub 0.25.2
df5aac5
verified

2045max commited on

fix: pin python 3.11, gradio 5.x
bc49f1c
verified

2045max commited on

Step 5: RAG with Qwen2.5
5bad468
verified

2045max commited on

Upgrade Gradio to 5.x for huggingface_hub 1.x compat
20c462d
verified

2045max commited on

Update
34518b7
verified

2045max commited on

Initial deploy: FAQ bot with bge-small-zh
71245aa
verified

2045max commited on

initial commit
1093193
verified

2045max commited on