Optimize for Hugging Face Inference API with streaming support and RAG integration 03da349 rdune71 commited on Sep 4, 2025