fix: remove Ollama from Dockerfile, use Groq API instead 4044503 Running therandomuser03 commited on 6 days ago
feat: replace Ollama with Groq API (llama-3.3-70b-versatile) befb434 therandomuser03 commited on 6 days ago
perf: reduce num_ctx 8192→2048 for faster CPU inference on t3.large-HF 5bcc538 therandomuser03 commited on 6 days ago