Commit History

feat: replace Ollama with Groq API (llama-3.3-70b-versatile)
befb434

therandomuser03 commited on

perf: reduce num_ctx 8192→2048 for faster CPU inference on t3.large-HF
5bcc538

therandomuser03 commited on

Update app/main.py
1f89c0f
verified

therandomuser03 commited on

Initial clean commit with LFS models
0c46c35

therandomuser03 commited on