Add Remediate profile with note that destructive actions blocked in demo 6a4d8ab seriffic commited on Apr 7
Expand chainlit welcome: full demo context, repo link, watsonx explanation c4d2559 seriffic commited on Apr 7
Remove welcome message, use chainlit.md for starters page with full context f015731 seriffic commited on Apr 7
Add model/timing footer to responses, explain demo context in welcome 8be2fbd seriffic commited on Apr 6
Fix: parse tool_args from string when watsonx returns unparsed JSON f369779 seriffic commited on Apr 6
Fresh IAM token per request, fix token expiry causing str.get error eca5b20 seriffic commited on Apr 6
Fix message extraction: handle str/dict/list response formats from watsonx 13c1cbb seriffic commited on Apr 6
Switch to watsonx backend: Granite 4 H-Small on GPU, no local model needed e4400f9 seriffic commited on Apr 6
Use ollama base image directly with Python installed on top for GPU 7c77def seriffic commited on Apr 6
python:3.12-slim + CUDA runtime libs via apt (no deadsnakes, no nvidia base) 152cf3a seriffic commited on Apr 6
Force NVIDIA_VISIBLE_DEVICES=all in entrypoint to override HF void setting 10e10dc seriffic commited on Apr 6
Revert to granite-4.0-micro (dense) β hybrid arch not supported in llama.cpp b5580 438c7e3 seriffic commited on Apr 5
Switch to granite-4.0-h-micro (3B hybrid) β h-small OOM on cpu-basic 13bdbb2 seriffic commited on Apr 5