Spaces:

msradam
/

amanat

Paused

App Files Files Community

amanat

Commit History

Full CIBA debug output, delete starter, sanitized binding message

e18f030

seriffic commited on Apr 7

Demo mode remediation returns success for CIBA flow

1c29ccb

seriffic commited on Apr 7

Add Outlook demo emails

ea1b5bb

seriffic commited on Apr 7

Replace What is Amanat starter with Outlook scan

184e41c

seriffic commited on Apr 7

Add Remediate profile with note that destructive actions blocked in demo

6a4d8ab

seriffic commited on Apr 7

Expand chat profile description with full demo context

56ae369

seriffic commited on Apr 7

Expand chainlit welcome: full demo context, repo link, watsonx explanation

c4d2559

seriffic commited on Apr 7

Richer profile description, Kokota attribution

2c75464

seriffic commited on Apr 7

Remove welcome message, use chainlit.md for starters page with full context

f015731

seriffic commited on Apr 7

Show welcome message in chat for demo mode

d5da69d

seriffic commited on Apr 6

Add full demo description to HF Space page

974d63e

seriffic commited on Apr 6

Add auditable data link and watsonx context to welcome

2cc8bbd

seriffic commited on Apr 6

CIBA step-up auth for destructive actions

93da010

seriffic commited on Apr 6

Add model/timing footer to responses, explain demo context in welcome

8be2fbd

seriffic commited on Apr 6

Fix: parse tool_args from string when watsonx returns unparsed JSON

f369779

seriffic commited on Apr 6

Show full traceback

74d1385

seriffic commited on Apr 6

Fresh IAM token per request, fix token expiry causing str.get error

eca5b20

seriffic commited on Apr 6

Pin strands==1.34.1, use python:3.13-slim to match local

149456a

seriffic commited on Apr 6

Show traceback in error message for debugging

10730f3

seriffic commited on Apr 6

Fix after_tool result extraction: handle str results from watsonx

22c69ba

seriffic commited on Apr 6

Fix message extraction: handle str/dict/list response formats from watsonx

13c1cbb

seriffic commited on Apr 6

Switch to watsonx backend: Granite 4 H-Small on GPU, no local model needed

e4400f9

seriffic commited on Apr 6

Revert to working python:3.12-slim + ollama CPU build

589d4f5

seriffic commited on Apr 6

Use system python3 from ollama base, no version pinning

6858d68

seriffic commited on Apr 6

Use ollama base image directly with Python installed on top for GPU

7c77def

seriffic commited on Apr 6

Minimal Dockerfile: python:3.12-slim + ollama binary, cpu-upgrade

51bcaf8

seriffic commited on Apr 6

python:3.12-slim + CUDA runtime libs via apt (no deadsnakes, no nvidia base)

152cf3a

seriffic commited on Apr 6

Fix: ubuntu22.04 not 24.04

0b07d25

seriffic commited on Apr 6

Base on nvidia/cuda runtime for actual GPU access in ollama

844cd79

seriffic commited on Apr 6

Force NVIDIA_VISIBLE_DEVICES=all in entrypoint to override HF void setting

10e10dc

seriffic commited on Apr 6

Add GPU diagnostics to startup logs

cfdd971

seriffic commited on Apr 5

Copy ollama CUDA runners for GPU acceleration on T4

92d425d

seriffic commited on Apr 5

Copy ollama binary into python:3.13-slim — simplest approach

c526ed4

seriffic commited on Apr 5

Multi-stage: Python 3.13 deps + Ollama CUDA runtime

b884fe8

seriffic commited on Apr 5

Pull model at runtime instead of build time

caa4c49

seriffic commited on Apr 5

Fix: wait for Ollama to start before pulling model

55fdd33

seriffic commited on Apr 5

Switch to Ollama with GPU: pre-pulled granite4:micro, T4 tier

516dc6e

seriffic commited on Apr 5

Revert to CPU build on cpu-upgrade — CUDA build stuck

5cf08cf

seriffic commited on Apr 5

Use pre-built CUDA binary instead of compiling from source

737ad6e

seriffic commited on Apr 5

CUDA build + GPU offload for T4 tier

6d048fd

seriffic commited on Apr 5

Disable GGML_NATIVE to avoid illegal instruction on different CPU

132f905

seriffic commited on Apr 5

Increase ctx-size to 4096 — tool defs need 1500 tokens

66faa89

seriffic commited on Apr 5

Revert to granite-4.0-micro (dense) — hybrid arch not supported in llama.cpp b5580

438c7e3

seriffic commited on Apr 5

Switch to granite-4.0-h-micro (3B hybrid) — h-small OOM on cpu-basic

13bdbb2

seriffic commited on Apr 5

Upgrade to Granite 4 H-Small (8B) for better tool calling

3c513ac

seriffic commited on Apr 5

Force agent to call tools immediately

062bace

seriffic commited on Apr 5

Add --jinja flag for tool calling support

930c813

seriffic commited on Apr 5

Deploy Amanat: Granite 4 Micro + llama-server + Chainlit + Auth0

efd8492

seriffic commited on Apr 5

initial commit

784fbd1
verified

msradam commited on Apr 5

Commit History

Full CIBA debug output, delete starter, sanitized binding message e18f030

Demo mode remediation returns success for CIBA flow 1c29ccb

Add Outlook demo emails ea1b5bb

Replace What is Amanat starter with Outlook scan 184e41c

Add Remediate profile with note that destructive actions blocked in demo 6a4d8ab

Expand chat profile description with full demo context 56ae369

Expand chainlit welcome: full demo context, repo link, watsonx explanation c4d2559

Richer profile description, Kokota attribution 2c75464

Remove welcome message, use chainlit.md for starters page with full context f015731

Show welcome message in chat for demo mode d5da69d

Add full demo description to HF Space page 974d63e

Add auditable data link and watsonx context to welcome 2cc8bbd

CIBA step-up auth for destructive actions 93da010

Add model/timing footer to responses, explain demo context in welcome 8be2fbd

Fix: parse tool_args from string when watsonx returns unparsed JSON f369779

Show full traceback 74d1385

Fresh IAM token per request, fix token expiry causing str.get error eca5b20

Pin strands==1.34.1, use python:3.13-slim to match local 149456a

Show traceback in error message for debugging 10730f3

Fix after_tool result extraction: handle str results from watsonx 22c69ba

Fix message extraction: handle str/dict/list response formats from watsonx 13c1cbb

Switch to watsonx backend: Granite 4 H-Small on GPU, no local model needed e4400f9

Revert to working python:3.12-slim + ollama CPU build 589d4f5

Use system python3 from ollama base, no version pinning 6858d68

Use ollama base image directly with Python installed on top for GPU 7c77def

Minimal Dockerfile: python:3.12-slim + ollama binary, cpu-upgrade 51bcaf8

python:3.12-slim + CUDA runtime libs via apt (no deadsnakes, no nvidia base) 152cf3a

Fix: ubuntu22.04 not 24.04 0b07d25

Base on nvidia/cuda runtime for actual GPU access in ollama 844cd79

Force NVIDIA_VISIBLE_DEVICES=all in entrypoint to override HF void setting 10e10dc

Add GPU diagnostics to startup logs cfdd971

Copy ollama CUDA runners for GPU acceleration on T4 92d425d

Copy ollama binary into python:3.13-slim — simplest approach c526ed4

Multi-stage: Python 3.13 deps + Ollama CUDA runtime b884fe8

Pull model at runtime instead of build time caa4c49

Fix: wait for Ollama to start before pulling model 55fdd33

Switch to Ollama with GPU: pre-pulled granite4:micro, T4 tier 516dc6e

Revert to CPU build on cpu-upgrade — CUDA build stuck 5cf08cf

Use pre-built CUDA binary instead of compiling from source 737ad6e

CUDA build + GPU offload for T4 tier 6d048fd

Disable GGML_NATIVE to avoid illegal instruction on different CPU 132f905

Increase ctx-size to 4096 — tool defs need 1500 tokens 66faa89

Revert to granite-4.0-micro (dense) — hybrid arch not supported in llama.cpp b5580 438c7e3

Switch to granite-4.0-h-micro (3B hybrid) — h-small OOM on cpu-basic 13bdbb2

Upgrade to Granite 4 H-Small (8B) for better tool calling 3c513ac

Force agent to call tools immediately 062bace

Add --jinja flag for tool calling support 930c813

Deploy Amanat: Granite 4 Micro + llama-server + Chainlit + Auth0 efd8492

initial commit 784fbd1 verified

Full CIBA debug output, delete starter, sanitized binding message

e18f030

Demo mode remediation returns success for CIBA flow

1c29ccb

Add Outlook demo emails

ea1b5bb

Replace What is Amanat starter with Outlook scan

184e41c

Add Remediate profile with note that destructive actions blocked in demo

6a4d8ab

Expand chat profile description with full demo context

56ae369

Expand chainlit welcome: full demo context, repo link, watsonx explanation

c4d2559

Richer profile description, Kokota attribution

2c75464

Remove welcome message, use chainlit.md for starters page with full context

f015731

Show welcome message in chat for demo mode

d5da69d

Add full demo description to HF Space page

974d63e

Add auditable data link and watsonx context to welcome

2cc8bbd

CIBA step-up auth for destructive actions

93da010

Add model/timing footer to responses, explain demo context in welcome

8be2fbd

Fix: parse tool_args from string when watsonx returns unparsed JSON

f369779

Show full traceback

74d1385

Fresh IAM token per request, fix token expiry causing str.get error

eca5b20

Pin strands==1.34.1, use python:3.13-slim to match local

149456a

Show traceback in error message for debugging

10730f3

Fix after_tool result extraction: handle str results from watsonx

22c69ba

Fix message extraction: handle str/dict/list response formats from watsonx

13c1cbb

Switch to watsonx backend: Granite 4 H-Small on GPU, no local model needed

e4400f9

Revert to working python:3.12-slim + ollama CPU build

589d4f5

Use system python3 from ollama base, no version pinning

6858d68

Use ollama base image directly with Python installed on top for GPU

7c77def

Minimal Dockerfile: python:3.12-slim + ollama binary, cpu-upgrade

51bcaf8

python:3.12-slim + CUDA runtime libs via apt (no deadsnakes, no nvidia base)

152cf3a

Fix: ubuntu22.04 not 24.04

0b07d25

Base on nvidia/cuda runtime for actual GPU access in ollama

844cd79

Force NVIDIA_VISIBLE_DEVICES=all in entrypoint to override HF void setting

10e10dc

Add GPU diagnostics to startup logs

cfdd971

Copy ollama CUDA runners for GPU acceleration on T4

92d425d

Copy ollama binary into python:3.13-slim — simplest approach

c526ed4

Multi-stage: Python 3.13 deps + Ollama CUDA runtime

b884fe8

Pull model at runtime instead of build time

caa4c49

Fix: wait for Ollama to start before pulling model

55fdd33

Switch to Ollama with GPU: pre-pulled granite4:micro, T4 tier

516dc6e

Revert to CPU build on cpu-upgrade — CUDA build stuck

5cf08cf

Use pre-built CUDA binary instead of compiling from source

737ad6e

CUDA build + GPU offload for T4 tier

6d048fd

Disable GGML_NATIVE to avoid illegal instruction on different CPU

132f905

Increase ctx-size to 4096 — tool defs need 1500 tokens

66faa89

Revert to granite-4.0-micro (dense) — hybrid arch not supported in llama.cpp b5580

438c7e3

Switch to granite-4.0-h-micro (3B hybrid) — h-small OOM on cpu-basic

13bdbb2

Upgrade to Granite 4 H-Small (8B) for better tool calling

3c513ac

Force agent to call tools immediately

062bace

Add --jinja flag for tool calling support

930c813

Deploy Amanat: Granite 4 Micro + llama-server + Chainlit + Auth0

efd8492

initial commit

784fbd1
verified