Spaces:

visualisable-ai
/

api

Paused

App Files Files Community

api / backend

Commit History

feat: add auto_complete parameter for token generation

bb689ce

gary-boon Claude Opus 4.5 commited on Dec 24, 2025

fix: add QKV extraction support for Mistral/Devstral architecture

d1d37a8

gary-boon Claude Opus 4.5 commited on Dec 24, 2025

feat: implement lazy-loading for attention matrices

929ba88

gary-boon Claude Opus 4.5 commited on Dec 24, 2025

Add avg_entropy calculation for attention heads

66a46b6

gary-boon Claude Opus 4.5 commited on Dec 22, 2025

Revert QKV visualization fixes - need better approach for data streaming

d0b7e29

gary-boon Claude Opus 4.5 commited on Dec 17, 2025

Add safety checks for missing QKV keys

a79cb83

gary-boon Claude Opus 4.5 commited on Dec 17, 2025

Limit QKV matrices to top 5 heads per layer to reduce response size

decb5ab

gary-boon Claude Opus 4.5 commited on Dec 17, 2025

Fix QKV matrix extraction for Mistral/Devstral architecture

9056859

gary-boon Claude Opus 4.5 commited on Dec 17, 2025

Fix QKV visualization for Mistral/Devstral architecture

4ec134b

gary-boon Claude Opus 4.5 commited on Dec 17, 2025

Fix: Import time module at top level for SSE events

15a862b

gary-boon Claude Opus 4.5 commited on Dec 16, 2025

Add SSE streaming endpoint for real-time analysis progress

172a186

gary-boon Claude Opus 4.5 commited on Dec 16, 2025

feat: Include token metadata in analysis response

ee0f6c9

gary-boon Claude Opus 4.5 commited on Dec 16, 2025

feat: Implement tier-based model filtering by device type

6bf9f5c

gary-boon Claude Opus 4.5 commited on Dec 16, 2025

Fix: Add attn_implementation="eager" to model switch function

f94a7ae

gary-boon Claude Opus 4.5 commited on Dec 16, 2025

Add tokenSections boundaries and update system prompt

c6f4cc5

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

Fix: Handle MistralCommonTokenizer pad_token setter

e20ccaf

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

Integrate mistral-common for correct Devstral tokenization

ed06dcb

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

Remove mistral_common to fix dependency conflict

3d9d9ee

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

Use mistral_common for proper Devstral prompt formatting

3e80769

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

Add system prompt support for instruction-tuned models

2860768

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

fix: Simpler prompt format and temperature=0 for Devstral

76020ee

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

fix: Sanitize JSON response for NaN/Inf float values

99f6209

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

fix: Check chat_template is set before using apply_chat_template

474927d

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

fix: Add chat template support for Devstral instruct model

8d85da8

gary-boon Claude Opus 4.5 commited on Dec 15, 2025

fix: Convert bfloat16 to float32 for numpy compatibility

cb6f39c

gary-boon Claude Opus 4.5 commited on Dec 14, 2025

fix: Use eager attention for output_attentions support

5333b21

gary-boon Claude Opus 4.5 commited on Dec 14, 2025

Add vocabSize to modelInfo response

499afba

gary-boon Claude Opus 4.5 commited on Dec 14, 2025

Add recommended_dtype to model configs

62525b2

gary-boon Claude Opus 4.5 commited on Dec 14, 2025

Phase 2: Add Devstral backend support

9080f28

gary-boon Claude Opus 4.5 commited on Dec 14, 2025

Add DEVICE env var to force CPU mode on DGX Spark

5f122aa

gary-boon Claude Opus 4.5 commited on Dec 14, 2025

Make zarr/numcodecs imports optional for ARM64 compatibility

6435a75

gary-boon Claude Opus 4.5 commited on Dec 14, 2025

Phase 1: DGX Spark infrastructure

a2bd186

gary-boon Claude Opus 4.5 commited on Dec 14, 2025

Make QKV hook robust against shape mismatches

343dd57

gary-boon Claude commited on Nov 18, 2025

Fix research attention endpoint model compatibility

f5ba954

gary-boon Claude commited on Nov 18, 2025

Add research attention analysis endpoint with real CodeGen tokenization

8f63685

gary-boon Claude commited on Nov 18, 2025

Add research attention analysis endpoints with Q/K/V extraction

37ed739

gary-boon Claude commited on Nov 13, 2025

Fix ablation study for Code Llama compatibility

cd300ee

gary-boon Claude commited on Oct 31, 2025

Fix model info endpoint for Code Llama compatibility

7dd568f

gary-boon Claude commited on Oct 31, 2025

Add Code Llama 7B support with hardware-aware filtering and ICL timeout fixes

ed40a9a

gary-boon Claude commited on Oct 30, 2025

Fix pyarrow compatibility issue with datasets library

1680fda

gary-boon Claude commited on Sep 16, 2025

Fix syntax error in swe_bench_service.py

9dbec03

gary-boon Claude commited on Sep 16, 2025

Remove all mock data from SWE-bench - real data only

c0d95bf

gary-boon Claude commited on Sep 16, 2025

Add GitHub URLs and improve mock data for SWE-bench

22c69fa

gary-boon Claude commited on Sep 16, 2025

Fix SWE-bench service to gracefully handle dataset loading failures

ae9e159

gary-boon Claude commited on Sep 16, 2025

Fix SWE-bench service to return full problem statements

1d23728

gary-boon Claude commited on Sep 16, 2025

Add SWE-bench integration and improve backend routing

4444ae2

gary-boon Claude commited on Sep 15, 2025

Add layer_stride parameter for PromptDiff optimization

5aed1a9

gary-boon Claude commited on Sep 12, 2025

Capture complete attention patterns after generation

992dc8c

gary-boon commited on Sep 11, 2025

Fix: Use scaling approach instead of skipping layers

3c774b5

gary-boon Claude commited on Sep 2, 2025

Fix: Refine layer hook output format handling

4b03268

gary-boon Claude commited on Sep 2, 2025