api / backend

Commit History

feat: add auto_complete parameter for token generation
bb689ce

gary-boon Claude Opus 4.5 commited on

fix: add QKV extraction support for Mistral/Devstral architecture
d1d37a8

gary-boon Claude Opus 4.5 commited on

feat: implement lazy-loading for attention matrices
929ba88

gary-boon Claude Opus 4.5 commited on

Add avg_entropy calculation for attention heads
66a46b6

gary-boon Claude Opus 4.5 commited on

Revert QKV visualization fixes - need better approach for data streaming
d0b7e29

gary-boon Claude Opus 4.5 commited on

Add safety checks for missing QKV keys
a79cb83

gary-boon Claude Opus 4.5 commited on

Limit QKV matrices to top 5 heads per layer to reduce response size
decb5ab

gary-boon Claude Opus 4.5 commited on

Fix QKV matrix extraction for Mistral/Devstral architecture
9056859

gary-boon Claude Opus 4.5 commited on

Fix QKV visualization for Mistral/Devstral architecture
4ec134b

gary-boon Claude Opus 4.5 commited on

Fix: Import time module at top level for SSE events
15a862b

gary-boon Claude Opus 4.5 commited on

Add SSE streaming endpoint for real-time analysis progress
172a186

gary-boon Claude Opus 4.5 commited on

feat: Include token metadata in analysis response
ee0f6c9

gary-boon Claude Opus 4.5 commited on

feat: Implement tier-based model filtering by device type
6bf9f5c

gary-boon Claude Opus 4.5 commited on

Fix: Add attn_implementation="eager" to model switch function
f94a7ae

gary-boon Claude Opus 4.5 commited on

Add tokenSections boundaries and update system prompt
c6f4cc5

gary-boon Claude Opus 4.5 commited on

Fix: Handle MistralCommonTokenizer pad_token setter
e20ccaf

gary-boon Claude Opus 4.5 commited on

Integrate mistral-common for correct Devstral tokenization
ed06dcb

gary-boon Claude Opus 4.5 commited on

Remove mistral_common to fix dependency conflict
3d9d9ee

gary-boon Claude Opus 4.5 commited on

Use mistral_common for proper Devstral prompt formatting
3e80769

gary-boon Claude Opus 4.5 commited on

Add system prompt support for instruction-tuned models
2860768

gary-boon Claude Opus 4.5 commited on

fix: Simpler prompt format and temperature=0 for Devstral
76020ee

gary-boon Claude Opus 4.5 commited on

fix: Sanitize JSON response for NaN/Inf float values
99f6209

gary-boon Claude Opus 4.5 commited on

fix: Check chat_template is set before using apply_chat_template
474927d

gary-boon Claude Opus 4.5 commited on

fix: Add chat template support for Devstral instruct model
8d85da8

gary-boon Claude Opus 4.5 commited on

fix: Convert bfloat16 to float32 for numpy compatibility
cb6f39c

gary-boon Claude Opus 4.5 commited on

fix: Use eager attention for output_attentions support
5333b21

gary-boon Claude Opus 4.5 commited on

Add vocabSize to modelInfo response
499afba

gary-boon Claude Opus 4.5 commited on

Add recommended_dtype to model configs
62525b2

gary-boon Claude Opus 4.5 commited on

Phase 2: Add Devstral backend support
9080f28

gary-boon Claude Opus 4.5 commited on

Add DEVICE env var to force CPU mode on DGX Spark
5f122aa

gary-boon Claude Opus 4.5 commited on

Make zarr/numcodecs imports optional for ARM64 compatibility
6435a75

gary-boon Claude Opus 4.5 commited on

Phase 1: DGX Spark infrastructure
a2bd186

gary-boon Claude Opus 4.5 commited on

Make QKV hook robust against shape mismatches
343dd57

gary-boon Claude commited on

Fix research attention endpoint model compatibility
f5ba954

gary-boon Claude commited on

Add research attention analysis endpoint with real CodeGen tokenization
8f63685

gary-boon Claude commited on

Add research attention analysis endpoints with Q/K/V extraction
37ed739

gary-boon Claude commited on

Fix ablation study for Code Llama compatibility
cd300ee

gary-boon Claude commited on

Fix model info endpoint for Code Llama compatibility
7dd568f

gary-boon Claude commited on

Add Code Llama 7B support with hardware-aware filtering and ICL timeout fixes
ed40a9a

gary-boon Claude commited on

Fix pyarrow compatibility issue with datasets library
1680fda

gary-boon Claude commited on

Fix syntax error in swe_bench_service.py
9dbec03

gary-boon Claude commited on

Remove all mock data from SWE-bench - real data only
c0d95bf

gary-boon Claude commited on

Add GitHub URLs and improve mock data for SWE-bench
22c69fa

gary-boon Claude commited on

Fix SWE-bench service to gracefully handle dataset loading failures
ae9e159

gary-boon Claude commited on

Fix SWE-bench service to return full problem statements
1d23728

gary-boon Claude commited on

Add SWE-bench integration and improve backend routing
4444ae2

gary-boon Claude commited on

Add layer_stride parameter for PromptDiff optimization
5aed1a9

gary-boon Claude commited on

Capture complete attention patterns after generation
992dc8c

gary-boon commited on

Fix: Use scaling approach instead of skipping layers
3c774b5

gary-boon Claude commited on

Fix: Refine layer hook output format handling
4b03268

gary-boon Claude commited on