feat: add auto_complete parameter for token generation bb689ce gary-boon Claude Opus 4.5 commited on Dec 24, 2025
fix: add QKV extraction support for Mistral/Devstral architecture d1d37a8 gary-boon Claude Opus 4.5 commited on Dec 24, 2025
feat: implement lazy-loading for attention matrices 929ba88 gary-boon Claude Opus 4.5 commited on Dec 24, 2025
Add avg_entropy calculation for attention heads 66a46b6 gary-boon Claude Opus 4.5 commited on Dec 22, 2025
Revert QKV visualization fixes - need better approach for data streaming d0b7e29 gary-boon Claude Opus 4.5 commited on Dec 17, 2025
Limit QKV matrices to top 5 heads per layer to reduce response size decb5ab gary-boon Claude Opus 4.5 commited on Dec 17, 2025
Fix QKV matrix extraction for Mistral/Devstral architecture 9056859 gary-boon Claude Opus 4.5 commited on Dec 17, 2025
Fix QKV visualization for Mistral/Devstral architecture 4ec134b gary-boon Claude Opus 4.5 commited on Dec 17, 2025
Fix: Import time module at top level for SSE events 15a862b gary-boon Claude Opus 4.5 commited on Dec 16, 2025
Add SSE streaming endpoint for real-time analysis progress 172a186 gary-boon Claude Opus 4.5 commited on Dec 16, 2025
feat: Include token metadata in analysis response ee0f6c9 gary-boon Claude Opus 4.5 commited on Dec 16, 2025
feat: Implement tier-based model filtering by device type 6bf9f5c gary-boon Claude Opus 4.5 commited on Dec 16, 2025
Fix: Add attn_implementation="eager" to model switch function f94a7ae gary-boon Claude Opus 4.5 commited on Dec 16, 2025
Add tokenSections boundaries and update system prompt c6f4cc5 gary-boon Claude Opus 4.5 commited on Dec 15, 2025
Fix: Handle MistralCommonTokenizer pad_token setter e20ccaf gary-boon Claude Opus 4.5 commited on Dec 15, 2025
Integrate mistral-common for correct Devstral tokenization ed06dcb gary-boon Claude Opus 4.5 commited on Dec 15, 2025
Remove mistral_common to fix dependency conflict 3d9d9ee gary-boon Claude Opus 4.5 commited on Dec 15, 2025
Use mistral_common for proper Devstral prompt formatting 3e80769 gary-boon Claude Opus 4.5 commited on Dec 15, 2025
Add system prompt support for instruction-tuned models 2860768 gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Simpler prompt format and temperature=0 for Devstral 76020ee gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Sanitize JSON response for NaN/Inf float values 99f6209 gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Check chat_template is set before using apply_chat_template 474927d gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Add chat template support for Devstral instruct model 8d85da8 gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Convert bfloat16 to float32 for numpy compatibility cb6f39c gary-boon Claude Opus 4.5 commited on Dec 14, 2025
fix: Use eager attention for output_attentions support 5333b21 gary-boon Claude Opus 4.5 commited on Dec 14, 2025
Add DEVICE env var to force CPU mode on DGX Spark 5f122aa gary-boon Claude Opus 4.5 commited on Dec 14, 2025
Make zarr/numcodecs imports optional for ARM64 compatibility 6435a75 gary-boon Claude Opus 4.5 commited on Dec 14, 2025
Fix research attention endpoint model compatibility f5ba954 gary-boon Claude commited on Nov 18, 2025
Add research attention analysis endpoint with real CodeGen tokenization 8f63685 gary-boon Claude commited on Nov 18, 2025
Add research attention analysis endpoints with Q/K/V extraction 37ed739 gary-boon Claude commited on Nov 13, 2025
Fix model info endpoint for Code Llama compatibility 7dd568f gary-boon Claude commited on Oct 31, 2025
Add Code Llama 7B support with hardware-aware filtering and ICL timeout fixes ed40a9a gary-boon Claude commited on Oct 30, 2025
Fix pyarrow compatibility issue with datasets library 1680fda gary-boon Claude commited on Sep 16, 2025
Remove all mock data from SWE-bench - real data only c0d95bf gary-boon Claude commited on Sep 16, 2025
Add GitHub URLs and improve mock data for SWE-bench 22c69fa gary-boon Claude commited on Sep 16, 2025
Fix SWE-bench service to gracefully handle dataset loading failures ae9e159 gary-boon Claude commited on Sep 16, 2025
Fix SWE-bench service to return full problem statements 1d23728 gary-boon Claude commited on Sep 16, 2025
Add SWE-bench integration and improve backend routing 4444ae2 gary-boon Claude commited on Sep 15, 2025
Add layer_stride parameter for PromptDiff optimization 5aed1a9 gary-boon Claude commited on Sep 12, 2025
Fix: Use scaling approach instead of skipping layers 3c774b5 gary-boon Claude commited on Sep 2, 2025