Integrate mistral-common for correct Devstral tokenization ed06dcb gary-boon Claude Opus 4.5 commited on Dec 15, 2025
Remove mistral_common to fix dependency conflict 3d9d9ee gary-boon Claude Opus 4.5 commited on Dec 15, 2025
Use mistral_common for proper Devstral prompt formatting 3e80769 gary-boon Claude Opus 4.5 commited on Dec 15, 2025
Add system prompt support for instruction-tuned models 2860768 gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Simpler prompt format and temperature=0 for Devstral 76020ee gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Sanitize JSON response for NaN/Inf float values 99f6209 gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Check chat_template is set before using apply_chat_template 474927d gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Add chat template support for Devstral instruct model 8d85da8 gary-boon Claude Opus 4.5 commited on Dec 15, 2025
fix: Convert bfloat16 to float32 for numpy compatibility cb6f39c gary-boon Claude Opus 4.5 commited on Dec 14, 2025
fix: Use eager attention for output_attentions support 5333b21 gary-boon Claude Opus 4.5 commited on Dec 14, 2025
Add DEVICE env var to force CPU mode on DGX Spark 5f122aa gary-boon Claude Opus 4.5 commited on Dec 14, 2025
Make zarr/numcodecs imports optional for ARM64 compatibility 6435a75 gary-boon Claude Opus 4.5 commited on Dec 14, 2025
Fix research attention endpoint model compatibility f5ba954 gary-boon Claude commited on Nov 18, 2025
Add research attention analysis endpoint with real CodeGen tokenization 8f63685 gary-boon Claude commited on Nov 18, 2025
Add research attention analysis endpoints with Q/K/V extraction 37ed739 gary-boon Claude commited on Nov 13, 2025
Fix model info endpoint for Code Llama compatibility 7dd568f gary-boon Claude commited on Oct 31, 2025
Add Code Llama 7B support with hardware-aware filtering and ICL timeout fixes ed40a9a gary-boon Claude commited on Oct 30, 2025
Fix pyarrow compatibility issue with datasets library 1680fda gary-boon Claude commited on Sep 16, 2025
Remove all mock data from SWE-bench - real data only c0d95bf gary-boon Claude commited on Sep 16, 2025
Add GitHub URLs and improve mock data for SWE-bench 22c69fa gary-boon Claude commited on Sep 16, 2025
Fix SWE-bench service to gracefully handle dataset loading failures ae9e159 gary-boon Claude commited on Sep 16, 2025
Fix SWE-bench service to return full problem statements 1d23728 gary-boon Claude commited on Sep 16, 2025
Add SWE-bench integration and improve backend routing 4444ae2 gary-boon Claude commited on Sep 15, 2025
Add layer_stride parameter for PromptDiff optimization 5aed1a9 gary-boon Claude commited on Sep 12, 2025
Fix: Use scaling approach instead of skipping layers 3c774b5 gary-boon Claude commited on Sep 2, 2025
Fix: Handle single-element tuple outputs in layer hook 9e42df9 gary-boon Claude commited on Sep 2, 2025
Fix: Correct layer hook output format for layer_norm compatibility 070f9b8 gary-boon Claude commited on Sep 1, 2025
Simplify workflow: Remove automated HuggingFace deployment e6aff69 gary-boon commited on Aug 28, 2025
feat: Add pipeline analyzer and QKV extractor for transformer visualization 767a3fd gary-boon Claude commited on Aug 27, 2025
Add ablation support to model service with comprehensive testing bb8a292 gary-boon Claude commited on Aug 20, 2025