Commit History
Emit per-token embedding L2 norms for RQ1 Step 3 336085f
Emit per-layer top-K SwiGLU intermediate activations for RQ1 Step 5 019492b
Emit effective_probability and sampling eligibility per alternative 6f43bd6
gary-boon committed on
Expose query_position and query_token_id in /attention/row response d2d962d
gary-boon committed on
Derive aggregate-row num_heads from request metadata, not live model 3506846
gary-boon committed on
Fix empty text and incorrect is_special for Mistral control tokens b5e4add
gary-boon Claude Opus 4.6 (1M context) committed on
Fix score-all classification in vectorised path e375e45
gary-boon Claude Opus 4.6 (1M context) committed on
Refactor head classification from cascade to score-all-then-rank 10baadf
gary-boon Claude Opus 4.6 (1M context) committed on
Add FFN contribution metrics and gate activation stats to research endpoint 82349c1
gary-boon Claude Opus 4.6 (1M context) committed on
Add tuned lens as supplementary projection mode for logit lens 6f48db0
gary-boon Claude Opus 4.6 committed on
Rename 'counterfactual' terminology to 'input ablation' in docstrings 0d76811
gary-boon Claude Opus 4.6 committed on
Fix margin winner_token in margin_data and extract _classify_stability helper ecf49a2
gary-boon Claude Opus 4.6 committed on
Add margin-based decision analysis, interventional counterfactuals, and run comparison (v3.0) 121a2d9
gary-boon Claude Opus 4.6 committed on
Add deep inspection: data-driven pattern classification, attention/MLP tracking, logit lens 54d9b6e
gary-boon Claude Opus 4.6 committed on
Change default model to Devstral and optimise attention extraction d8d197a
gary-boon Claude Opus 4.6 committed on
Fix MistralTokenizer loading and token section boundaries 76b614d
gary-boon Claude Opus 4.5 committed on
Fix MistralTokenizer not loaded during model switch 2bdf299
gary-boon Claude Opus 4.5 committed on
feat: add top-k/top-p sampling and detailed logits/probability tracking 2c6343b
gary-boon Claude Opus 4.5 committed on
fix: Improve token alternatives numerical stability and temperature control bc1f0e0
gary-boon Claude Opus 4.5 committed on
Store matrices as numpy arrays instead of Python lists f2e89c2
gary-boon Claude Opus 4.5 committed on
Add per-step memory cleanup for large model support a94eb19
gary-boon Claude Opus 4.5 committed on
Fix RAM exhaustion for large token generation 959074d
gary-boon Claude Opus 4.5 committed on
feat: add auto_complete parameter for token generation bb689ce
gary-boon Claude Opus 4.5 committed on
fix: add QKV extraction support for Mistral/Devstral architecture d1d37a8
gary-boon Claude Opus 4.5 committed on
feat: implement lazy-loading for attention matrices 929ba88
gary-boon Claude Opus 4.5 committed on
Add avg_entropy calculation for attention heads 66a46b6
gary-boon Claude Opus 4.5 committed on
Revert QKV visualization fixes - need better approach for data streaming d0b7e29
gary-boon Claude Opus 4.5 committed on
Add safety checks for missing QKV keys a79cb83
gary-boon Claude Opus 4.5 committed on
Limit QKV matrices to top 5 heads per layer to reduce response size decb5ab
gary-boon Claude Opus 4.5 committed on
Fix QKV matrix extraction for Mistral/Devstral architecture 9056859
gary-boon Claude Opus 4.5 committed on
Fix QKV visualization for Mistral/Devstral architecture 4ec134b
gary-boon Claude Opus 4.5 committed on
Fix: Import time module at top level for SSE events 15a862b
gary-boon Claude Opus 4.5 committed on
Add SSE streaming endpoint for real-time analysis progress 172a186
gary-boon Claude Opus 4.5 committed on
feat: Include token metadata in analysis response ee0f6c9
gary-boon Claude Opus 4.5 committed on
feat: Implement tier-based model filtering by device type 6bf9f5c
gary-boon Claude Opus 4.5 committed on
Fix: Add attn_implementation="eager" to model switch function f94a7ae
gary-boon Claude Opus 4.5 committed on
Add tokenSections boundaries and update system prompt c6f4cc5
gary-boon Claude Opus 4.5 committed on
Fix: Handle MistralCommonTokenizer pad_token setter e20ccaf
gary-boon Claude Opus 4.5 committed on
Integrate mistral-common for correct Devstral tokenization ed06dcb
gary-boon Claude Opus 4.5 committed on
Remove mistral_common to fix dependency conflict 3d9d9ee
gary-boon Claude Opus 4.5 committed on
Use mistral_common for proper Devstral prompt formatting 3e80769
gary-boon Claude Opus 4.5 committed on
Add system prompt support for instruction-tuned models 2860768
gary-boon Claude Opus 4.5 committed on
fix: Simpler prompt format and temperature=0 for Devstral 76020ee
gary-boon Claude Opus 4.5 committed on
fix: Sanitize JSON response for NaN/Inf float values 99f6209
gary-boon Claude Opus 4.5 committed on
fix: Check chat_template is set before using apply_chat_template 474927d
gary-boon Claude Opus 4.5 committed on
fix: Add chat template support for Devstral instruct model 8d85da8
gary-boon Claude Opus 4.5 committed on
fix: Convert bfloat16 to float32 for numpy compatibility cb6f39c
gary-boon Claude Opus 4.5 committed on
fix: Use eager attention for output_attentions support 5333b21
gary-boon Claude Opus 4.5 committed on
Add vocabSize to modelInfo response 499afba
gary-boon Claude Opus 4.5 committed on