Merge pull request #6 from gary-boon/feature/rq1-ffn-intermediate a1b2ad2 unverified gary-boon committed on 3 days ago
Backend: unembedding cosine emission + Devstral tokenizer fallback 3a10cf3 garyboon Claude Opus 4.7 (1M context) committed on 3 days ago
Emit per-token embedding L2 norms for RQ1 Step 3 336085f garyboon Claude Opus 4.7 (1M context) committed on 5 days ago
Emit per-layer top-K SwiGLU intermediate activations for RQ1 Step 5 019492b garyboon Claude Opus 4.7 (1M context) committed on 5 days ago
Merge: F2 backend — emit effective_probability for sampling-effective margins (#5) e11c277 unverified gary-boon committed on 11 days ago
Merge: F1 backend — expose query_position and query_token_id (#4) 778d7c8 unverified gary-boon committed on 11 days ago
Merge: F3a — derive aggregate-row num_heads from request metadata (#1) 78c7200 unverified gary-boon committed on 11 days ago
Emit effective_probability and sampling eligibility per alternative 6f43bd6 gary-boon committed on 11 days ago
Expose query_position and query_token_id in /attention/row response d2d962d gary-boon committed on 11 days ago
Derive aggregate-row num_heads from request metadata, not live model 3506846 gary-boon committed on 11 days ago
Fix empty text and incorrect is_special for Mistral control tokens b5e4add gary-boon Claude Opus 4.6 (1M context) committed on 14 days ago
Fix score-all classification in vectorised path e375e45 gary-boon Claude Opus 4.6 (1M context) committed on 25 days ago
Merge branch 'main' into feature/score-all-classification bfdde66 gary-boon committed on about 1 month ago
Pin PyTorch <2.11.0 to fix CUDA driver incompatibility on HF Spaces 8b9ad4b gary-boon Claude Opus 4.6 (1M context) committed on about 1 month ago
Revert "Fix model selector not showing loaded GPU model on CPU-detected hardware" ff8be9d gary-boon committed on about 1 month ago
Fix model selector not showing loaded GPU model on CPU-detected hardware 6c5265e gary-boon Claude Opus 4.6 (1M context) committed on about 1 month ago
Refactor head classification from cascade to score-all-then-rank 10baadf gary-boon Claude Opus 4.6 (1M context) committed on about 1 month ago
Add FFN contribution metrics and gate activation stats to research endpoint 82349c1 gary-boon Claude Opus 4.6 (1M context) committed on about 1 month ago
Add tuned lens as supplementary projection mode for logit lens 6f48db0 gary-boon Claude Opus 4.6 committed on Feb 25
Rename 'counterfactual' terminology to 'input ablation' in docstrings 0d76811 gary-boon Claude Opus 4.6 committed on Feb 25
Fix margin winner_token in margin_data and extract _classify_stability helper ecf49a2 gary-boon Claude Opus 4.6 committed on Feb 25
Add margin-based decision analysis, interventional counterfactuals, and run comparison (v3.0) 121a2d9 gary-boon Claude Opus 4.6 committed on Feb 25
Add deep inspection: data-driven pattern classification, attention/MLP tracking, logit lens 54d9b6e gary-boon Claude Opus 4.6 committed on Feb 24
Change default model to Devstral and optimise attention extraction d8d197a gary-boon Claude Opus 4.6 committed on Feb 22
Fix PyTorch dynamo cache directory issue in containers 9978aec gary-boon Claude Opus 4.5 committed on Feb 11
Fix MistralTokenizer loading and token section boundaries 76b614d gary-boon Claude Opus 4.5 committed on Jan 27
Fix MistralTokenizer not loaded during model switch 2bdf299 gary-boon Claude Opus 4.5 committed on Jan 25
feat: add top-k/top-p sampling and detailed logits/probability tracking 2c6343b gary-boon Claude Opus 4.5 committed on Jan 13
fix: Improve token alternatives numerical stability and temperature control bc1f0e0 gary-boon Claude Opus 4.5 committed on Jan 6
Store matrices as numpy arrays instead of Python lists f2e89c2 gary-boon Claude Opus 4.5 committed on Dec 24, 2025
Add per-step memory cleanup for large model support a94eb19 gary-boon Claude Opus 4.5 committed on Dec 24, 2025
Fix RAM exhaustion for large token generation 959074d gary-boon Claude Opus 4.5 committed on Dec 24, 2025
feat: add auto_complete parameter for token generation bb689ce gary-boon Claude Opus 4.5 committed on Dec 24, 2025
fix: add QKV extraction support for Mistral/Devstral architecture d1d37a8 gary-boon Claude Opus 4.5 committed on Dec 24, 2025
feat: implement lazy-loading for attention matrices 929ba88 gary-boon Claude Opus 4.5 committed on Dec 24, 2025
Add avg_entropy calculation for attention heads 66a46b6 gary-boon Claude Opus 4.5 committed on Dec 22, 2025
Revert QKV visualization fixes - need better approach for data streaming d0b7e29 gary-boon Claude Opus 4.5 committed on Dec 17, 2025
Limit QKV matrices to top 5 heads per layer to reduce response size decb5ab gary-boon Claude Opus 4.5 committed on Dec 17, 2025
Fix QKV matrix extraction for Mistral/Devstral architecture 9056859 gary-boon Claude Opus 4.5 committed on Dec 17, 2025
Fix QKV visualization for Mistral/Devstral architecture 4ec134b gary-boon Claude Opus 4.5 committed on Dec 17, 2025
Add future considerations doc for response size optimization 3e67ea2 gary-boon Claude Opus 4.5 committed on Dec 16, 2025
Fix: Import time module at top level for SSE events 15a862b gary-boon Claude Opus 4.5 committed on Dec 16, 2025
Add SSE streaming endpoint for real-time analysis progress 172a186 gary-boon Claude Opus 4.5 committed on Dec 16, 2025
feat: Include token metadata in analysis response ee0f6c9 gary-boon Claude Opus 4.5 committed on Dec 16, 2025
feat: Implement tier-based model filtering by device type 6bf9f5c gary-boon Claude Opus 4.5 committed on Dec 16, 2025
Fix: Add attn_implementation="eager" to model switch function f94a7ae gary-boon Claude Opus 4.5 committed on Dec 16, 2025
Add Phase 5: Performance optimizations to phased plan 383a328 gary-boon Claude Opus 4.5 committed on Dec 15, 2025
Add tokenSections boundaries and update system prompt c6f4cc5 gary-boon Claude Opus 4.5 committed on Dec 15, 2025
Fix: Handle MistralCommonTokenizer pad_token setter e20ccaf gary-boon Claude Opus 4.5 committed on Dec 15, 2025