Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders Paper • 2606.10029 • Published 16 days ago • 12
A Geometric Account of Activation Steering through Angle-Norm Decomposition Paper • 2606.06735 • Published 21 days ago • 25
Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders Paper • 2606.07473 • Published 20 days ago • 15
Article Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action nvidia • 24 days ago • 83
KVAE 2.0 Collection KVAE 2.0 is a family of video tokenizers with a time compression ratio of 4 and spatial compression ratios of 8 and 16 • 2 items • Updated Apr 16 • 3
Interpreting CLIP with Hierarchical Sparse Autoencoders Paper • 2502.20578 • Published Feb 27, 2025 • 1
SOM Directions are Better than One: Multi-Directional Refusal Suppression in Language Models Paper • 2511.08379 • Published Nov 11, 2025 • 5
AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders Paper • 2602.05027 • Published Feb 4 • 63
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 233
Cross-Frame Representation Alignment for Fine-Tuning Video Diffusion Models Paper • 2506.09229 • Published Jun 10, 2025 • 7
Gamayun's Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM Paper • 2512.21580 • Published Dec 25, 2025 • 9
PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing Paper • 2303.10845 • Published Mar 20, 2023 • 3
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story Paper • 2511.15210 • Published Nov 19, 2025 • 91
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17, 2025 • 95
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5, 2025 • 234