Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 65
LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding Paper • 2602.23881 • Published 14 days ago • 18
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization Paper • 2602.03537 • Published Feb 3 • 4
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers Paper • 2602.02016 • Published Feb 2 • 12
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published Jan 30 • 57
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization Paper • 2512.00956 • Published Nov 30, 2025 • 23
Article A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons Feb 4, 2025 • 31
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face Jul 29, 2025 • 218
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization Paper • 2509.23202 • Published Sep 27, 2025 • 29
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm Paper • 2507.18553 • Published Jul 24, 2025 • 41
∇NABLA: Neighborhood Adaptive Block-Level Attention Paper • 2507.13546 • Published Jul 17, 2025 • 125
Geopolitical Biases in LLMs: What Are the "Good" and the "Bad" Countries According to Contemporary Language Models Paper • 2506.06751 • Published Jun 7, 2025 • 71
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA Paper • 2505.21115 • Published May 27, 2025 • 142
Alchemist: Turning Public Text-to-Image Data into Generative Gold Paper • 2505.19297 • Published May 25, 2025 • 84
Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models Paper • 2505.16134 • Published May 22, 2025 • 18
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published May 20, 2025 • 78