Denis Kuznedelev

SpiridonSunRotator

https://github.com/Godofnothing

Godofnothing

AI & ML interests

Model compression, computer vision, NLP

Recent Activity

liked a Space 3 days ago

Weight-Space Geometry of Offline Reasoning Training

🧭

Interactive weight-space geometry of six reasoning losses

liked a model 8 days ago

AlexWortega/SIQ-1-35B

Text Generation • 35B • Updated about 8 hours ago • 3.85k • 75

liked a model about 1 month ago

AlexWortega/ml-intern-v4-100m-tinystories-20260512-1721

Text Generation • 0.1B • Updated May 12 • 1.42k • 3

upvoted a paper about 1 month ago

MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

Paper • 2605.07850 • Published May 8 • 18

upvoted a paper 3 months ago

Reasoning Shift: How Context Silently Shortens LLM Reasoning

Paper • 2604.01161 • Published Apr 1 • 32

upvoted an article 4 months ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Article • Jan 27 • 80

upvoted a paper 4 months ago

LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding

Paper • 2602.23881 • Published Feb 27 • 18

liked a dataset 4 months ago

ma-xu/fine-t2i

Viewer • Updated Feb 20 • 727k • 30.1k • 109

upvoted a paper 4 months ago

MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization

Paper • 2602.03537 • Published Feb 3 • 5

liked a model 4 months ago

black-forest-labs/FLUX.2-klein-4B

Image-to-Image • Updated Feb 24 • 520k • 753

New activity in Skywork/unipic_nano_2images 4 months ago

Fix of cat command

#2 opened 4 months ago by SpiridonSunRotator

upvoted 2 papers 5 months ago

DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers

Paper • 2602.02016 • Published Feb 2 • 13

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 63

liked a model 5 months ago

tencent/HunyuanImage-3.0-Instruct-Distil

Image-to-Image • 83B • Updated Feb 3 • 3.88k • 61

New activity in tencent/HunyuanImage-3.0-Instruct-Distil 5 months ago

OOM on 4 GPU

#3 opened 5 months ago by SpiridonSunRotator

New activity in tencent/HunyuanImage-3.0-Instruct 5 months ago

cuBLAS error on image generation

#6 opened 5 months ago by SpiridonSunRotator

New activity in tencent/HunyuanImage-3.0-Instruct-Distil 5 months ago

Issues with loading the model

#2 opened 5 months ago by SpiridonSunRotator

updated a model 6 months ago

yresearch/Alice-AI-ART-dev

Text-to-Image • Updated Dec 30, 2025 • 1

published a model 6 months ago

yresearch/Alice-AI-ART-dev

Text-to-Image • Updated Dec 30, 2025 • 1

upvoted a paper 7 months ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published Nov 30, 2025 • 23
