Artem

kabachuha

https://scholar.google.com/citations?user=_kUfYFUAAAAJ

AI & ML interests

deep learning, natural language processing, text2image, text2video, computer vision

Recent Activity

new activity 4 days ago

edwixx/diffusiongemma-26B-A4B-it-HERETIC-Uncensored:The own model that own says own own often own

new activity 5 days ago

Gryphe/Gemma-4-31B-StyleTune:Now I feel stupid :)

liked a model 5 days ago

Nimbz/Gemma-4-Dark-Gemistry-31B

Organizations

upvoted a paper 14 days ago

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

Paper • 2606.10029 • Published 17 days ago • 12

upvoted 2 papers 16 days ago

A Geometric Account of Activation Steering through Angle-Norm Decomposition

Paper • 2606.06735 • Published 22 days ago • 25

Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders

Paper • 2606.07473 • Published 21 days ago • 15

upvoted an article 24 days ago

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

Article • nvidia • 24 days ago • 83

upvoted a collection 2 months ago

KVAE 2.0

Collection

KVAE 2.0 is a family of video tokenizers with a time compression ratio of 4 and spatial compression ratios of 8 and 16 • 2 items • Updated Apr 16 • 3

upvoted a paper 3 months ago

Interpreting CLIP with Hierarchical Sparse Autoencoders

Paper • 2502.20578 • Published Feb 27, 2025 • 1

upvoted a collection 3 months ago

My LTX-2 Loras

Collection

9 items • Updated Apr 3 • 7

upvoted 2 papers 4 months ago

SOM Directions are Better than One: Multi-Directional Refusal Suppression in Language Models

Paper • 2511.08379 • Published Nov 11, 2025 • 5

Effective Reasoning Chains Reduce Intrinsic Dimensionality

Paper • 2602.09276 • Published Feb 9 • 12

upvoted 4 papers 5 months ago

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Paper • 2602.08676 • Published Feb 9 • 73

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

Paper • 2602.05027 • Published Feb 4 • 63

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 233

Cross-Frame Representation Alignment for Fine-Tuning Video Diffusion Models

Paper • 2506.09229 • Published Jun 10, 2025 • 7

upvoted 3 papers 6 months ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 183

Gamayun's Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM

Paper • 2512.21580 • Published Dec 25, 2025 • 9

PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing

Paper • 2303.10845 • Published Mar 20, 2023 • 3

upvoted 2 papers 7 months ago

HunyuanVideo 1.5 Technical Report

Paper • 2511.18870 • Published Nov 24, 2025 • 29

Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

Paper • 2511.15210 • Published Nov 19, 2025 • 91

upvoted 2 papers over 1 year ago

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published Mar 17, 2025 • 95

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5, 2025 • 234
