Ilya Pereverzin

NodeLinker

AI & ML interests

Isn't it amazing that we let a computer think like a human?

Recent Activity

liked a dataset 9 days ago

victor/claude-fable-worldcup-2026-session

liked a dataset 9 days ago

lazarus19/Vibe-Coding-Instruct

liked a dataset 9 days ago

armand0e/claude-fable-5-claude-code

upvoted 3 papers 11 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 13 days ago • 90

MiniMax Sparse Attention

Paper • 2606.13392 • Published 13 days ago • 144

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Paper • 2606.13673 • Published 13 days ago • 105

upvoted a paper 12 days ago

Verifiable Environments Are LEGO Bricks: Recursive Composition for Reasoning Generalization

Paper • 2606.12373 • Published 14 days ago • 7

upvoted a collection 12 days ago

DiffusionGemma

1 item • Updated 13 days ago • 52

upvoted a paper 12 days ago

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Paper • 2606.12370 • Published 14 days ago • 21

upvoted a paper 18 days ago

Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding

Paper • 2605.29707 • Published 27 days ago • 147

upvoted a paper about 1 month ago

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Paper • 2605.22791 • Published May 21 • 33

upvoted a changelog about 1 month ago

Filter Leaderboards by Model Size

Hugging Face Changelog • May 20 • 134

upvoted 3 papers about 1 month ago

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published May 13 • 62

Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models

Paper • 2605.11887 • Published May 12 • 17

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 114

upvoted a paper about 2 months ago

WebWorld: A Large-Scale World Model for Web Agent Training

Paper • 2602.14721 • Published Feb 16 • 19

upvoted a collection about 2 months ago

Qwen-Scope

16 items • Updated May 14 • 72

upvoted a paper about 2 months ago

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published Apr 29 • 112

upvoted a collection about 2 months ago

MiMo-V2.5

4 items • Updated Apr 27 • 90

upvoted 2 collections 2 months ago

DeepSeek-V4

4 items • Updated Apr 24 • 687

DFlash

Block Diffusion for Flash Speculative Decoding • 22 items • Updated 8 days ago • 132

upvoted 2 papers 2 months ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 244

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 87