AMIRAN KURTANIDZE

sunsulaki

33 23

AI & ML interests

None yet

Recent Activity

upvoted an article 1 day ago

80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your laptop

upvoted a paper 15 days ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

upvoted a paper 15 days ago

APPO: Agentic Procedural Policy Optimization

View all activity

Organizations

None yet

upvoted an article 1 day ago

Article

80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your laptop

hugging-science

•

2 days ago

• 16

upvoted 10 papers 15 days ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2606.15007 • Published 20 days ago • 16

APPO: Agentic Procedural Policy Optimization

Paper • 2606.12384 • Published 21 days ago • 79

On the Geometry of On-Policy Distillation

Paper • 2606.07082 • Published 27 days ago • 75

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 21 days ago • 92

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published 29 days ago • 126

MiniMax Sparse Attention

Paper • 2606.13392 • Published 21 days ago • 148

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 17 days ago • 121

upvoted a paper 23 days ago

GRAM-R^2: Self-Training Generative Foundation Reward Models for Reward Reasoning

Paper • 2509.02492 • Published Sep 2, 2025 • 2

upvoted 2 papers about 1 month ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 35

Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models

Paper • 2605.17672 • Published May 17 • 23

upvoted a paper about 2 months ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published May 7 • 85

upvoted a paper 3 months ago

The Universal Normal Embedding

Paper • 2603.21786 • Published Mar 23 • 16

upvoted a paper 4 months ago

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Paper • 2511.10645 • Published Nov 13, 2025 • 14

upvoted a collection 4 months ago

ParoQuant

Collection

Pairwise Rotation Quantization for Efficient Reasoning LLM Inference • 24 items • Updated 24 days ago • 27

upvoted 2 papers 4 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

Chain of Mindset: Reasoning with Adaptive Cognitive Modes

Paper • 2602.10063 • Published Feb 10 • 75

AMIRAN KURTANIDZE

AI & ML interests

Recent Activity

Organizations

sunsulaki's activity

80TB+ of astronomy for the HDD-poor: crossmatch the Multimodal Universe from your laptop