32 23

AMIRAN KURTANIDZE

sunsulaki

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

upvoted a paper 8 days ago

APPO: Agentic Procedural Policy Optimization

upvoted a paper 8 days ago

On the Geometry of On-Policy Distillation

View all activity

Organizations

None yet

upvoted 10 papers 8 days ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2606.15007 • Published 13 days ago • 15

APPO: Agentic Procedural Policy Optimization

Paper • 2606.12384 • Published 14 days ago • 77

On the Geometry of On-Policy Distillation

Paper • 2606.07082 • Published 20 days ago • 73

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 14 days ago • 90

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published 22 days ago • 124

MiniMax Sparse Attention

Paper • 2606.13392 • Published 14 days ago • 145

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 10 days ago • 113

liked a model 12 days ago

chopratejas/kompress-v2-base

Token Classification • 0.2B • Updated 15 days ago • 1.18k • 15

liked a model 13 days ago

zjukg/OntoTune-sft-Llama3-8B

8B • Updated Jun 7, 2025 • 4 • 4

upvoted a paper 16 days ago

GRAM-R^2: Self-Training Generative Foundation Reward Models for Reward Reasoning

Paper • 2509.02492 • Published Sep 2, 2025 • 2

liked 3 models 16 days ago

Ray2333/GRM-Llama3.2-3B-rewardmodel-ft

Text Classification • 3B • Updated Apr 30, 2025 • 2.35k • 14

nicholasKluge/RewardModel

Text Classification • 0.1B • Updated Jun 9, 2025 • 188 • 2

vectara/hallucination_evaluation_model

Text Classification • 0.1B • Updated Oct 20, 2025 • 117k • 355

liked a model 18 days ago

teapotai/tinyteapot

Text Generation • 77M • Updated Feb 23 • 364 • 23

liked a model 26 days ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 13 days ago • 359k • 2.35k

liked a dataset 26 days ago

openbmb/UltraData-SFT-2605

Updated 27 days ago • 49.2k • 351

upvoted a paper about 1 month ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 34

AMIRAN KURTANIDZE

AI & ML interests

Recent Activity

Organizations

sunsulaki's activity