227 448

dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

World Action Models: A Survey

upvoted a paper 22 days ago

Trust Region On-Policy Distillation

upvoted a paper 22 days ago

Self-Distilled Policy Gradient

View all activity

Organizations

None yet

upvoted a paper 3 days ago

World Action Models: A Survey

Paper • 2606.20781 • Published 9 days ago • 52

upvoted 2 papers 22 days ago

Trust Region On-Policy Distillation

Paper • 2606.01249 • Published 27 days ago • 44

Self-Distilled Policy Gradient

Paper • 2606.04036 • Published 25 days ago • 27

upvoted a paper 25 days ago

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published 29 days ago • 66

liked a model 2 months ago

unsloth/Qwen3.6-35B-A3B-GGUF

Image-Text-to-Text • 35B • Updated Apr 20 • 906k • 1.28k

upvoted a paper 3 months ago

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Paper • 2604.02029 • Published Apr 2 • 152

liked 3 models 3 months ago

upvoted 2 papers 3 months ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 158

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published Dec 31, 2025 • 109

liked 2 models 4 months ago

unsloth/Qwen3.5-9B-GGUF

Image-Text-to-Text • 9B • Updated Mar 2 • 1.06M • 718

unsloth/Qwen3.5-35B-A3B-GGUF

Image-Text-to-Text • 35B • Updated Mar 5 • 144k • 850

liked a dataset 4 months ago

togethercomputer/CoderForge-Preview

Viewer • Updated Feb 26 • 827k • 4.58k • 170

liked 2 models 4 months ago

Qwen/Qwen3.5-35B-A3B

Image-Text-to-Text • 36B • Updated Apr 24 • 2.14M • • 1.45k

LocoreMind/LocoOperator-4B

Text Generation • 4B • Updated Feb 24 • 445 • • 279

liked a dataset 4 months ago

SWE-Gym/SWE-Gym

Viewer • Updated May 10, 2025 • 2.44k • 92.3k • 25

upvoted a paper 4 months ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221

upvoted an article 4 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

lysandre, ArthurZ, cyrilvallez, reach-vb

•

Dec 1, 2025

• 311

liked a dataset 4 months ago

neulab/agent-data-collection

Preview • Updated Mar 9 • 3.57k • 114

dfuhoiysOHSVFh82934gfjklb

AI & ML interests

Recent Activity

Organizations

huba-buba's activity

Transformers v5: Simple model definitions powering the AI ecosystem