babied wake deduct's picture

babied wake deduct

babied-wake-deduct

·

AI & ML interests

None yet

Recent Activity

liked a Space 2 days ago

Supertone/supertonic-3

liked a model 10 days ago

MiniMaxAI/MiniMax-M2.5

liked a model 11 days ago

Zyphra/ZAYA1-8B

View all activity

Organizations

upvoted 2 papers 3 months ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 62

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Paper • 2602.02488 • Published Feb 2 • 36

upvoted an article 5 months ago

Article

The Optimal Architecture for Small Language Models

codelion

•

Dec 26, 2025

• 121

upvoted an article 6 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

lysandre, ArthurZ, cyrilvallez, reach-vb

•

Dec 1, 2025

• 311

upvoted a paper 7 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 115

upvoted an article 7 months ago

Article

Granite 4.0 Nano: Just how small can you go?

ibm-granite

•

Oct 28, 2025

• 124

upvoted 3 papers 8 months ago

SWE-smith: Scaling Data for Software Engineering Agents

Paper • 2504.21798 • Published Apr 30, 2025 • 15

Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents

Paper • 2509.06917 • Published Sep 8, 2025 • 44

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Paper • 2504.07866 • Published Apr 10, 2025 • 12

upvoted a collection 8 months ago

Granite Docling

Models for parsing complex PDFs and structured documents, designed to complement Docling. • 4 items • Updated 21 days ago • 63

upvoted a paper 8 months ago

Lost in Embeddings: Information Loss in Vision-Language Models

Paper • 2509.11986 • Published Sep 15, 2025 • 29

upvoted a collection 8 months ago

Qwen3-Next

4 items • Updated Dec 31, 2025 • 188

upvoted a paper 8 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

upvoted a collection 8 months ago

mmBERT: a modern multilingual encoder

mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9, 2025 • 53

upvoted a paper 8 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238

upvoted 5 collections 9 months ago

MobileCLIP2

MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 30 items • Updated 26 days ago • 61

Hermes 4 Collection

9 items • Updated Mar 2 • 103

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated about 10 hours ago • 104

Seed-OSS

Seed-OSS Open-Source Models • 3 items • Updated Aug 20, 2025 • 63

Ovis2.5

Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19, 2025 • 58