Suzie Oh's picture

Suzie Oh

ohsuz

·

ohsuz

AI & ML interests

None yet

Recent Activity

liked a dataset 25 days ago

openbmb/Ultra-FineWeb-L3

liked a dataset 2 months ago

Crownelius/Opus-4.6-Reasoning-3300x

upvoted a collection 3 months ago

Cosmos-Predict2.5

View all activity

Organizations

upvoted a collection 3 months ago

Cosmos-Predict2.5

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3 • 2 items • Updated 15 days ago • 23

upvoted a paper 3 months ago

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published Mar 16 • 155

upvoted a paper 4 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

upvoted a collection 5 months ago

TranslateGemma

3 items • Updated Mar 12 • 245

upvoted a paper 5 months ago

Scaling Generalist Data-Analytic Agents

Paper • 2509.25084 • Published Sep 29, 2025 • 22

upvoted 2 papers 6 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 159

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 85

upvoted 5 articles 6 months ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

AviSoori1x

•

May 7, 2024

• 122

Article

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

MiniMaxAI

•

Jan 5

• 41

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

MiniMax-AI

•

Oct 30, 2025

• 80

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

MiniMax-AI

•

Oct 30, 2025

• 43

Article

What makes good reasoning data

MiniMax-AI

•

Oct 30, 2025

• 45

upvoted a collection 6 months ago

Open Korean LLM (MSIT 2025)

6 items • Updated Jan 2 • 16

upvoted a collection 7 months ago

ToolRM

ToolRM: Towards Agentic Tool-Use Reward Modeling • 4 items • Updated Mar 2 • 4

upvoted an article 7 months ago

Article

How to Build an MCP Server with Gradio

abidlabs, ysharma

•

Apr 30, 2025

• 202

upvoted a paper 7 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29, 2025 • 38

upvoted a collection 7 months ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1, 2025 • 3

upvoted an article 7 months ago

Article

Jupyter Agents: training LLMs to reason with notebooks

+1

baptistecolle, hannayukhymenko, lvwerra

•

Sep 10, 2025

• 65

upvoted 2 papers 7 months ago

FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling

Paper • 2510.24645 • Published Oct 28, 2025 • 11

Spurious Rewards: Rethinking Training Signals in RLVR

Paper • 2506.10947 • Published Jun 12, 2025 • 2