
Jiarui Yao

FlippyDora

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2606.15007 • Published 13 days ago • 15

upvoted 2 papers 14 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 16 days ago • 41

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 17 days ago • 33

upvoted 2 papers 15 days ago

Lean4Agent: Formal Modeling and Verification for Agent Workflow and Trajectory

Paper • 2606.06523 • Published 23 days ago • 6

AsyncWebRL: Efficient Multi-Step RL for Visual Web Agents

Paper • 2606.05597 • Published 21 days ago • 4

upvoted a paper 17 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 24 days ago • 134

upvoted a paper 22 days ago

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

Paper • 2606.02754 • Published 23 days ago • 13

upvoted a paper 23 days ago

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published 24 days ago • 232

updated a model about 1 month ago

jrtmp/preditive-mask

Updated about 1 month ago

published a model about 1 month ago

jrtmp/preditive-mask

Updated about 1 month ago

updated a dataset about 1 month ago

CorrectKLinRL/math500

Viewer • Updated May 20 • 500 • 7

published a dataset about 1 month ago

CorrectKLinRL/math500

Viewer • Updated May 20 • 500 • 7

updated a model about 1 month ago

CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-forward_k3-clipLow_inf-clipHigh_inf

2B • Updated May 18 • 4

published a model about 1 month ago

CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-forward_k3-clipLow_inf-clipHigh_inf

2B • Updated May 18 • 4

updated a model about 1 month ago

CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-reverse_k3-clipLow_inf-clipHigh_inf

2B • Updated May 18 • 3

published a model about 1 month ago

CorrectKLinRL/Qwen3-1.7B-Base-prlCurrentKL-eta100-reverse_k3-clipLow_inf-clipHigh_inf

2B • Updated May 18 • 3

updated a dataset about 1 month ago

CorrectKLinRL/olympiadbench

Viewer • Updated May 18 • 674 • 7

published a dataset about 1 month ago

CorrectKLinRL/olympiadbench

Viewer • Updated May 18 • 674 • 7

updated a dataset about 1 month ago

CorrectKLinRL/minerva_math

Viewer • Updated May 18 • 272 • 12

published a dataset about 1 month ago

CorrectKLinRL/minerva_math

Viewer • Updated May 18 • 272 • 12
