Jingqing Ruan

Amanda2023

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

lordx64/Qwable-v1

Organizations

None yet

upvoted a paper 2 days ago

CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents

Paper • 2606.22883 • Published 5 days ago • 33

upvoted a paper 2 months ago

SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting

Paper • 2604.10688 • Published Apr 12 • 26

upvoted a collection 3 months ago

PaTaRM

PaTaRM is a Generative Reward Model (GRM) for RLHF alignment. • 4 items • Updated Apr 2 • 2

upvoted 2 papers 3 months ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 187

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Paper • 2603.16448 • Published Mar 17 • 58

upvoted a paper 6 months ago

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 89

upvoted a paper about 1 year ago

When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning

Paper • 2505.15400 • Published May 21, 2025 • 23