NANYUN (Violet) PENG's picture

5

NANYUN (Violet) PENG

violetpeng

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence

upvoted a paper about 1 month ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

authored a paper 3 months ago

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

View all activity

Organizations

upvoted a paper 29 days ago

ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence

Paper • 2605.26340 • Published May 25 • 36

upvoted a paper about 1 month ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published May 11 • 79

upvoted a paper over 1 year ago

LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints

Paper • 2410.06458 • Published Oct 9, 2024 • 8

upvoted a paper about 2 years ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

upvoted a collection about 2 years ago

Model Extrapolation Expedites Alignment

Better aligned models obtained by model extrapolation (ExPO) • 23 items • Updated Mar 2 • 17