Guangxiang Zhao's picture

Guangxiang Zhao

zhaoguangxiang

·

https://guangxiang.cc/

AI & ML interests

None yet

Recent Activity

new activity 28 days ago

stepfun-ai/Step-3.5-Flash-SFT:Could you please provide sources of the data

liked a dataset about 1 month ago

liked a dataset about 1 month ago

OpenSeeker/OpenSeeker-v1-Data

View all activity

Organizations

upvoted a collection about 1 month ago

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 4 days ago • 123

upvoted a paper 2 months ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Paper • 2602.12125 • Published Feb 12 • 62

upvoted 2 papers 6 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18, 2025 • 3

upvoted a paper 11 months ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5, 2025 • 21

upvoted a paper 12 months ago

DeepCritic: Deliberate Critique with Large Language Models

Paper • 2505.00662 • Published May 1, 2025 • 54

upvoted a paper about 1 year ago

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20, 2025 • 49