Tongyao's picture

Tongyao PRO

tyzhu

·

tongyao-zhu

AI & ML interests

Natural Language Processing

Recent Activity

liked a Space 11 days ago

nanotron/ultrascale-playbook

updated a model 12 days ago

tyzhu/hf_model_math

upvoted a paper 16 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

View all activity

Organizations

None yet

upvoted a paper 16 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 17 days ago • 142

upvoted a paper 5 months ago

Revisiting Parameter Server in LLM Post-Training

Paper • 2601.19362 • Published Jan 27 • 8

upvoted a paper 8 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 132

upvoted 3 papers 9 months ago

From Harm to Help: Turning Reasoning In-Context Demos into Assets for Reasoning LMs

Paper • 2509.23196 • Published Sep 27, 2025 • 9

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26, 2025 • 70

Variational Reasoning for Language Models

Paper • 2509.22637 • Published Sep 26, 2025 • 70

upvoted 6 papers about 1 year ago

Fostering Video Reasoning via Next-Event Prediction

Paper • 2505.22457 • Published May 28, 2025 • 29

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published May 27, 2025 • 27

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26, 2025 • 24

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19, 2025 • 36

What's "up" with vision-language models? Investigating their struggle with spatial reasoning

Paper • 2310.19785 • Published Oct 30, 2023 • 1

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26, 2025 • 60

upvoted 3 papers over 1 year ago

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Paper • 2503.15450 • Published Mar 19, 2025 • 12

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18, 2025 • 19

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published Oct 21, 2024 • 55

upvoted 2 papers almost 3 years ago

From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning

Paper • 2304.07995 • Published Apr 17, 2023 • 3

In-context Autoencoder for Context Compression in a Large Language Model

Paper • 2307.06945 • Published Jul 13, 2023 • 29