Zhang's picture

Zhang

Diluner

·

AI & ML interests

None yet

Recent Activity

new activity 5 days ago

shizhuo2/sokoban-diversity-trajectories:v2: refresh 100k/ (MM 357292, HOM 259505, HET 188588) + README counts

View all activity

Organizations

upvoted 2 papers 27 days ago

CausaLab: A Scalable Environment for Interactive Causal Discovery Toward AI Scientists

Paper • 2605.26029 • Published 28 days ago • 18

Self-Improving Language Models with Bidirectional Evolutionary Search

Paper • 2605.28814 • Published 29 days ago • 60

upvoted a paper about 1 month ago

Useful Memories Become Faulty When Continuously Updated by LLMs

Paper • 2605.12978 • Published May 13 • 19

upvoted 3 papers 5 months ago

SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers

Paper • 2602.05115 • Published Feb 4 • 20

Sparse Reward Subsystem in Large Language Models

Paper • 2602.00986 • Published May 11 • 13

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Paper • 2602.01058 • Published Feb 1 • 45

upvoted 2 papers about 1 year ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 191

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98

upvoted 2 papers over 1 year ago

Building A Proof-Oriented Programmer That Is 64% Better Than GPT-4o Under Data Scarsity

Paper • 2502.11901 • Published Feb 17, 2025 • 6

Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization

Paper • 2410.04717 • Published Oct 7, 2024 • 18