zhuxl's picture

7

zhuxl

zhuzhu00

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

upvoted a paper about 2 months ago

On Domain-Specific Post-Training for Multimodal Large Language Models

upvoted a paper about 2 months ago

How to Synthesize Text Data without Model Collapse?

View all activity

Organizations

None yet

upvoted 7 papers about 2 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 96

On Domain-Specific Post-Training for Multimodal Large Language Models

Paper • 2411.19930 • Published Nov 29, 2024 • 30

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 53

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Paper • 2503.04548 • Published Mar 6, 2025 • 9

Reasoning with Exploration: An Entropy Perspective

Paper • 2506.14758 • Published Jun 17, 2025 • 30

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 117

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 85