Xun Wu

YUSHUIWX

https://yushuiwx.github.io/

AI & ML interests

Mixture-of-Experts Multi-Modality LLMs Reasoning

Organizations

None yet

upvoted a paper 3 months ago

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published Mar 17 • 60

upvoted a paper 4 months ago

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

Paper • 2603.05168 • Published Mar 5 • 6

upvoted a paper 11 months ago

Geometric-Mean Policy Optimization

Paper • 2507.20673 • Published Jul 28, 2025 • 32

upvoted 2 papers about 1 year ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

On-Policy RL with Optimal Reward Baseline

Paper • 2505.23585 • Published May 29, 2025 • 14

upvoted an article about 1 year ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Article • medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 281

upvoted a paper about 1 year ago

Think Only When You Need with Large Hybrid-Reasoning Models

Paper • 2505.14631 • Published May 20, 2025 • 20