Chaojun XIAO's picture

Chaojun XIAO

xcjthu

·

https://xcjthu.github.io/

xcjthu

AI & ML interests

NLP、information extraction

Recent Activity

upvoted a paper 2 days ago

Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning

submitted a paper 2 days ago

Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning

upvoted a paper 9 days ago

Rethinking the Role of Efficient Attention in Hybrid Architectures

View all activity

Organizations

upvoted a paper 2 days ago

Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning

Paper • 2606.18831 • Published 9 days ago • 6

upvoted a paper 9 days ago

Rethinking the Role of Efficient Attention in Hybrid Architectures

Paper • 2606.15378 • Published 13 days ago • 17

upvoted 2 papers 9 months ago

InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation

Paper • 2509.24663 • Published Sep 29, 2025 • 18

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 119

upvoted a paper about 1 year ago

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9, 2025 • 99

upvoted a collection about 1 year ago

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated May 24 • 85

upvoted 2 papers over 1 year ago

Densing Law of LLMs

Paper • 2412.04315 • Published Dec 5, 2024 • 19

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published Nov 4, 2024 • 11

upvoted a paper almost 2 years ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4, 2024 • 32

upvoted 2 papers about 2 years ago

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Paper • 2404.06395 • Published Apr 9, 2024 • 26

Advancing LLM Reasoning Generalists with Preference Trees

Paper • 2404.02078 • Published Apr 2, 2024 • 46