Longxu Dou

dreamerdeo

https://longxudou.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

liked a model about 2 months ago

miromind-ai/MiroThinker-1.7-mini

liked a model about 2 months ago

miromind-ai/MiroThinker-1.7

upvoted a paper 2 months ago

On Data Engineering for Scaling LLM Terminal Capabilities

View all activity

Organizations

liked 2 models about 2 months ago

miromind-ai/MiroThinker-1.7-mini

Text Generation • 31B • Updated 24 days ago • 937 • 95

miromind-ai/MiroThinker-1.7

Text Generation • 235B • Updated 24 days ago • 772 • 136

upvoted a paper 2 months ago

On Data Engineering for Scaling LLM Terminal Capabilities

Paper • 2602.21193 • Published Feb 24 • 102

liked a dataset 2 months ago

zai-org/terminal-bench-2-verified

Updated Feb 27 • 2.37k • 71

upvoted a paper 2 months ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

upvoted a paper 3 months ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published Feb 4 • 37

liked a dataset 5 months ago

Danau5tin/terminal-tasks

Viewer • Updated Sep 12, 2025 • 331 • 21 • 7

upvoted a paper 6 months ago

LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Paper • 2307.13269 • Published Jul 25, 2023 • 34

authored 2 papers 6 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 132

Training Optimal Large Diffusion Language Models

Paper • 2510.03280 • Published Sep 28, 2025

upvoted 3 papers 7 months ago

upvoted a collection 7 months ago

cwm

Collection

Collection for Code World Model, an agentic coding model from FAIR. • 3 items • Updated Sep 24, 2025 • 20

upvoted a paper 10 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 48

updated a Space 11 months ago

README

💻

upvoted 2 papers 11 months ago

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published May 27, 2025 • 26

Fostering Video Reasoning via Next-Event Prediction

Paper • 2505.22457 • Published May 28, 2025 • 29

upvoted 2 papers 12 months ago

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26, 2025 • 24

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19, 2025 • 36

Longxu Dou

AI & ML interests

Recent Activity

Organizations

dreamerdeo's activity

README