22 4

Lancer

lancer001010

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

DFlash: Block Diffusion for Flash Speculative Decoding

upvoted a paper about 1 month ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

upvoted a paper about 1 month ago

Qwen3-ASR Technical Report

View all activity

Organizations

None yet

upvoted 5 papers about 1 month ago

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Paper • 2601.17058 • Published Jan 22 • 190

upvoted a collection about 1 month ago

Qwen3-ASR

Collection

4 items • Updated Jan 29 • 53

liked a dataset about 1 month ago

united-we-care/United-Syn-Med

Updated Nov 7, 2025 • 108 • 27

upvoted 2 papers about 2 months ago

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published Jan 22 • 20

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 91

upvoted 3 papers 2 months ago

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published Jan 8 • 169

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 229

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning

Paper • 2601.03872 • Published Jan 7 • 43

updated a collection 2 months ago

Diffusion

Collection

2 items • Updated Jan 4

upvoted a paper 3 months ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 152

upvoted an article 4 months ago

Article

Continuous batching from first principles

Nov 25, 2025

•

343

upvoted 2 articles 5 months ago

Article

Supercharge your OCR Pipelines with Open Models

Oct 21, 2025

•

307

Article

mem-agent: Equipping LLM Agents with Memory Using RL

Oct 9, 2025

•

upvoted an article 6 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9, 2025

•

101

upvoted a paper 6 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 233

liked a model 7 months ago

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • Updated Aug 26, 2025 • 24.8k • 1.01k

Lancer

AI & ML interests

Recent Activity

Organizations

lancer001010's activity

Continuous batching from first principles

Supercharge your OCR Pipelines with Open Models

mem-agent: Equipping LLM Agents with Memory Using RL

From GRPO to DAPO and GSPO: What, Why, and How