Bowen Lv

extreme1228

13 12 2

AI & ML interests

None yet

Recent Activity

new activity 2 days ago

extreme1228/ScaleCUA:Add task categories, paper link, and GitHub repository

new activity 2 days ago

extreme1228/ScaleCUA-qwen3-vl-scienceboard-sft:Link model to Hugging Face paper page

new activity 2 days ago

extreme1228/ScaleCUA-qwen3-vl-osworld-sft:Link model card to paper

View all activity

Organizations

None yet

upvoted a paper 2 days ago

SCALECUA: Scaling Computer Use Agents with Verifiable Task Synthesis and Efficient Online RL

Paper • 2607.11185 • Published 9 days ago • 1

upvoted a paper about 2 months ago

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Paper • 2605.31584 • Published May 29 • 43

upvoted a collection 3 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.73k

upvoted a paper 4 months ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 63

upvoted a paper 6 months ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 48

upvoted a paper 8 months ago

OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation

Paper • 2511.20211 • Published Nov 25, 2025 • 12

upvoted a paper 9 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 69

upvoted 2 articles 12 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante

•

Aug 5, 2025

• 514

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.16k

upvoted an article about 1 year ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 378

upvoted a paper about 1 year ago

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19, 2025 • 83

upvoted an article about 1 year ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

NormalUhr

•

Feb 11, 2025

• 130

Bowen Lv

AI & ML interests

Recent Activity

Organizations

extreme1228's activity

Welcome GPT OSS, the new open-source model family from OpenAI!

Mixture of Experts Explained

KV Caching Explained: Optimizing Transformer Inference Efficiency

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment