Luyi

lulululuyi

CodeMasterLu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

upvoted a paper 4 months ago

AI Can Learn Scientific Taste

updated a collection 4 months ago

R-HORIZON Models

Organizations

None yet

upvoted a paper about 1 month ago

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Paper • 2605.25874 • Published May 25 • 104

upvoted a paper 4 months ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 431

upvoted a collection 4 months ago

R-HORIZON Models

Collection

models of R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth? • 5 items • Updated Mar 10 • 1

upvoted a paper 4 months ago

Advancing Block Diffusion Language Models for Test-Time Scaling

Paper • 2602.09555 • Published Feb 10 • 4

upvoted 4 papers 5 months ago

OPE: Overcoming Information Saturation in Parallel Thinking via Outline-Guided Path Exploration

Paper • 2602.08344 • Published Feb 9 • 5

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published Feb 5 • 61

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 92

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published Jan 20 • 49

upvoted 2 papers 6 months ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

Paper • 2601.07779 • Published Jan 12 • 28

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published Dec 27, 2025 • 51

upvoted 4 papers 8 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 242

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 73

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 99

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 85

upvoted a collection 8 months ago

R-HORIZON

Collection

The training and evaluation datasets for Paper "How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?" • 6 items • Updated Oct 22, 2025 • 10

upvoted 3 papers 9 months ago

AutoPR: Let's Automate Your Academic Promotion!

Paper • 2510.09558 • Published Oct 10, 2025 • 53

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Paper • 2510.00499 • Published Oct 1, 2025 • 23

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published Oct 9, 2025 • 29

upvoted a collection 9 months ago

R-HORIZON Datasets

Collection

Training and evaluation datasets of R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth? • 6 items • Updated Mar 9 • 6

upvoted a paper 10 months ago

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111
