Xufang Luo

daixufang

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

authored a paper 1 day ago

A Disease-Centric Vision-Language Foundation Model for Precision Oncology in Kidney Cancer

authored a paper 1 day ago

ΔL Normalization: Rethink Loss Aggregation in RLVR

Organizations

None yet

authored 5 papers 1 day ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 136

A Disease-Centric Vision-Language Foundation Model for Precision Oncology in Kidney Cancer

Paper • 2508.16569 • Published Aug 22, 2025 • 1

ΔL Normalization: Rethink Loss Aggregation in RLVR

Paper • 2509.07558 • Published Sep 9, 2025 • 7

Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Paper • 2510.18632 • Published Oct 21, 2025 • 22

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published 3 days ago • 29

upvoted a paper 1 day ago

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published 3 days ago • 29

upvoted a paper 4 months ago

Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Paper • 2510.18632 • Published Oct 21, 2025 • 22

upvoted 2 papers 6 months ago

ΔL Normalization: Rethink Loss Aggregation in RLVR

Paper • 2509.07558 • Published Sep 9, 2025 • 7

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

upvoted 2 papers 7 months ago

LeanK: Learnable K Cache Channel Pruning for Efficient Decoding

Paper • 2508.02215 • Published Aug 4, 2025 • 12

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 136

commented on a paper 7 months ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 136

authored 8 papers 7 months ago

LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression

Paper • 2310.06839 • Published Oct 10, 2023 • 4

LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models

Paper • 2404.01617 • Published Apr 2, 2024 • 8

Mitigate Position Bias in Large Language Models via Scaling a Single Dimension

Paper • 2406.02536 • Published Jun 4, 2024

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published Nov 7, 2024 • 39

Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key

Paper • 2501.09695 • Published Jan 16, 2025 • 1

On Memory Construction and Retrieval for Personalized Conversational Agents

Paper • 2502.05589 • Published Feb 8, 2025

VisRL: Intention-Driven Visual Perception via Reinforced Reasoning

Paper • 2503.07523 • Published Mar 10, 2025 • 1

Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs

Paper • 2505.12929 • Published May 19, 2025 • 3