UCLA Scalable Analytics Institute

university

https://web.cs.ucla.edu/~weiwang/

AI & ML interests

None defined yet.

Recent Activity

willhx

authored 2 papers 10 days ago

T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

Paper • 2605.02178 • Published May 4 • 10

HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness

Paper • 2606.12882 • Published 15 days ago • 13

willhx

submitted a paper to Daily Papers about 2 months ago

T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

Paper • 2605.02178 • Published May 4 • 10

willhx

authored a paper 4 months ago

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published Feb 25 • 26

UCLA-SCAI-Lab

updated a collection 4 months ago

ARLArena

Collection

3 items • Updated Feb 26 • 1

zhhhhahahaha

updated a model 4 months ago

UCLA-SCAI/Qwen3-VL-4B-Instruct-rft-sokoban_6x6

4B • Updated Feb 26 • 1

zhhhhahahaha

published a model 4 months ago

UCLA-SCAI/Qwen3-VL-4B-Instruct-rft-sokoban_6x6

4B • Updated Feb 26 • 1

UCLA-SCAI-Lab

updated a collection 5 months ago

ARLArena

Collection

3 items • Updated Feb 26 • 1

zhhhhahahaha

updated a model 5 months ago

UCLA-SCAI/Qwen3-4B-rft-webshop

4B • Updated Feb 2 • 7

zhhhhahahaha

published a model 5 months ago

UCLA-SCAI/Qwen3-4B-rft-webshop

4B • Updated Feb 2 • 7

zhhhhahahaha

updated a model 5 months ago

UCLA-SCAI/Qwen3-4B-rft-alfworld

4B • Updated Feb 2 • 3

zhhhhahahaha

published a model 5 months ago

UCLA-SCAI/Qwen3-4B-rft-alfworld

4B • Updated Feb 2 • 3

sxkdz

authored 4 papers almost 2 years ago

SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models

Paper • 2307.10635 • Published Jul 20, 2023 • 9

authored a paper about 2 years ago

Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks

Paper • 2310.00115 • Published Sep 29, 2023

Team members 6
