University of Texas at Austin

university

Verified

https://www.utexas.edu

AI & ML interests

None defined yet.

Recent Activity

acnagle submitted a paper about 2 months ago

TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning

jehuhuhuhu authored a paper about 2 months ago

DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

jehuhuhuhu authored a paper about 2 months ago

Mini-BEHAVIOR: A Procedurally Generated Benchmark for Long-horizon Decision-Making in Embodied AI

View all activity

Papers

TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning

Training-free Latent Inter-Frame Pruning with Attention Recovery

View all Papers

submitted a paper to Daily Papers about 2 months ago

TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning

Paper • 2603.12529 • Published Mar 13 • 19

authored a paper about 2 months ago

TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning

Paper • 2603.12529 • Published Mar 13 • 19

submitted 2 papers to Daily Papers about 2 months ago

Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

Paper • 2603.09221 • Published Mar 10

nabla-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space

Paper • 2603.04948 • Published Mar 5 • 2

submitted a paper to Daily Papers 3 months ago

EntRGi: Entropy Aware Reward Guidance for Diffusion Language Models

Paper • 2602.05000 • Published Feb 4 • 2

authored a paper 3 months ago

Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts

Paper • 2601.17111 • Published Jan 23 • 5

submitted a paper to Daily Papers 3 months ago

Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts

Paper • 2601.17111 • Published Jan 23 • 5

authored a paper 5 months ago

Mitigating Intra- and Inter-modal Forgetting in Continual Learning of Unified Multimodal Models

Paper • 2512.03125 • Published Dec 2, 2025 • 3

authored a paper 6 months ago

Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms

Paper • 2510.13913 • Published Oct 15, 2025 • 4

authored 2 papers 7 months ago

EgoVLM: Policy Optimization for Egocentric Video Understanding

Paper • 2506.03097 • Published Jun 3, 2025

Hard2Verify: A Step-Level Verification Benchmark for Open-Ended Frontier Math

Paper • 2510.13744 • Published Oct 15, 2025 • 6

authored a paper 8 months ago

SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents

Paper • 2509.06283 • Published Sep 8, 2025 • 17

authored a paper 11 months ago

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

Paper • 2501.05414 • Published Jan 9, 2025 • 2

authored a paper 12 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

authored 3 papers about 1 year ago

CodeUpdateArena: Benchmarking Knowledge Editing on API Updates

Paper • 2407.06249 • Published Jul 8, 2024

SFR-RAG: Towards Contextually Faithful LLMs

Paper • 2409.09916 • Published Sep 16, 2024 • 1

FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"

Paper • 2410.03727 • Published Sep 30, 2024 • 2

authored 2 papers about 1 year ago

Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation

Paper • 2310.03780 • Published Oct 5, 2023

Protecting Human Cognition in the Age of AI

Paper • 2502.12447 • Published Feb 18, 2025 • 7

authored a paper over 1 year ago

PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models

Paper • 2502.01584 • Published Feb 3, 2025 • 9