TAUR Lab at UT Austin

university

https://taur.cs.utexas.edu/

AI & ML interests

None defined yet.

Recent Activity

wadhma submitted a paper about 2 months ago

CREATE: Testing LLMs for Associative Creativity

wadhma authored a paper about 2 months ago

CREATE: Testing LLMs for Associative Creativity

wadhma authored a paper about 2 months ago

EvalAgent: Discovering Implicit Evaluation Criteria from the Web

View all activity

submitted a paper to Daily Papers about 2 months ago

CREATE: Testing LLMs for Associative Creativity

Paper • 2603.09970 • Published Mar 10 • 15

authored 3 papers about 2 months ago

CREATE: Testing LLMs for Associative Creativity

Paper • 2603.09970 • Published Mar 10 • 15

EvalAgent: Discovering Implicit Evaluation Criteria from the Web

Paper • 2504.15219 • Published Apr 21, 2025 • 1

QUDsim: Quantifying Discourse Similarities in LLM-Generated Text

Paper • 2504.09373 • Published Apr 12, 2025

submitted a paper to Daily Papers 3 months ago

Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents

Paper • 2602.16699 • Published Feb 18 • 16

authored a paper 5 months ago

SkillFactory: Self-Distillation For Learning Cognitive Behaviors

Paper • 2512.04072 • Published Dec 3, 2025 • 5

authored a paper 5 months ago

SkillFactory: Self-Distillation For Learning Cognitive Behaviors

Paper • 2512.04072 • Published Dec 3, 2025 • 5

authored a paper 5 months ago

SkillFactory: Self-Distillation For Learning Cognitive Behaviors

Paper • 2512.04072 • Published Dec 3, 2025 • 5

authored a paper 11 months ago

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

Paper • 2501.05414 • Published Jan 9, 2025 • 2

authored a paper 12 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

authored 4 papers 12 months ago

CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering

Paper • 2305.14869 • Published May 24, 2023 • 1

CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning

Paper • 2401.07286 • Published Jan 14, 2024

IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce

Paper • 2406.10173 • Published Jun 14, 2024

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

authored a paper 12 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

authored a paper 12 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

authored a paper 12 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

authored a paper 12 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

authored a paper 12 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

authored a paper 12 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16