3 7

Johannes Kirmayr

johanneskirmayr

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

submitted a paper 3 months ago

"What Are You Doing?": Effects of Intermediate Feedback from Agentic LLM In-Car Assistants During Multi-Step Processing

authored a paper 3 months ago

"What Are You Doing?": Effects of Intermediate Feedback from Agentic LLM In-Car Assistants During Multi-Step Processing

View all activity

Organizations

upvoted a paper 2 months ago

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Paper • 2601.22027 • Published Jan 29 • 86

submitted a paper to Daily Papers 3 months ago

"What Are You Doing?": Effects of Intermediate Feedback from Agentic LLM In-Car Assistants During Multi-Step Processing

Paper • 2602.15569 • Published Feb 17 • 13

authored a paper 3 months ago

"What Are You Doing?": Effects of Intermediate Feedback from Agentic LLM In-Car Assistants During Multi-Step Processing

Paper • 2602.15569 • Published Feb 17 • 13

upvoted a paper 3 months ago

"What Are You Doing?": Effects of Intermediate Feedback from Agentic LLM In-Car Assistants During Multi-Step Processing

Paper • 2602.15569 • Published Feb 17 • 13

New activity in johanneskirmayr/car-bench-dataset 4 months ago

[bot] Conversion to Parquet

#1 opened 4 months ago by

parquet-converter

updated a dataset 4 months ago

johanneskirmayr/car-bench-dataset

Viewer • Updated Feb 12 • 1.76M • 3.32k

authored a paper 4 months ago

CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding

Paper • 2501.09645 • Published Jan 16, 2025 • 2

upvoted a paper 4 months ago

CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding

Paper • 2501.09645 • Published Jan 16, 2025 • 2

published a dataset 4 months ago

johanneskirmayr/car-bench-dataset

Viewer • Updated Feb 12 • 1.76M • 3.32k

upvoted 4 papers 4 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 269

Reinforced Attention Learning

Paper • 2602.04884 • Published Feb 4 • 30

Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention

Paper • 2602.03338 • Published Feb 3 • 26

A Unified Framework for Rethinking Policy Divergence Measures in GRPO

Paper • 2602.05494 • Published Feb 5 • 2

submitted a paper to Daily Papers 4 months ago

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Paper • 2601.22027 • Published Jan 29 • 86

authored a paper 4 months ago

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty

Paper • 2601.22027 • Published Jan 29 • 86

Johannes Kirmayr

AI & ML interests

Recent Activity

Organizations

johanneskirmayr's activity

[bot] Conversion to Parquet