Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling Paper • 2602.09084 • Published 8 days ago • 27
Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation Paper • 2602.09319 • Published 8 days ago • 1
Blind to the Human Touch: Overlap Bias in LLM-Based Summary Evaluation Paper • 2602.07673 • Published 10 days ago • 1
Segment Length Matters: A Study of Segment Lengths on Audio Fingerprinting Performance Paper • 2601.17690 • Published 24 days ago • 1
PRISM: Learning Design Knowledge from Data for Stylistic Design Improvement Paper • 2601.11747 • Published Jan 16 • 1
StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos Paper • 2512.01707 • Published Dec 1, 2025 • 8
Structured Uncertainty guided Clarification for LLM Agents Paper • 2511.08798 • Published Nov 11, 2025
Charts Are Not Images: On the Challenges of Scientific Chart Editing Paper • 2512.00752 • Published Nov 30, 2025
Iterative Critique-Refine Framework for Enhancing LLM Personalization Paper • 2510.24469 • Published Oct 28, 2025
MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces Paper • 2510.08783 • Published Oct 9, 2025 • 5
Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs Paper • 2510.07429 • Published Oct 8, 2025 • 4
Optimizing Data Delivery: Insights from User Preferences on Visuals, Tables, and Text Paper • 2411.07451 • Published Nov 12, 2024
MODS: Moderating a Mixture of Document Speakers to Summarize Debatable Queries in Document Collections Paper • 2502.00322 • Published Feb 1, 2025
A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations Paper • 2505.14106 • Published May 20, 2025