Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models Paper • 2604.21193 • Published 6 days ago • 2
Explainable Disentangled Representation Learning for Generalizable Authorship Attribution in the Era of Generative AI Paper • 2604.21300 • Published 6 days ago • 2
ViT-AdaLA: Adapting Vision Transformers with Linear Attention Paper • 2603.16063 • Published Mar 17 • 2
Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams Paper • 2603.07392 • Published Mar 8 • 18
Agentic Planning with Reasoning for Image Styling via Offline RL Paper • 2603.07148 • Published Mar 7 • 3
InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions Paper • 2603.03646 • Published Mar 4 • 8
Blind to the Human Touch: Overlap Bias in LLM-Based Summary Evaluation Paper • 2602.07673 • Published Feb 7 • 1
Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation Paper • 2602.09319 • Published Feb 10 • 1
Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling Paper • 2602.09084 • Published Feb 9 • 30
Segment Length Matters: A Study of Segment Lengths on Audio Fingerprinting Performance Paper • 2601.17690 • Published Jan 25 • 1
PRISM: Learning Design Knowledge from Data for Stylistic Design Improvement Paper • 2601.11747 • Published Jan 16 • 1
MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces Paper • 2510.08783 • Published Oct 9, 2025 • 5
Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs Paper • 2510.07429 • Published Oct 8, 2025 • 4
The Photographer Eye: Teaching Multimodal Large Language Models to See and Critique like Photographers Paper • 2509.18582 • Published Sep 23, 2025 • 4
CommonForms: A Large, Diverse Dataset for Form Field Detection Paper • 2509.16506 • Published Sep 20, 2025 • 22
mSCoRe: a Multilingual and Scalable Benchmark for Skill-based Commonsense Reasoning Paper • 2508.10137 • Published Aug 13, 2025 • 2
Lizard: An Efficient Linearization Framework for Large Language Models Paper • 2507.09025 • Published Jul 11, 2025 • 19
A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality Paper • 2507.07202 • Published Jul 9, 2025 • 25