Pisets: A Robust Speech Recognition System for Lectures and Interviews Paper • 2601.18415 • Published 16 days ago • 31
TUN3D: Towards Real-World Scene Understanding from Unposed Images Paper • 2509.21388 • Published Sep 23, 2025 • 15
nablaNABLA: Neighborhood Adaptive Block-Level Attention Paper • 2507.13546 • Published Jul 17, 2025 • 125
T-LoRA: Single Image Diffusion Model Customization Without Overfitting Paper • 2507.05964 • Published Jul 8, 2025 • 120
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models Paper • 2506.06395 • Published Jun 5, 2025 • 133
Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback Paper • 2507.02321 • Published Jul 3, 2025 • 39
Listener-Rewarded Thinking in VLMs for Image Preferences Paper • 2506.22832 • Published Jun 28, 2025 • 23
Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models Paper • 2506.19103 • Published Jun 23, 2025 • 42
DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization Paper • 2505.20975 • Published May 27, 2025 • 36