AnimalCLAP: Taxonomy-Aware Language-Audio Pretraining for Species Recognition and Trait Inference Paper • 2603.22053 • Published 1 day ago • 3
ExposeAnyone: Personalized Audio-to-Expression Diffusion Models Are Robust Zero-Shot Face Forgery Detectors Paper • 2601.02359 • Published Jan 5 • 5
AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs Paper • 2511.20515 • Published Nov 25, 2025 • 5
AgroBench: Vision-Language Model Benchmark in Agriculture Paper • 2507.20519 • Published Jul 28, 2025 • 8
Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention Paper • 2509.09116 • Published Sep 11, 2025