Bhalaji Nagarajan's picture

Bhalaji Nagarajan

bhalajin
Β·

AI & ML interests

None yet

Recent Activity

posted an update 3 days ago
###### CVPR2026 MetaFood Workshop Challenge Alert ###### 🍽️ Dishcovery: Mission II VLM Challenge We’re excited to share that the Dishcovery II Vision-Language Model Challenge is LIVE, as part of the 3rd MetaFood Workshop @ CVPR 2026. πŸ‘‰ Join the challenge: https://www.kaggle.com/competitions/dishcovery-mission-ii-cvpr-2026 πŸ‘‰ Dataset collection (Hugging Face): https://huggingface.co/datasets/jesusmolrdv/MTF25-VLM-Challenge-Dataset-Web https://huggingface.co/datasets/jesusmolrdv/MTF25-VLM-Challenge-Dataset-Synth πŸ‘‰ Workshop details: https://sites.google.com/view/cvpr-metafood-2026 πŸ” Task Build a Vision-Language Model that aligns food images with text under real-world conditions: 1. Multi-label retrieval β†’ identify relevant ingredients/components 2. Single-label retrieval β†’ select the best dense food description πŸ“¦ Dataset Highlights - 400K+ image–caption pairs - Mix of real, noisy, and synthetic data - Designed for fine-grained food understanding - Reflects real-world multimodal challenges βš”οΈ What makes this interesting? - Not just accuracy β†’ efficiency matters - Robustness to noise and domain shift - Fine-grained alignment between visual and semantic concepts - Benchmark for next-gen VLMs (CLIP, SigLIP, LLaVA-style models) πŸ“… Timeline (key milestones) May 1, 2026 β†’ Final predictions + method summary πŸš€ Why participate? - Benchmark your models on large-scale multimodal food data - Test robustness under realistic conditions - Gain visibility via a global leaderboard - Contribute to the growing Food Γ— Vision Γ— Language research space 🍳 The kitchen is heating up β€” looking forward to seeing what the community builds! #multimodal #computervision #vlm #deeplearning #datasets #kaggle #huggingface #ai #research Dataset Citation: https://huggingface.co/papers/2407.03463
View all activity

Organizations

Universitat de Barcelona's profile picture