--- title: Multimodal Recipe RAG emoji: 🍳 colorFrom: red colorTo: pink sdk: gradio sdk_version: 4.44.1 app_file: app.py pinned: false license: apache-2.0 tags: - multimodal - rag - recipe-search - nvidia-nemotron - computer-vision - food --- # 🍳 Multimodal Recipe RAG System Search **500 recipes** using text queries across images and documents with three powerful search modes: - πŸ–ΌοΈ **Image**: Visual similarity search (find similar-looking dishes) - πŸ“ **Text**: Semantic text search (find by ingredients, instructions, reviews) - 🎨 **Image+Text**: Combined multimodal search (best of both worlds) ## πŸ”§ Technology **Models:** - Embedding: [nvidia/llama-nemotron-embed-vl-1b-v2](https://huggingface.co/nvidia/llama-nemotron-embed-vl-1b-v2) - Reranking: [nvidia/llama-nemotron-rerank-vl-1b-v2](https://huggingface.co/nvidia/llama-nemotron-rerank-vl-1b-v2) **Dataset:** [TurkishCodeMan/recipe-synthetic-images-10k](https://huggingface.co/datasets/TurkishCodeMan/recipe-synthetic-images-10k) **Embeddings:** - Image: 2048-dim visual embeddings - Text: 2048-dim semantic embeddings - Image+Text: 2048-dim multimodal embeddings ## πŸ’‘ Usage 1. Enter your search query (e.g., "chocolate cake", "healthy breakfast") 2. Choose search modality (Image/Text/Image+Text) 3. Optionally enable reranking for better results 4. Click Search! ## 🎯 Example Queries **Image Mode:** - "recipes with steak" - "chocolate desserts" - "colorful salads" **Text Mode:** - "healthy breakfast ideas" - "vegetarian meals with pasta" - "quick dinner under 30 minutes" **Image+Text Mode (Recommended):** - "creamy pasta dishes" - "spicy chicken recipes" - "fresh summer salads" ## ⚑ Performance - Average search time: ~1-2 seconds - With reranking: ~2-3 seconds - GPU recommended for optimal performance ## πŸ—οΈ Architecture Query β†’ Embedding Model β†’ Similarity Search β†’ [Optional Reranking] β†’ Top Results ## πŸ“Š Dataset 500 recipes from TurkishCodeMan/recipe-synthetic-images-10k: - Recipe images (PNG) - Full recipe text (markdown) - Ingredients, steps, nutrition info - User reviews ## πŸ™ Credits Built with: - NVIDIA Nemotron models - Hugging Face Transformers - Gradio for UI # Yemek temalΔ± colorFrom: red colorTo: pink # Alternatifler colorFrom: yellow colorTo: red colorFrom: purple colorTo: pink colorFrom: blue colorTo: indigo colorFrom: green colorTo: blue