File size: 2,417 Bytes
e53134e
3f8c153
 
 
 
e53134e
3f8c153
e53134e
 
3f8c153
 
 
 
 
 
 
 
e53134e
 
3f8c153
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
---
title: Multimodal Recipe RAG
emoji: 🍳
colorFrom: red
colorTo: pink
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: false
license: apache-2.0
tags:
  - multimodal
  - rag
  - recipe-search
  - nvidia-nemotron
  - computer-vision
  - food
---

# 🍳 Multimodal Recipe RAG System

Search **500 recipes** using text queries across images and documents with three powerful search modes:

- 🖼️ **Image**: Visual similarity search (find similar-looking dishes)
- 📝 **Text**: Semantic text search (find by ingredients, instructions, reviews)
- 🎨 **Image+Text**: Combined multimodal search (best of both worlds)

## 🔧 Technology

**Models:**
- Embedding: [nvidia/llama-nemotron-embed-vl-1b-v2](https://huggingface.co/nvidia/llama-nemotron-embed-vl-1b-v2)
- Reranking: [nvidia/llama-nemotron-rerank-vl-1b-v2](https://huggingface.co/nvidia/llama-nemotron-rerank-vl-1b-v2)

**Dataset:** [TurkishCodeMan/recipe-synthetic-images-10k](https://huggingface.co/datasets/TurkishCodeMan/recipe-synthetic-images-10k)

**Embeddings:**
- Image: 2048-dim visual embeddings
- Text: 2048-dim semantic embeddings  
- Image+Text: 2048-dim multimodal embeddings

## 💡 Usage

1. Enter your search query (e.g., "chocolate cake", "healthy breakfast")
2. Choose search modality (Image/Text/Image+Text)
3. Optionally enable reranking for better results
4. Click Search!

## 🎯 Example Queries

**Image Mode:**
- "recipes with steak"
- "chocolate desserts"
- "colorful salads"

**Text Mode:**
- "healthy breakfast ideas"
- "vegetarian meals with pasta"
- "quick dinner under 30 minutes"

**Image+Text Mode (Recommended):**
- "creamy pasta dishes"
- "spicy chicken recipes"
- "fresh summer salads"

## ⚡ Performance

- Average search time: ~1-2 seconds
- With reranking: ~2-3 seconds
- GPU recommended for optimal performance

## 🏗️ Architecture
Query → Embedding Model → Similarity Search → [Optional Reranking] → Top Results



## 📊 Dataset

500 recipes from TurkishCodeMan/recipe-synthetic-images-10k:
- Recipe images (PNG)
- Full recipe text (markdown)
- Ingredients, steps, nutrition info
- User reviews

## 🙏 Credits

Built with:
- NVIDIA Nemotron models
- Hugging Face Transformers
- Gradio for UI

# Yemek temalı
colorFrom: red
colorTo: pink

# Alternatifler
colorFrom: yellow
colorTo: red

colorFrom: purple
colorTo: pink

colorFrom: blue
colorTo: indigo

colorFrom: green
colorTo: blue