Based on the plots above, **we deleted "bad" images** that were:

* Too Bright/Washed out (Avg Pixel Intensity > 245)
* Extreme Aspect Ratios (Too stretched or squashed, AR > 3.0)

### 3. Advanced Feature Engineering

After removing the garbage data, we engineered deeper visual features to assess image content:

* **Sharpness Score:** Used Laplacian Variance to find blurry photos.
* **Dominant Color (Hue):** Analyzed color clusters (e.g., Green for Salads vs. Red for Pizza).
* **Texture Complexity:** Calculated pixel standard deviation to distinguish smooth vs. complex foods.

![Feature Analysis](feature_analysis.png)

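A rough sketch of how the sharpness and texture features can be computed (NumPy-only; the function names are ours, and a real pipeline would more likely use OpenCV's `cv2.Laplacian` — the hue feature additionally needs an RGB-to-HSV conversion):

```python
import numpy as np

def sharpness_score(gray: np.ndarray) -> float:
    """Variance of the Laplacian: low values flag blurry photos."""
    g = gray.astype(np.float64)
    # 3x3 Laplacian applied to the interior pixels (center weight -4)
    lap = (-4.0 * g[1:-1, 1:-1]
           + g[:-2, 1:-1] + g[2:, 1:-1]
           + g[1:-1, :-2] + g[1:-1, 2:])
    return float(lap.var())

def texture_complexity(gray: np.ndarray) -> float:
    """Pixel standard deviation: smooth foods score low, complex ones high."""
    return float(gray.astype(np.float64).std())
```

A flat image scores 0 on both metrics, while a high-contrast checkerboard scores high on both.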
---

## ⚔️ Part 2: Model Comparison (CLIP vs. SigLIP vs. MetaCLIP)

To ensure the best search results, we ran a "Challenger" test between three leading multimodal models.

### The Contestants:
1. **Baseline:** OpenAI CLIP (`clip-vit-base-patch32`)
2. **Challenger:** Google SigLIP (`siglip-base-patch16-224`)
3. **Challenger:** Facebook MetaCLIP (`facebook/metaclip-b32-400m`)

### The Evaluation:
We compared them using **Silhouette Scores** (measuring how distinct the food clusters are) and a visual "Taste Test" (checking nearest neighbors for specific dishes).
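A minimal sketch of what such a scoring harness looks like, assuming scikit-learn is available (the `cluster_quality` helper and the toy blobs below are illustrative, not the project's actual code):

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

def cluster_quality(embeddings: np.ndarray, n_clusters: int, seed: int = 0) -> float:
    """Fit K-Means on the embeddings and score how distinct the clusters are.

    Silhouette ranges from -1 to 1; higher means tighter, better-separated clusters.
    """
    labels = KMeans(n_clusters=n_clusters, n_init=10,
                    random_state=seed).fit_predict(embeddings)
    return float(silhouette_score(embeddings, labels))

# Toy check: two well-separated blobs should score near 1.
rng = np.random.default_rng(0)
embeddings = np.vstack([rng.normal(0.0, 0.1, (50, 8)),
                        rng.normal(5.0, 0.1, (50, 8))])
score = cluster_quality(embeddings, n_clusters=2)
```

In the real comparison, `embeddings` would be each model's vectors for the food images, scored with `n_clusters=101`.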
We queried all three models with the same image to see which returned more accurate similar foods.
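Under the hood, such a "Taste Test" is a nearest-neighbor lookup by cosine similarity in embedding space. A self-contained sketch with random stand-in embeddings (not real model output):

```python
import numpy as np

def nearest_neighbors(query: np.ndarray, gallery: np.ndarray, k: int = 3) -> np.ndarray:
    """Return indices of the k gallery embeddings most cosine-similar to the query."""
    q = query / np.linalg.norm(query)
    g = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
    sims = g @ q                      # cosine similarity per gallery row
    return np.argsort(-sims)[:k]      # highest similarity first

# Stand-in "embeddings": the query is made nearly identical to gallery row 7.
rng = np.random.default_rng(42)
gallery = rng.normal(size=(100, 16))
query = gallery[7] + rng.normal(scale=0.01, size=16)
top3 = nearest_neighbors(query, gallery, k=3)
```

With real model embeddings, each returned index would be the file of a visually similar dish.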

![winner](siglip_winner.png)

---
## 🧠 Part 3: Embeddings & Clustering

Using the winning model (**SigLIP**), we applied dimensionality reduction to its 768-dimensional embedding vectors to visualize how the AI groups food concepts.

* **Algorithm:** K-Means Clustering (k=101 categories).
* **Visualization:**
  * **t-SNE:** To see local groupings (e.g., "Sushi" clusters separately from "Burgers").


---

 -> The app embeds it using SigLIP -> Finds the nearest 3 visual matches.
2. **Text-to-Image:** Type "Spicy Tacos" -> The app finds images matching that description.

## Note
The application currently runs the CLIP model even though SigLIP won the comparison: SigLIP was too big to run on the Hugging Face Spaces free tier.

### How to Run Locally
1. **Clone the repository:**
```bash
git clone https://huggingface.co/spaces/YOUR_USERNAME/Food-Match
cd Food-Match
```
2. **Install dependencies:**