	---
	title: Food-101 Classifier
	emoji: 🍔
	colorFrom: yellow
	colorTo: red
	sdk: gradio
	app_file: scripts/app.py
	---

	![Food101 Classification Banner](assets/banner.png)

	[![Python](https://img.shields.io/badge/Python-3.10-blue?logo=python)](https://www.python.org/)[![PyTorch](https://img.shields.io/badge/PyTorch-2.7.1-EE4C2C?logo=pytorch)](https://pytorch.org/)![Made with ML](https://img.shields.io/badge/Made%20with-ML-blueviolet?logo=openai)[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)

	# 🍽️ Food-101 Image Classification with EfficientNetV2-S and PyTorch Lightning

	This repository contains the code for an end-to-end deep learning project that classifies images into the 101 categories of the challenging Food-101 dataset. The project demonstrates a systematic approach to model selection, fine-tuning, and hyperparameter optimization, reaching a final validation accuracy of 85.4% when trained on the full dataset.

	The entire training and evaluation pipeline is built using modern, reproducible practices with PyTorch Lightning.

	---

	## 📑 Table of Contents

	- [🍽️ Food-101 Image Classification with EfficientNetV2-S and PyTorch Lightning](#️-food-101-image-classification-with-efficientnetv2-s-and-pytorch-lightning)
	  - [📑 Table of Contents](#-table-of-contents)
	  - [🎯 Project Highlights](#-project-highlights)
	  - [💡 Real-World Applications](#-real-world-applications)
	  - [🧫 Experimental Results](#-experimental-results)
	  - [📊 Final Results](#-final-results)
	  - [🔬 Performance Analysis and Error Diagnosis](#-performance-analysis-and-error-diagnosis)
	    - [🍤 Lowest-Performing Classes](#-lowest-performing-classes)
	    - [Root Cause Analysis of Misclassifications](#root-cause-analysis-of-misclassifications)
	    - [Future Work](#future-work)
	  - [🧪 Methodology and Experimental Process](#-methodology-and-experimental-process)
	  - [📁 Repository Structure](#-repository-structure)
	  - [🚀 Getting Started](#-getting-started)
	    - [Prerequisites](#prerequisites)
	    - [Installation](#installation)
	    - [Usage](#usage)
	  - [💻 Technologies Used](#-technologies-used)

	---

	## 🎯 Project Highlights

	- High-Performance Model ⚡: Uses a pre-trained `EfficientNetV2-S`, selected for its balance of accuracy and computational efficiency, which also makes it a candidate for edge deployment.
	- Reproducible Pipeline 🔄: Encapsulates the entire workflow, from data loading to training and evaluation, in a `LightningModule` and a `DataModule`.
	- Efficient Experimentation ⏱️: Works around hardware limitations by training on dataset subsets (50% of the data) for rapid prototyping.
	- Advanced Fine-Tuning 🛠️: Unfreezes the final three blocks of the feature extractor and trains with the `Adam` optimizer and a `CosineAnnealingLR` scheduler for stable convergence.
	- In-Depth Analysis 🔎: Calculates and logs per-class F1-scores and accuracies in addition to overall accuracy, exposing the model's strengths and weaknesses class by class.
	- Live Deployment 📺: The final model is deployed as an interactive Gradio web application on Hugging Face Spaces.
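
The fine-tuning setup from the highlights can be sketched roughly as follows. This is a minimal illustration, not the code in `scripts/models.py`: the small `nn.Sequential` is a stand-in for the EfficientNetV2-S feature extractor (torchvision exposes that model's `features` as an `nn.Sequential` of blocks as well), and the helper name `unfreeze_last_blocks`, the layer sizes, and `T_max=10` are assumptions made for the example. Only `lr=1e-4` comes from the results table.

```python
import torch
from torch import nn


def unfreeze_last_blocks(features: nn.Sequential, num_blocks: int = 3) -> None:
    """Freeze every block, then re-enable gradients for the last `num_blocks`."""
    for param in features.parameters():
        param.requires_grad = False
    for block in list(features.children())[-num_blocks:]:
        for param in block.parameters():
            param.requires_grad = True


# Stand-in feature extractor with 8 blocks (hypothetical sizes, not EfficientNetV2-S).
features = nn.Sequential(
    *[nn.Conv2d(3 if i == 0 else 8, 8, kernel_size=3, padding=1) for i in range(8)]
)
classifier = nn.Linear(8, 101)  # one output per Food-101 class

unfreeze_last_blocks(features, num_blocks=3)

# Only parameters that still require gradients are handed to the optimizer.
trainable = [p for p in features.parameters() if p.requires_grad]
trainable += list(classifier.parameters())
optimizer = torch.optim.Adam(trainable, lr=1e-4)
# T_max=10 is an assumed schedule length, not a value from this project.
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=10)

# Count blocks in which no parameter requires gradients any more.
frozen_blocks = sum(
    1 for block in features if not any(p.requires_grad for p in block.parameters())
)
```

Passing only the trainable parameters to the optimizer keeps the frozen blocks out of its state, which saves memory when most of the backbone stays fixed.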

	---

	## 💡 Real-World Applications

	Beyond being a technical challenge, this food classification model serves as a foundation for numerous real-world applications in health, hospitality, and smart home technology.

	- Health and Nutrition Tracking
	  - Automated Calorie Counting: Users can snap a photo of their meal, and an app can automatically identify each food item to provide an instant estimate of calories, macros, and other nutritional information.
	  - Dietary Management: Assists individuals with allergies or specific dietary needs (e.g., diabetes, gluten-free) by helping them identify and log their food intake accurately.
	- Restaurant and Hospitality Tech
	  - Self-Checkout Systems: In cafeterias or quick-service restaurants, a camera-based system could identify all items on a tray to automate the billing process, reducing queues and improving efficiency.
	  - Interactive Menus: Allow diners to point their phone at a dish to get more information, such as ingredients, allergen warnings, or customer reviews.
	- Smart Home and Appliances
	  - Smart Refrigerators: A fridge equipped with a camera could identify leftover dishes, suggest recipes based on available food, and help track food spoilage to reduce waste.

	---

	## 🧫 Experimental Results

	This project followed an iterative approach. The table below summarizes the key experiments and their outcomes, showing the progression from the initial baseline to the final model.

	| Model | Training Strategy | Data % | Key Hyperparameters | Final Validation Accuracy |
	| :--- | :--- | :--- | :--- | :--- |
	| `EfficientNet-B2` | Simple fine-tune (last block) | 50% | `lr=1e-4` | ~64% |
	| `EfficientNet-B2` | Unfreeze last 3 blocks | 50% | `lr=1e-3` | 82.0% |
	| `EfficientNet-B2` | Two-stage fine-tuning | 50% | `lr1=1e-3`, `lr2=1e-5` | Performance degraded |
	| `EfficientNetV2-S` | Unfreeze last 3 blocks | 50% | `lr=1e-4` (tuned) | 82.4% |
	| `EfficientNetV2-S` | Unfreeze last 3 blocks + more advanced transforms | 50% | `lr=1e-4` (tuned) | ~82.4% (essentially unchanged) |
	| `EfficientNetV2-S` | Unfreeze last 3 blocks | 100% | `lr=1e-4` (tuned) | 85.4% |
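
The "Data %" column refers to training on a subset of the dataset. One way such a subset could be drawn while keeping the classes balanced is sketched below. The project's actual subsetting logic is not shown in this README, so the function name `stratified_subset`, the seed, and the toy labels are all assumptions.

```python
import random
from collections import defaultdict


def stratified_subset(labels, fraction, seed=42):
    """Return sorted sample indices keeping roughly `fraction` of each class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    chosen = []
    for indices in by_class.values():
        # Keep at least one sample per class so no class disappears.
        keep = max(1, round(len(indices) * fraction))
        chosen.extend(rng.sample(indices, keep))
    return sorted(chosen)


# Toy labels: 3 classes with 10 samples each (Food-101 has 101 classes).
labels = [c for c in range(3) for _ in range(10)]
subset = stratified_subset(labels, fraction=0.5)
```

The resulting indices could then be wrapped with `torch.utils.data.Subset` to build the reduced training set.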

	---

	## 📊 Final Results

	After systematically iterating on model architecture and hyperparameters, the final model achieved the following performance on the full Food-101 validation set:

	| Metric | Score |
	| :------------------ | :------ |
	| Validation Accuracy | 85.4% |

	![Confusion Matrix Plot](assets/confusion_matrix.png)
	The plot above shows the confusion matrix on the test set. The image file is also available in the `assets` directory.

	This model is deployed and accessible as an interactive Gradio web application on Hugging Face Spaces.

	![Gradio](assets/gradio.png)

	Try the [Food101 Gradio Demo](https://huggingface.co/spaces/DanielKiani/Food101-Classification).
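
A demo like this typically shows the most likely classes with their confidences. Gradio's `Label` component accepts a dictionary mapping class names to confidence values, so the post-processing step might look like the sketch below. The contents of `scripts/app.py` are not shown here, so the function name `top_k_predictions` and the example logits are hypothetical.

```python
import math


def top_k_predictions(logits, class_names, k=3):
    """Convert raw logits into a {class_name: probability} dict of the k best classes."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]  # subtract the max for numerical stability
    total = sum(exps)
    probs = [e / total for e in exps]
    ranked = sorted(zip(class_names, probs), key=lambda pair: pair[1], reverse=True)
    return dict(ranked[:k])


# Made-up logits for four classes, for illustration only.
example = top_k_predictions(
    [2.0, 0.5, 1.0, -1.0],
    ["apple_pie", "falafel", "ravioli", "sushi"],
    k=3,
)
```

Returning a plain dictionary keeps the model code independent of the UI layer, so the same function can be reused in tests or scripts.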

	---

	## 🔬 Performance Analysis and Error Diagnosis

	Beyond the aggregate accuracy, a per-class analysis was conducted to identify the model's specific limitations and diagnose the root causes of misclassifications.

	The model performed exceptionally well on many classes but struggled with a distinct set of categories, primarily due to visual ambiguity and high variability in appearance.

	### 🍤 Lowest-Performing Classes

	The following five classes had the lowest validation accuracy:

	| Class Name | Index | Validation Accuracy |
	| :------------------ | :---- | :------------------ |
	| `shrimp_and_grits` | 93 | 44.0% |
	| `ravioli` | 77 | 59.2% |
	| `apple_pie` | 0 | 61.6% |
	| `huevos_rancheros` | 56 | 63.2% |
	| `falafel` | 36 | 63.6% |
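
Per-class figures like these can be derived from a confusion matrix. The sketch below takes per-class accuracy to mean the fraction of a class's samples that were predicted correctly (recall), which is an assumption about how the table was computed. The function name `per_class_metrics` and the toy counts are hypothetical and are not the project's results.

```python
def per_class_metrics(confusion):
    """Compute per-class accuracy (recall) and F1 from a square confusion matrix.

    confusion[i][j] counts samples whose true class is i and predicted class is j.
    """
    n = len(confusion)
    metrics = []
    for c in range(n):
        true_pos = confusion[c][c]
        actual = sum(confusion[c])                           # row sum: samples of class c
        predicted = sum(confusion[r][c] for r in range(n))   # column sum: predictions of c
        recall = true_pos / actual if actual else 0.0
        precision = true_pos / predicted if predicted else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics.append({"index": c, "accuracy": recall, "f1": f1})
    return metrics


# Toy 3-class matrix with made-up counts.
confusion = [
    [8, 1, 1],
    [2, 6, 2],
    [0, 1, 9],
]
worst_first = sorted(per_class_metrics(confusion), key=lambda m: m["accuracy"])
```

Sorting by accuracy puts the weakest classes first, which is how a "lowest-performing" list can be read off directly.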

	### Root Cause Analysis of Misclassifications

	- High Intra-Class Variation: The model struggled with dishes that have no single, consistent appearance.
	- Fine-Grained Confusion: Errors occurred between visually similar classes like `ravioli` vs. `dumplings`.
	- Ambiguous Features: Foods like `falafel` resemble many small fried dishes, making classification tricky.
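
Fine-grained confusions such as these show up as large off-diagonal entries in the confusion matrix. A small helper for listing them might look like the following sketch. The name `most_confused_pairs` and the counts are invented for illustration and do not reflect measured results.

```python
def most_confused_pairs(confusion, class_names, top=3):
    """List the largest off-diagonal entries as (true, predicted, count) tuples."""
    pairs = [
        (class_names[t], class_names[p], count)
        for t, row in enumerate(confusion)
        for p, count in enumerate(row)
        if t != p and count > 0
    ]
    return sorted(pairs, key=lambda item: item[2], reverse=True)[:top]


# Toy counts for illustration only; rows are true classes, columns are predictions.
names = ["ravioli", "dumplings", "falafel"]
confusion = [
    [40, 9, 1],
    [6, 43, 1],
    [2, 3, 45],
]
pairs = most_confused_pairs(confusion, names, top=2)
```

Keeping the direction of each pair (true class first, predicted class second) matters, because confusion between two classes is often asymmetric.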

	### Future Work

	Improvements could include:

	- Detailed confusion matrix analysis 🔍
	- More aggressive data augmentation 📈
	- Larger architectures for fine-grained recognition 🏋️
	- Training for longer 🏋️

	---

	## 🧪 Methodology and Experimental Process

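	The project proceeded in five steps:

	1. Baseline Establishment 🏁 – EfficientNet-B2 with only the last block fine-tuned reached ~64% validation accuracy on 50% of the data.
	2. Architecture Selection 🏗️ – EfficientNetV2-S was chosen for its balance of accuracy and size.
	3. Transforms Selection 🎨 – TrivialAugmentWide + RandomResizedCrop, RandAugment, etc. were tried; the more advanced transforms left validation accuracy at roughly 82.4%.
	4. Fine-Tuning Strategy 🔧 – The final 3 blocks of the feature extractor were unfrozen for training.
	5. Final Model Training 🏆 – Full dataset, Adam, CosineAnnealingLR, EarlyStopping → 85.4% validation accuracy.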
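
To make the learning-rate schedule in the final step concrete, the closed-form cosine annealing curve is $\eta_t = \eta_{\min} + \tfrac{1}{2}(\eta_{\max} - \eta_{\min})\left(1 + \cos(\pi t / T_{\max})\right)$. The sketch below evaluates it in plain Python. `lr_max=1e-4` matches the tuned learning rate from the results table, while `t_max=10` and `lr_min=0.0` are assumed values, and `cosine_annealing_lr` is a hypothetical helper rather than part of the project code.

```python
import math


def cosine_annealing_lr(step, t_max, lr_max, lr_min=0.0):
    """Closed-form cosine annealing: lr_max at step 0, decaying to lr_min at step t_max."""
    return lr_min + 0.5 * (lr_max - lr_min) * (1 + math.cos(math.pi * step / t_max))


# One value per epoch for an assumed 10-epoch schedule.
schedule = [cosine_annealing_lr(epoch, t_max=10, lr_max=1e-4) for epoch in range(11)]
```

The rate falls slowly at first, fastest around the midpoint, and slowly again near the end, which lets the unfrozen blocks settle with small updates in the last epochs.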

	---

	## 📁 Repository Structure

	```bash
	food-101-classification/
	├── data/
	├── logs/
	├── scripts/
	│   ├── main.py
	│   ├── models.py
	│   ├── class_names.py
	│   ├── app.py
	│   └── prepare_data.py
	├── .gitignore
	├── requirements.txt
	└── README.md
	```

	---

	## 🚀 Getting Started

	### Prerequisites

	- Python 3.10+ 🐍
	- PyTorch 🔥
	- CUDA-enabled GPU (recommended) 🎮

	### Installation

	1. Clone the repository:

	```bash
	git clone https://github.com/Deathshot78/Food101-Classification
	cd Food101-Classification
	```

	2. Install the dependencies:

	```bash
	pip install -r requirements.txt
	```

	### Usage

	Run training with a subset for quick testing:

	```bash
	python scripts/main.py
	```

	---

	## 💻 Technologies Used

	- Python
	- PyTorch
	- PyTorch Lightning
	- TorchMetrics
	- Gradio
	- Matplotlib & Seaborn