Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,84 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: cc
|
| 3 |
+
language:
|
| 4 |
+
- en
|
| 5 |
+
|
| 6 |
+
---
|
| 7 |
+
|
| 8 |
+
# Model Card for AutoML Cuisine Classification
|
| 9 |
+
|
| 10 |
+
This model card documents the **AutoML Cuisine Classification** model trained with AutoGluon Multimodal on a classmate’s dataset of food images.
|
| 11 |
+
The task is to predict whether a food image belongs to **Asian** or **Western** cuisine (binary classification).
|
| 12 |
+
|
| 13 |
+
---
|
| 14 |
+
|
| 15 |
+
## Model Details
|
| 16 |
+
- **Developed by:** Bareethul Kader
|
| 17 |
+
- **Framework:** AutoGluon Multimodal
|
| 18 |
+
- **Repository:** bareethul/image-dataset-model
|
| 19 |
+
- **License:** CC BY 4.0
|
| 20 |
+
|
| 21 |
+
---
|
| 22 |
+
|
| 23 |
+
## Intended Use
|
| 24 |
+
### Direct Use
|
| 25 |
+
- Educational demonstration of AutoML on an image classification task.
|
| 26 |
+
- Comparison of different backbones (ResNet18, MobileNetV3, EfficientNet-B0).
|
| 27 |
+
- Exploring effects of augmentation and model selection under constrained compute budget.
|
| 28 |
+
|
| 29 |
+
### Out of Scope Use
|
| 30 |
+
- Not intended for production deployments in food classification systems.
|
| 31 |
+
- May not generalize to cuisines other than “Asian vs Western,” or to non-restaurant/home cooked settings.
|
| 32 |
+
- Not meant for health/dietary or allergy related automation.
|
| 33 |
+
|
| 34 |
+
---
|
| 35 |
+
|
| 36 |
+
## Dataset
|
| 37 |
+
- **Source:** [maryzhang/hw1-24679-image-dataset](https://huggingface.co/datasets/maryzhang/hw1-24679-image-dataset)
|
| 38 |
+
- **Task:** Binary image classification (label 0 = Western cuisine, label 1 = Asian cuisine)
|
| 39 |
+
- **Size:**
|
| 40 |
+
- Original images: 40
|
| 41 |
+
- Augmented images: 320
|
| 42 |
+
- Total: ≈ 360 images
|
| 43 |
+
- **Features:**
|
| 44 |
+
- `image`: Image (RGB, as provided)
|
| 45 |
+
- `label`: Integer 0 or 1
|
| 46 |
+
|
| 47 |
+
---
|
| 48 |
+
|
| 49 |
+
## Training Setup
|
| 50 |
+
- **AutoML framework:** AutoGluon Multimodal (`MultiModalPredictor`)
|
| 51 |
+
- **Evaluation metric:** Accuracy
|
| 52 |
+
- **Budget:** ~600 seconds (10 minutes) for quick runs; longer (~1800s) for full run and more accuracy.
|
| 53 |
+
- **Hardware:** Google Colab (GPU, typical environment)
|
| 54 |
+
- **Search Space:**
|
| 55 |
+
- Backbones: `resnet18`, `mobilenetv3_small_100`, `efficientnet_b0`
|
| 56 |
+
- **Preprocessing / Augmentation:** As provided in dataset (augmented split); resize and standard image transforms as in dataset loading
|
| 57 |
+
|
| 58 |
+
---
|
| 59 |
+
|
| 60 |
+
## Results
|
| 61 |
+
- **Best model (AutoGluon selected):** *efficientnet_b0*
|
| 62 |
+
- **Validation Accuracy:** *0.96875*
|
| 63 |
+
|
| 64 |
+
---
|
| 65 |
+
|
| 66 |
+
## Limitations, Biases, and Ethical Notes
|
| 67 |
+
- Small dataset size -> overfitting risk.
|
| 68 |
+
- Augmented data may not capture all real world variance (lighting, background, etc.).
|
| 69 |
+
- Binary classification “Asian vs Western” is coarse; many cuisines and dishes don’t neatly fit.
|
| 70 |
+
- Labeling reflects simplified categories; cultural/geographic nuance lost.
|
| 71 |
+
|
| 72 |
+
---
|
| 73 |
+
|
| 74 |
+
## Example Inference
|
| 75 |
+
```python
|
| 76 |
+
from autogluon.multimodal import MultiModalPredictor
|
| 77 |
+
|
| 78 |
+
# Load the pretrained model
|
| 79 |
+
predictor = MultiModalPredictor.load("bareethul/image-dataset-model")
|
| 80 |
+
|
| 81 |
+
# Run inference on an image file
|
| 82 |
+
pred = predictor.predict("path/to/your_test_food_image.jpg")
|
| 83 |
+
print("Prediction:", pred) # 0 = Western cuisine, 1 = Asian cuisine
|
| 84 |
+
|