bareethul commited on
Commit
0575c82
·
verified ·
1 Parent(s): 2812a91

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +84 -0
README.md ADDED
@@ -0,0 +1,84 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc
3
+ language:
4
+ - en
5
+
6
+ ---
7
+
8
+ # Model Card for AutoML Cuisine Classification
9
+
10
+ This model card documents the **AutoML Cuisine Classification** model trained with AutoGluon Multimodal on a classmate’s dataset of food images.
11
+ The task is to predict whether a food image belongs to **Asian** or **Western** cuisine (binary classification).
12
+
13
+ ---
14
+
15
+ ## Model Details
16
+ - **Developed by:** Bareethul Kader
17
+ - **Framework:** AutoGluon Multimodal
18
+ - **Repository:** bareethul/image-dataset-model
19
+ - **License:** CC BY 4.0
20
+
21
+ ---
22
+
23
+ ## Intended Use
24
+ ### Direct Use
25
+ - Educational demonstration of AutoML on an image classification task.
26
+ - Comparison of different backbones (ResNet18, MobileNetV3, EfficientNet-B0).
27
+ - Exploring effects of augmentation and model selection under constrained compute budget.
28
+
29
+ ### Out of Scope Use
30
+ - Not intended for production deployments in food classification systems.
31
+ - May not generalize to cuisines other than “Asian vs Western,” or to non-restaurant/home cooked settings.
32
+ - Not meant for health/dietary or allergy related automation.
33
+
34
+ ---
35
+
36
+ ## Dataset
37
+ - **Source:** [maryzhang/hw1-24679-image-dataset](https://huggingface.co/datasets/maryzhang/hw1-24679-image-dataset)
38
+ - **Task:** Binary image classification (label 0 = Western cuisine, label 1 = Asian cuisine)
39
+ - **Size:**
40
+ - Original images: 40
41
+ - Augmented images: 320
42
+ - Total: ≈ 360 images
43
+ - **Features:**
44
+ - `image`: Image (RGB, as provided)
45
+ - `label`: Integer 0 or 1
46
+
47
+ ---
48
+
49
+ ## Training Setup
50
+ - **AutoML framework:** AutoGluon Multimodal (`MultiModalPredictor`)
51
+ - **Evaluation metric:** Accuracy
52
+ - **Budget:** ~600 seconds (10 minutes) for quick runs; longer (~1800s) for full run and more accuracy.
53
+ - **Hardware:** Google Colab (GPU, typical environment)
54
+ - **Search Space:**
55
+ - Backbones: `resnet18`, `mobilenetv3_small_100`, `efficientnet_b0`
56
+ - **Preprocessing / Augmentation:** As provided in dataset (augmented split); resize and standard image transforms as in dataset loading
57
+
58
+ ---
59
+
60
+ ## Results
61
+ - **Best model (AutoGluon selected):** *efficientnet_b0*
62
+ - **Validation Accuracy:** *0.96875*
63
+
64
+ ---
65
+
66
+ ## Limitations, Biases, and Ethical Notes
67
+ - Small dataset size -> overfitting risk.
68
+ - Augmented data may not capture all real world variance (lighting, background, etc.).
69
+ - Binary classification “Asian vs Western” is coarse; many cuisines and dishes don’t neatly fit.
70
+ - Labeling reflects simplified categories; cultural/geographic nuance lost.
71
+
72
+ ---
73
+
74
+ ## Example Inference
75
+ ```python
76
+ from autogluon.multimodal import MultiModalPredictor
77
+
78
+ # Load the pretrained model
79
+ predictor = MultiModalPredictor.load("bareethul/image-dataset-model")
80
+
81
+ # Run inference on an image file
82
+ pred = predictor.predict("path/to/your_test_food_image.jpg")
83
+ print("Prediction:", pred) # 0 = Western cuisine, 1 = Asian cuisine
84
+