ddecosmo committed · Commit 597b980 · verified · 1 Parent(s): 4bdd494

Create README.md

  1. README.md +119 -0
README.md ADDED
@@ -0,0 +1,119 @@
+ ---
+ '[object Object]': null
+ license: mit
+ datasets:
+ - maryzhang/hw1-24679-image-dataset
+ language:
+ - en
+ ---
+
+ # Model Card for {{ model_id | default("Model ID", true) }}
+
+ <!-- Provide a quick summary of what the model is/does. -->
+
+ This is a fine-tuned version of the RandomForestEntr_BAG_L1 model for classification. It was fine-tuned on EricCRX/books-tabular-dataset, a dataset of book measurements.
+ In this case, it was used for binary classification between softcover and hardcover books.
+
+ ## Model Details
+
+ ### Model Description
+
+ This model uses RandomForestEntr_BAG_L1, with accuracy as the main evaluation metric and multiclass accuracy and cross-entropy tracked as additional metrics.
+ The `BAG_L1` suffix most likely denotes a bagged ensemble at stack level 1 (AutoGluon-style naming) rather than L1 regularization; bagging helps reduce overfitting.
+
+ - **Developed by:** Devin DeCosmo
+ - **Model type:** Binary Classifier
+ - **Language(s) (NLP):** English
+ - **License:** MIT
+ - **Finetuned from model:** RandomForestEntr_BAG_L1
+
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+ This could be used for general image classification tasks, especially those for culinary uses.
+
+ ### Direct Use
+
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+
+ The direct use would be to classify food as either Western or Asian based on an image.
+
+ ### Out-of-Scope Use
+
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+
+ If the dataset were expanded, this could be used to classify other types of food among numerous other classes.
+
+ ## Bias, Risks, and Limitations
+
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+
+ The model was trained on a small dataset of 30 original photos and 300 augmented photos. A dataset this small raises the risk of overfitting, and additional data is required to make the model more robust.
+
+ ### Recommendations
+
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+
+ The small dataset size means this model is not highly generalizable.
+
+ ## How to Get Started with the Model
+
+ Use the code below to get started with the model.
+
+ ## Training Details
+
+ ### Training Data
+
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+
+ maryzhang/hw1-24679-image-dataset
+
+ This is the training dataset used.
+ It consists of 30 original images used for validation along with 300 synthetic images used for training.
+
+ ### Training Procedure
+
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+ This model was trained with an AutoML process with accuracy as the main metric. The model was trained over 20 epochs with a batch size of 32 images.
+
+
+ #### Training Hyperparameters
+
+ - **Training regime:** {{ training_regime | default("[More Information Needed]", true)}} <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+
+
+ ## Evaluation
+
+ <!-- This section describes the evaluation protocols and provides the results. -->
+
+ ### Testing Data, Factors & Metrics
+
+ #### Testing Data
+
+ <!-- This should link to a Dataset Card if possible. -->
+ maryzhang/hw1-24679-image-dataset
+ The testing data was the 'original' split, the 30 original images in this set.
+
+ #### Factors
+
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+
+ The evaluation distinguishes Western food (label "1") from Asian food (label "0").
+
+ #### Metrics
+
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
+
+ The testing metric was accuracy: the fraction of test examples that the model classifies correctly.
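As a rough illustration of the accuracy metric described above, the sketch below computes the fraction of matching labels in plain Python. The function name `accuracy` and the sample labels are hypothetical and are not taken from the model's actual evaluation code; the labels follow the card's convention of "1" for Western and "0" for Asian.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Return the fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("at least one example is required")
    # Count positions where the predicted label equals the true label.
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# Hypothetical labels: 1 = Western, 0 = Asian.
labels = [1, 0, 1, 1]
predictions = [1, 0, 0, 1]
print(accuracy(labels, predictions))
```

With only 30 test images, each misclassified image changes this value by roughly 3.3 percentage points.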
+
+
+ ### Results
+
+ After training with the initial dataset, this model reached an accuracy of 95% in validation.
+
+ #### Summary
+
+ The model reached a high accuracy, but this performance cannot be confirmed to hold on new data because the dataset was very small.
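To make the small-dataset caveat concrete, the sketch below computes a Wilson score interval for an accuracy estimate. This is an illustrative addition, not part of the original evaluation: the function name `wilson_interval` is hypothetical, and the counts in the usage example (29 correct out of 30) are made up for illustration and are not the reported result.

```python
import math
from typing import Tuple


def wilson_interval(correct: int, total: int, z: float = 1.96) -> Tuple[float, float]:
    """Wilson score interval for a proportion (z=1.96 gives roughly 95% coverage)."""
    if total <= 0 or not 0 <= correct <= total:
        raise ValueError("need total > 0 and 0 <= correct <= total")
    p = correct / total
    denom = 1 + z * z / total
    center = (p + z * z / (2 * total)) / denom
    half_width = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    # Clamp to [0, 1] to guard against floating-point overshoot.
    return max(0.0, center - half_width), min(1.0, center + half_width)


# Hypothetical counts, for illustration only.
low, high = wilson_interval(correct=29, total=30)
print(f"accuracy interval: [{low:.3f}, {high:.3f}]")
```

With a test set of this size the interval spans well over ten percentage points, which is why a single high accuracy figure says little about performance on new data.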
+