ddecosmo commited on
Commit
5c950f3
·
verified ·
1 Parent(s): 749cc39

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +126 -0
README.md ADDED
@@ -0,0 +1,126 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ '[object Object]': null
3
+ license: mit
4
+ datasets:
5
+ - ddecosmo/lanternfly_training_dataset
6
+ language:
7
+ - en
8
+ base_model:
9
+ - google/efficientnet-b1
10
+ ---
11
+
12
+ # Model Card for {{ model_id | default("Model ID", true) }}
13
+
14
+ <!-- Provide a quick summary of what the model is/does. -->
15
+
16
+ This is a fine tuned version of an EfficientNetB1 model trained on lanternfly, other insects, and general photos
17
+ for classification.
18
+
19
+ ## Model Details
20
+
21
+ ### Model Description
22
+
23
+ This model uses an EfficientNEtB1 with an Adam optimizer, mulit-class accuracy, and cross entropy loss.
24
+
25
+ - **Developed by:** Devin DeCosmo
26
+ - **Model type:** Image Classifier
27
+ - **Language(s) (NLP):** English
28
+ - **License:** MIT
29
+ - **Finetuned from model:** EfficientNetB1
30
+
31
+ ## Uses
32
+
33
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
34
+
35
+ This model is used for classifying lanterfly photos vs other insects and non insect photos.
36
+
37
+ ### Direct Use
38
+
39
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
40
+
41
+ The direct use is classiying photos within the 3 classes provided. Lanternfly, other insect, and non insect classes.
42
+
43
+ ### Out-of-Scope Use
44
+
45
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
46
+
47
+ This could be expanded to additional insect classes to expand range tracking capabilities.
48
+
49
+ ## Bias, Risks, and Limitations
50
+
51
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
52
+
53
+ This model is trained off a subset of lanternfly, insect, and non lanternfly images.
54
+ The dataset is a moderate size with a large number of augmented values.
55
+ It is accurate to 95% within testing and validation but there are edge cases
56
+ not included in the dataset that cause errors.
57
+
58
+ This includes insects in locations not included in training data and outdoor scenes with different lighting.
59
+ The dataset should be expanded or the model should be changed to improve it.
60
+
61
+ ### Recommendations
62
+
63
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
64
+
65
+ The small dataset size means this model is not highly generalizable.
66
+
67
+ ## How to Get Started with the Model
68
+
69
+ Use the code below to get started with the model.
70
+
71
+ ## Training Details
72
+
73
+ ### Training Data
74
+
75
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
76
+
77
+ ddecosmo/lanternfly_training_dataset
78
+
79
+ This is the training dataset used.
80
+
81
+ ### Training Procedure
82
+
83
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
84
+ This model was trained with an AutoML process with accuracy as the main metrics. The modelw as trained over 20 epochs with a batch size of 32 images.
85
+
86
+
87
+ #### Training Hyperparameters
88
+
89
+ This model used an Adam optimizer, mulit-class accuracy, and cross entropy loss.
90
+
91
+
92
+ ## Evaluation
93
+
94
+ <!-- This section describes the evaluation protocols and provides the results. -->
95
+
96
+ ### Testing Data, Factors & Metrics
97
+
98
+ #### Testing Data
99
+
100
+ <!-- This should link to a Dataset Card if possible. -->
101
+ ddecosmo/lanternfly_training_dataset
102
+ The testing data was the 'original' split, the original and 3rd party images in this set.
103
+
104
+ #### Factors
105
+
106
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
107
+
108
+ This dataset is evaluating whether the food is Lanternfly, "0", or Other Insect, "1", or Non Insect "2".
109
+
110
+ #### Metrics
111
+
112
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
113
+
114
+ The testing metric used was accuracy to ensure the highest accuracy of the model possible.
115
+
116
+
117
+ ### Results
118
+
119
+ After training with the initial dataset, this model reached an accuracy of 95% in validation.
120
+
121
+ #### Summary
122
+
123
+ This model reached a high accuracy with our current model.
124
+ The large size of the dataset allowed for a large amount of training.
125
+ After training, it was found the training dataset had gaps, causing edge case failures
126
+ that fell outside the bounds of the original dataset.