rajputnavya commited on
Commit
3f73437
·
verified ·
1 Parent(s): adb86aa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +55 -42
README.md CHANGED
@@ -4,7 +4,7 @@ language:
4
  base_model:
5
  - google-bert/bert-base-uncased
6
  ---
7
- # Model Card for Model ID
8
 
9
  <!-- Provide a quick summary of what the model is/does. -->
10
 
@@ -18,75 +18,84 @@ This modelcard aims to be a base template for new models. It has been generated
18
 
19
 
20
 
21
- - **Developed by:** [More Information Needed]
22
  - **Funded by [optional]:** [More Information Needed]
23
  - **Shared by [optional]:** [More Information Needed]
24
  - **Model type:** [More Information Needed]
25
  - **Language(s) (NLP):** [More Information Needed]
26
- - **License:** [More Information Needed]
27
- - **Finetuned from model [optional]:** [More Information Needed]
28
 
29
- ### Model Sources [optional]
30
 
31
  <!-- Provide the basic links for the model. -->
32
 
33
  - **Repository:** [More Information Needed]
34
  - **Paper [optional]:** [More Information Needed]
35
- - **Demo [optional]:** [More Information Needed]
36
 
37
  ## Uses
38
 
39
  <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 
40
 
41
- ### Direct Use
42
 
43
  <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
44
 
45
- [More Information Needed]
46
 
47
- ### Downstream Use [optional]
48
 
49
  <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
50
 
51
- [More Information Needed]
52
 
53
  ### Out-of-Scope Use
54
 
55
  <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
56
 
57
- [More Information Needed]
58
 
59
  ## Bias, Risks, and Limitations
60
 
61
  <!-- This section is meant to convey both technical and sociotechnical limitations. -->
 
62
 
63
- [More Information Needed]
64
 
65
  ### Recommendations
66
 
67
  <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
68
-
69
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
70
 
71
  ## How to Get Started with the Model
72
 
73
  Use the code below to get started with the model.
74
 
75
- [More Information Needed]
 
 
 
76
 
77
  ## Training Details
78
 
79
  ### Training Data
80
 
81
  <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
82
-
83
- [More Information Needed]
 
 
84
 
85
  ### Training Procedure
86
 
87
  <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
 
 
88
 
89
- #### Preprocessing [optional]
90
 
91
  [More Information Needed]
92
 
@@ -95,41 +104,43 @@ Use the code below to get started with the model.
95
 
96
  - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
97
 
98
- #### Speeds, Sizes, Times [optional]
99
 
100
  <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
101
 
102
- [More Information Needed]
103
-
104
- ## Evaluation
105
 
 
 
106
  <!-- This section describes the evaluation protocols and provides the results. -->
107
 
108
- ### Testing Data, Factors & Metrics
109
 
110
- #### Testing Data
111
 
112
  <!-- This should link to a Dataset Card if possible. -->
113
 
114
- [More Information Needed]
115
 
116
- #### Factors
117
 
118
  <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
119
 
120
- [More Information Needed]
121
-
122
- #### Metrics
123
 
124
  <!-- These are the evaluation metrics being used, ideally with a description of why. -->
125
 
126
- [More Information Needed]
127
 
128
  ### Results
129
 
130
- [More Information Needed]
 
 
131
 
132
- #### Summary
133
 
134
 
135
 
@@ -137,13 +148,13 @@ Use the code below to get started with the model.
137
 
138
  <!-- Relevant interpretability work for the model goes here -->
139
 
140
- [More Information Needed]
141
-
142
- ## Environmental Impact
143
 
144
  <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
145
 
146
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
147
 
148
  - **Hardware Type:** [More Information Needed]
149
  - **Hours used:** [More Information Needed]
@@ -167,15 +178,15 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
167
 
168
  #### Software
169
 
170
- [More Information Needed]
171
 
172
- ## Citation [optional]
173
 
174
  <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
175
 
176
- **BibTeX:**
177
 
178
- [More Information Needed]
179
 
180
  **APA:**
181
 
@@ -185,7 +196,7 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
185
 
186
  <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
187
 
188
- [More Information Needed]
189
 
190
  ## More Information [optional]
191
 
@@ -197,4 +208,6 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
197
 
198
  ## Model Card Contact
199
 
200
- [More Information Needed]
 
 
 
4
  base_model:
5
  - google-bert/bert-base-uncased
6
  ---
7
+ <!-- # Model Card for Model ID
8
 
9
  <!-- Provide a quick summary of what the model is/does. -->
10
 
 
18
 
19
 
20
 
21
+ <!-- - **Developed by:** [More Information Needed]
22
  - **Funded by [optional]:** [More Information Needed]
23
  - **Shared by [optional]:** [More Information Needed]
24
  - **Model type:** [More Information Needed]
25
  - **Language(s) (NLP):** [More Information Needed]
26
+ - **License:** [More Information Needed] -->
27
+ - **Finetuned from model :** google-bert/bert-base-uncased
28
 
29
+ <!-- ### Model Sources [optional]
30
 
31
  <!-- Provide the basic links for the model. -->
32
 
33
  - **Repository:** [More Information Needed]
34
  - **Paper [optional]:** [More Information Needed]
35
+ - **Demo [optional]:** [More Information Needed] -->
36
 
37
  ## Uses
38
 
39
  <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
40
+ This model classifies food recipe or ingredient into three disctinct categories: Vegan, Vegetarian and Non-vegetarian
41
 
42
+ <!-- ### Direct Use
43
 
44
  <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
45
 
46
+ [More Information Needed] -->
47
 
48
+ <!-- ### Downstream Use [optional] -->
49
 
50
  <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
51
 
52
+ <!-- [More Information Needed]
53
 
54
  ### Out-of-Scope Use
55
 
56
  <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
57
 
58
+ [More Information Needed] -->
59
 
60
  ## Bias, Risks, and Limitations
61
 
62
  <!-- This section is meant to convey both technical and sociotechnical limitations. -->
63
+ This model is trained on controlled dataset.
64
 
65
+ <!-- [More Information Needed] -->
66
 
67
  ### Recommendations
68
 
69
  <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
70
+ Model should be fine-tuned on huge book corpus and large synthetic dataset.
71
+ <!-- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. -->
72
 
73
  ## How to Get Started with the Model
74
 
75
  Use the code below to get started with the model.
76
 
77
+ 1. Download model.
78
+ 2. Run testing script.
79
+
80
+ <!-- [More Information Needed] -->
81
 
82
  ## Training Details
83
 
84
  ### Training Data
85
 
86
  <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
87
+ 1. https://huggingface.co/datasets/rajputnavya/food-classification-mlm-clean
88
+ 2. https://huggingface.co/datasets/rajputnavya/food-classification-nsp-format
89
+ 3. https://huggingface.co/datasets/rajputnavya/food-classification-recipe-classification-data/blob/main/fine_tune_format.jsonl
90
+ <!-- [More Information Needed] -->
91
 
92
  ### Training Procedure
93
 
94
  <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
95
+ 1. Training on mlm and nsp dataset combined
96
+ 2. Fine-tuning on synthetic dataset for recipe classification
97
 
98
+ <!-- #### Preprocessing [optional]
99
 
100
  [More Information Needed]
101
 
 
104
 
105
  - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
106
 
107
+ <!-- #### Speeds, Sizes, Times [optional]
108
 
109
  <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
110
 
111
+ <!-- [More Information Needed] -->
 
 
112
 
113
+ <!-- ## Evaluation -->
114
+ -->
115
  <!-- This section describes the evaluation protocols and provides the results. -->
116
 
117
+ <!-- ### Testing Data, Factors & Metrics -->
118
 
119
+ <!-- #### Testing Data
120
 
121
  <!-- This should link to a Dataset Card if possible. -->
122
 
123
+ <!-- [More Information Needed] -->
124
 
125
+ <!-- #### Factors --> -->
126
 
127
  <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
128
 
129
+ <!-- [More Information Needed] -->
130
+ -->
131
+ <!-- #### Metrics -->
132
 
133
  <!-- These are the evaluation metrics being used, ideally with a description of why. -->
134
 
135
+ <!-- [More Information Needed] -->
136
 
137
  ### Results
138
 
139
+ <!-- [More Information Needed] -->
140
+ {'accuracy': 0.9166666666666666, 'precision': 0.9333333333333332, 'recall': 0.9166666666666666, 'f1_score': 0.9153439153439153}
141
+
142
 
143
+ <!-- #### Summary
144
 
145
 
146
 
 
148
 
149
  <!-- Relevant interpretability work for the model goes here -->
150
 
151
+ <!-- [More Information Needed] -->
152
+ <!-- --> -->
153
+ <!-- ## Environmental Impact
154
 
155
  <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
156
 
157
+ <!-- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
158
 
159
  - **Hardware Type:** [More Information Needed]
160
  - **Hours used:** [More Information Needed]
 
178
 
179
  #### Software
180
 
181
+ [More Information Needed] -->
182
 
183
+ <!-- ## Citation [optional] -->
184
 
185
  <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
186
 
187
+ <!-- **BibTeX:** --> -->
188
 
189
+ <!-- [More Information Needed]
190
 
191
  **APA:**
192
 
 
196
 
197
  <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
198
 
199
+ <!-- [More Information Needed]
200
 
201
  ## More Information [optional]
202
 
 
208
 
209
  ## Model Card Contact
210
 
211
+ [More Information Needed]
212
+
213
+ --> -->