shawhin committed (verified)
Commit: 0a93a9d
Parent(s): 64f400a

End of training

Files changed (5):
  1. README.md +18 -23
  2. config.json +1 -1
  3. model.safetensors +1 -1
  4. tokenizer.json +1 -6
  5. training_args.bin +2 -2
README.md CHANGED
@@ -1,5 +1,4 @@
 ---
-library_name: transformers
 license: apache-2.0
 base_model: google-bert/bert-base-uncased
 tags:
@@ -9,8 +8,6 @@ metrics:
 model-index:
 - name: bert-phishing-classifier_teacher
   results: []
-datasets:
-- shawhin/phishing-site-classification
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -18,17 +15,15 @@ should probably proofread and complete it, then remove this comment. -->
 
 # bert-phishing-classifier_teacher
 
-This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on the None dataset.
+This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2881
-- Accuracy: 0.867
+- Loss: 0.2984
+- Accuracy: 0.873
 - Auc: 0.951
 
 ## Model description
 
-Teacher model for knowledge distillation example.
-
-[Video](https://youtu.be/FLkUOkeMd5M) | [Blog](https://towardsdatascience.com/compressing-large-language-models-llms-9f406eea5b5e) | [Example code](https://github.com/ShawhinT/YouTube-Blog/tree/main/LLMs/model-compression)
+More information needed
 
 ## Intended uses & limitations
 
@@ -55,21 +50,21 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Auc |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:-----:|
-| 0.4916        | 1.0   | 263  | 0.4228          | 0.784    | 0.915 |
-| 0.3894        | 2.0   | 526  | 0.3586          | 0.818    | 0.932 |
-| 0.3837        | 3.0   | 789  | 0.3144          | 0.86     | 0.939 |
-| 0.3574        | 4.0   | 1052 | 0.4494          | 0.807    | 0.942 |
-| 0.3517        | 5.0   | 1315 | 0.3287          | 0.86     | 0.947 |
-| 0.3518        | 6.0   | 1578 | 0.3042          | 0.871    | 0.949 |
-| 0.3185        | 7.0   | 1841 | 0.2900          | 0.862    | 0.949 |
-| 0.3267        | 8.0   | 2104 | 0.2958          | 0.876    | 0.95  |
-| 0.3153        | 9.0   | 2367 | 0.2881          | 0.867    | 0.951 |
-| 0.3061        | 10.0  | 2630 | 0.2963          | 0.873    | 0.951 |
+| 0.495         | 1.0   | 263  | 0.4166          | 0.78     | 0.912 |
+| 0.3896        | 2.0   | 526  | 0.3570          | 0.822    | 0.931 |
+| 0.3824        | 3.0   | 789  | 0.3168          | 0.858    | 0.938 |
+| 0.3561        | 4.0   | 1052 | 0.4707          | 0.789    | 0.941 |
+| 0.3516        | 5.0   | 1315 | 0.3298          | 0.862    | 0.946 |
+| 0.354         | 6.0   | 1578 | 0.3049          | 0.869    | 0.948 |
+| 0.3215        | 7.0   | 1841 | 0.2908          | 0.864    | 0.949 |
+| 0.3262        | 8.0   | 2104 | 0.2987          | 0.876    | 0.95  |
+| 0.3154        | 9.0   | 2367 | 0.2896          | 0.864    | 0.951 |
+| 0.306         | 10.0  | 2630 | 0.2984          | 0.873    | 0.951 |
 
 
 ### Framework versions
 
-- Transformers 4.44.2
-- Pytorch 2.2.2
-- Datasets 2.21.0
-- Tokenizers 0.19.1
+- Transformers 4.43.1
+- Pytorch 2.3.1
+- Datasets 3.2.0
+- Tokenizers 0.19.1
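The Accuracy and Auc columns in the model card above are the standard binary-classification metrics (AUC here being the probability that a random positive example scores above a random negative one). A minimal pure-Python sketch of how such numbers are computed, using toy labels and scores rather than this model's actual predictions:

```python
def accuracy(y_true, y_pred):
    # Fraction of predictions that match the true labels
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def auc(y_true, scores):
    # Rank-based AUC: probability a random positive outscores a random negative
    # (ties count as half a win)
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

y_true = [1, 0, 1, 0]           # toy labels: 1 = phishing, 0 = benign
scores = [0.9, 0.2, 0.6, 0.7]   # hypothetical positive-class scores
y_pred = [1 if s >= 0.5 else 0 for s in scores]
print(accuracy(y_true, y_pred))  # 0.75
print(auc(y_true, scores))       # 0.75
```

In practice the Trainer computes these with a metrics library; the sketch only illustrates what the two columns measure.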
config.json CHANGED
@@ -28,7 +28,7 @@
   "position_embedding_type": "absolute",
   "problem_type": "single_label_classification",
   "torch_dtype": "float32",
-  "transformers_version": "4.44.2",
+  "transformers_version": "4.43.1",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f84a167f2d2459a4bf6984691bd133a11c47e171b25153668add2db492eb2a7a
+oid sha256:af108aad02608a7e6c01a820a56490634406aff5e48a49378fff48943571d50d
 size 437958648
tokenizer.json CHANGED
@@ -1,11 +1,6 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 512,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
+  "truncation": null,
   "padding": null,
   "added_tokens": [
     {
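This tokenizer.json change removes the baked-in truncation policy (right-sided, `max_length: 512`) and sets `truncation` to `null`, so callers now opt into truncation per call (in transformers, typically `tokenizer(text, truncation=True, max_length=512)`; that call shape is standard API, not something shown in this commit). What the removed right-sided policy did to an over-long sequence, as a pure-Python sketch with hypothetical token ids:

```python
def truncate_right(ids, max_length=512):
    # "Right" truncation keeps the first max_length tokens and drops the tail
    return ids[:max_length]

ids = list(range(600))            # hypothetical token ids, longer than the limit
print(len(truncate_right(ids)))   # 512
```

Sequences at or under the limit pass through unchanged; only the tail beyond `max_length` is dropped.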
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6aceaf13011825b84313dbb2f00a7029cca6a8cddd1f996c2d43c82554403c66
-size 5240
+oid sha256:6236ce2a42ccfe06196d03dff57baf9f5889bfc06ae9672175ba932d390f0ed5
+size 5176
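Both model.safetensors and training_args.bin are stored as git-lfs pointer files: the `oid` line records the SHA-256 of the actual object and `size` its byte length. A minimal stdlib-only sketch (helper names are my own) for checking a downloaded blob against its pointer:

```python
import hashlib

def lfs_oid(blob: bytes) -> str:
    # SHA-256 hex digest, the value after "sha256:" in an LFS pointer's oid line
    return hashlib.sha256(blob).hexdigest()

def matches_pointer(blob: bytes, pointer_text: str) -> bool:
    # Parse the "key value" lines of the pointer (version, oid, size)
    fields = dict(line.split(" ", 1) for line in pointer_text.strip().splitlines())
    oid = fields["oid"].split(":", 1)[1]
    return len(blob) == int(fields["size"]) and lfs_oid(blob) == oid
```

Running this against the pointers above would require the actual artifacts; the sketch only shows the verification logic implied by the pointer format.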