mcanoglu
/

microsoft-codebert-base-finetuned-defect-detection

Text Classification

Generated from Trainer

text-embeddings-inference

Model card Files Files and versions

Metrics Training metrics Community

mcanoglu commited on Feb 19, 2024

Commit

bfaba8f

·

verified ·

1 Parent(s): cee87e9

End of training

Files changed (2) hide show

README.md +19 -14
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -19,11 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/codebert-base](https://huggingface.co/microsoft/codebert-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5498
-- Accuracy: 0.7026
-- F1: 0.7299
-- Precision: 0.6559
-- Recall: 0.8227
 ## Model description
@@ -43,25 +43,30 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 32
 - eval_batch_size: 8
 - seed: 4711
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| 0.6584        | 1.0   | 997  | 0.5554          | 0.6827   | 0.6347 | 0.7252    | 0.5642 |
-| 0.5304        | 2.0   | 1994 | 0.5229          | 0.6975   | 0.7269 | 0.6502    | 0.8243 |
-| 0.4572        | 3.0   | 2991 | 0.5498          | 0.7026   | 0.7299 | 0.6559    | 0.8227 |
 ### Framework versions
-- Transformers 4.36.2
-- Pytorch 2.1.2+cu121
-- Datasets 2.16.1
-- Tokenizers 0.15.0

 This model is a fine-tuned version of [microsoft/codebert-base](https://huggingface.co/microsoft/codebert-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6534
+- Accuracy: 0.7342
+- F1: 0.7413
+- Precision: 0.7066
+- Recall: 0.7795
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 4711
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
+- mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+| 0.6396        | 1.0   | 996  | 0.5277          | 0.6905   | 0.6502 | 0.7258    | 0.5889 |
+| 0.4862        | 2.0   | 1993 | 0.5331          | 0.7176   | 0.7393 | 0.6733    | 0.8196 |
+| 0.4043        | 3.0   | 2989 | 0.5521          | 0.7339   | 0.7343 | 0.7167    | 0.7528 |
+| 0.3439        | 4.0   | 3986 | 0.5945          | 0.7357   | 0.7422 | 0.7087    | 0.7790 |
+| 0.2946        | 5.0   | 4980 | 0.6534          | 0.7342   | 0.7413 | 0.7066    | 0.7795 |
 ### Framework versions
+- Transformers 4.37.2
+- Pytorch 2.2.0+cu121
+- Datasets 2.17.1
+- Tokenizers 0.15.2

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c8e6af3703f588f4ac2fa2e5c0e510218610b9f63590bc4d4b0c4ad294a2f8d3
 size 498612824

 version https://git-lfs.github.com/spec/v1
+oid sha256:f57df9a5aabbb2a15b7f226c0c95adc1965af247e8b396bf28891b6c79a881a1
 size 498612824