AurelPx commited on
Commit
4f586b7
·
verified ·
1 Parent(s): 25148ef

DistilBERT multi-label HR classifier v2

Browse files
Files changed (3) hide show
  1. README.md +10 -33
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -4,7 +4,6 @@ license: apache-2.0
4
  base_model: distilbert/distilbert-base-uncased
5
  tags:
6
  - generated_from_trainer
7
- - ml-intern
8
  metrics:
9
  - precision
10
  - recall
@@ -21,13 +20,13 @@ should probably proofread and complete it, then remove this comment. -->
21
 
22
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on the None dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 0.6631
25
- - F1 Micro: 0.1481
26
- - F1 Macro: 0.1452
27
- - Precision: 0.08
28
- - Recall: 1.0
29
  - Accuracy: 0.0
30
- - Hamming: 0.92
31
 
32
  ## Model description
33
 
@@ -46,7 +45,7 @@ More information needed
46
  ### Training hyperparameters
47
 
48
  The following hyperparameters were used during training:
49
- - learning_rate: 3e-05
50
  - train_batch_size: 8
51
  - eval_batch_size: 8
52
  - seed: 42
@@ -55,16 +54,14 @@ The following hyperparameters were used during training:
55
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
56
  - lr_scheduler_type: linear
57
  - lr_scheduler_warmup_steps: 5
58
- - num_epochs: 8
59
 
60
  ### Training results
61
 
62
  | Training Loss | Epoch | Step | Validation Loss | F1 Micro | F1 Macro | Precision | Recall | Accuracy | Hamming |
63
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------:|:------:|:--------:|:-------:|
64
- | 1.3535 | 1.0 | 5 | 0.6631 | 0.1481 | 0.1452 | 0.08 | 1.0 | 0.0 | 0.92 |
65
- | 1.1928 | 2.0 | 10 | 0.5828 | 0.1481 | 0.1452 | 0.08 | 1.0 | 0.0 | 0.92 |
66
- | 1.0551 | 3.0 | 15 | 0.5213 | 0.1401 | 0.1322 | 0.0759 | 0.9062 | 0.0 | 0.89 |
67
- | 0.9999 | 4.0 | 20 | 0.4835 | 0.1418 | 0.1274 | 0.0771 | 0.875 | 0.0 | 0.8475 |
68
 
69
 
70
  ### Framework versions
@@ -73,23 +70,3 @@ The following hyperparameters were used during training:
73
  - Pytorch 2.11.0+cu130
74
  - Datasets 4.8.5
75
  - Tokenizers 0.22.2
76
-
77
- <!-- ml-intern-provenance -->
78
- ## Generated by ML Intern
79
-
80
- This model repository was generated by [ML Intern](https://github.com/huggingface/ml-intern), an agent for machine learning research and development on the Hugging Face Hub.
81
-
82
- - Try ML Intern: https://smolagents-ml-intern.hf.space
83
- - Source code: https://github.com/huggingface/ml-intern
84
-
85
- ## Usage
86
-
87
- ```python
88
- from transformers import AutoModelForCausalLM, AutoTokenizer
89
-
90
- model_id = 'AurelPx/hr-conversations-classifier'
91
- tokenizer = AutoTokenizer.from_pretrained(model_id)
92
- model = AutoModelForCausalLM.from_pretrained(model_id)
93
- ```
94
-
95
- For non-causal architectures, replace `AutoModelForCausalLM` with the appropriate `AutoModel` class.
 
4
  base_model: distilbert/distilbert-base-uncased
5
  tags:
6
  - generated_from_trainer
 
7
  metrics:
8
  - precision
9
  - recall
 
20
 
21
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.6809
24
+ - F1 Micro: 0.1111
25
+ - F1 Macro: 0.0470
26
+ - Precision: 0.0714
27
+ - Recall: 0.25
28
  - Accuracy: 0.0
29
+ - Hamming: 0.32
30
 
31
  ## Model description
32
 
 
45
  ### Training hyperparameters
46
 
47
  The following hyperparameters were used during training:
48
+ - learning_rate: 1.0000000000000002e-06
49
  - train_batch_size: 8
50
  - eval_batch_size: 8
51
  - seed: 42
 
54
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
55
  - lr_scheduler_type: linear
56
  - lr_scheduler_warmup_steps: 5
57
+ - num_epochs: 2
58
 
59
  ### Training results
60
 
61
  | Training Loss | Epoch | Step | Validation Loss | F1 Micro | F1 Macro | Precision | Recall | Accuracy | Hamming |
62
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|:---------:|:------:|:--------:|:-------:|
63
+ | 1.3633 | 1.0 | 5 | 0.6809 | 0.1111 | 0.0470 | 0.0714 | 0.25 | 0.0 | 0.32 |
64
+ | 1.3502 | 2.0 | 10 | 0.6771 | 0.1 | 0.0450 | 0.0648 | 0.2188 | 0.0 | 0.315 |
 
 
65
 
66
 
67
  ### Framework versions
 
70
  - Pytorch 2.11.0+cu130
71
  - Datasets 4.8.5
72
  - Tokenizers 0.22.2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:47236dfdfc9cfc4c87c96dfdefc072d83982c74151298784f3a5007d77c165bb
3
  size 267887936
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f756c07a9328f8dcd94706b0fe1b9b595c1db841a595c129037c4798cba79e0a
3
  size 267887936
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f2a67b9b3a81433171067eee5ac5567f0effae5b3d842161da65bc3dda49ab2f
3
  size 5329
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c78eb30eb8bb36c69819fe090ac512e14369aeaef61d0a5f1f565ecc8296a795
3
  size 5329