Upload intent classifier v2 - Accuracy: 100.00%
- README.md +144 -0
- checkpoint-220/config.json +54 -0
- checkpoint-220/model.safetensors +3 -0
- checkpoint-220/optimizer.pt +3 -0
- checkpoint-220/rng_state.pth +3 -0
- checkpoint-220/scheduler.pt +3 -0
- checkpoint-220/special_tokens_map.json +7 -0
- checkpoint-220/tokenizer.json +0 -0
- checkpoint-220/tokenizer_config.json +56 -0
- checkpoint-220/trainer_state.json +119 -0
- checkpoint-220/training_args.bin +3 -0
- checkpoint-220/vocab.txt +0 -0
- checkpoint-275/config.json +54 -0
- checkpoint-275/model.safetensors +3 -0
- checkpoint-275/optimizer.pt +3 -0
- checkpoint-275/rng_state.pth +3 -0
- checkpoint-275/scheduler.pt +3 -0
- checkpoint-275/special_tokens_map.json +7 -0
- checkpoint-275/tokenizer.json +0 -0
- checkpoint-275/tokenizer_config.json +56 -0
- checkpoint-275/trainer_state.json +138 -0
- checkpoint-275/training_args.bin +3 -0
- checkpoint-275/vocab.txt +0 -0
- config.json +54 -0
- label_encoder.pkl +3 -0
- model.safetensors +3 -0
- special_tokens_map.json +7 -0
- tokenizer.json +0 -0
- tokenizer_config.json +56 -0
- training_args.bin +3 -0
- training_results.json +37 -0
- vocab.txt +0 -0
README.md
ADDED
---
language: en
license: apache-2.0
tags:
- text-classification
- intent-classification
- conversational-ai
- bert
- distilbert
datasets:
- custom
metrics:
- accuracy
- f1
model-index:
- name: intent-classifier-v2
  results:
  - task:
      type: text-classification
      name: Intent Classification
    metrics:
    - type: accuracy
      value: 1.0000
      name: Test Accuracy
    - type: f1
      value: 1.0000
      name: Weighted F1
---

# DAPA Intent Classifier v2

## Model Description

This model classifies user intents for the DAPA AI conversational assistant system. It supports 13 intents: 12 agentic workflows plus 1 general Q&A fallback.

- **Model Type:** DistilBERT for Sequence Classification
- **Training Date:** 2025-10-25
- **Accuracy:** 100.00%
- **F1 Score:** 1.0000

## Supported Intents

The model classifies queries into 13 intents:

### Agentic Intents (12)
1. **generate-offer** - Generate job offers, NDAs, contracts
2. **schedule-interview** - Schedule candidate interviews
3. **update-employee-profile** - Update employee information
4. **access-employee-record** - Access employee records
5. **approve-expense** - Approve expense reports
6. **check-leave-balance** - Check leave balances
7. **confirm-training-completion** - Confirm training completion
8. **provide-candidate-feedback** - Provide candidate feedback
9. **request-leave** - Request time off
10. **request-training** - Request training enrollment
11. **review-contract** - Review contracts
12. **submit-expense** - Submit expense reports

### Q&A Intent (1)
13. **general-query** - Generic queries (email lookups, status checks, policy questions)

## Training Data

- **Total Samples:** 1,240
- **Training Split:** 70% (868 samples)
- **Validation Split:** 15% (186 samples)
- **Test Split:** 15% (186 samples)
- **Data Balance:** 80-200 examples per intent

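As a sanity check, the sample counts above follow directly from the stated fractions; the `split_sizes` helper below is illustrative, not part of the training code.

```python
def split_sizes(total, train_frac=0.70, val_frac=0.15):
    """Return (train, val, test) counts for a 70/15/15 split.

    The test set takes the remainder so the three parts sum to total.
    """
    train = round(total * train_frac)
    val = round(total * val_frac)
    return train, val, total - train - val

print(split_sizes(1240))  # (868, 186, 186), matching the splits above
```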
## Performance

### Overall Metrics
- **Test Accuracy:** 100.00%
- **Weighted Precision:** 1.0000
- **Weighted Recall:** 1.0000
- **Weighted F1:** 1.0000

### Per-Intent Performance

| Intent | Precision | Recall | F1 | Support |
|--------|-----------|--------|-----|---------|
| access-employee-record | 1.000 | 1.000 | 1.000 | 12 |
| approve-expense | 1.000 | 1.000 | 1.000 | 12 |
| check-leave-balance | 1.000 | 1.000 | 1.000 | 12 |
| confirm-training-completion | 1.000 | 1.000 | 1.000 | 12 |
| general-query | 1.000 | 1.000 | 1.000 | 30 |
| generate-offer | 1.000 | 1.000 | 1.000 | 18 |
| provide-candidate-feedback | 1.000 | 1.000 | 1.000 | 12 |
| request-leave | 1.000 | 1.000 | 1.000 | 12 |
| request-training | 1.000 | 1.000 | 1.000 | 12 |
| review-contract | 1.000 | 1.000 | 1.000 | 12 |
| schedule-interview | 1.000 | 1.000 | 1.000 | 15 |
| submit-expense | 1.000 | 1.000 | 1.000 | 12 |
| update-employee-profile | 1.000 | 1.000 | 1.000 | 15 |

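The Weighted F1 reported above is the support-weighted average of the per-intent F1 scores. A minimal illustration (the `weighted_f1` helper is hypothetical, not from the evaluation script):

```python
def weighted_f1(f1_by_intent, support_by_intent):
    """Support-weighted average of per-class F1 scores."""
    total = sum(support_by_intent.values())
    return sum(f1 * support_by_intent[name]
               for name, f1 in f1_by_intent.items()) / total

# Supports from the per-intent table (186 test samples in total).
support = {"general-query": 30, "generate-offer": 18,
           "schedule-interview": 15, "update-employee-profile": 15}
support.update({name: 12 for name in [
    "access-employee-record", "approve-expense", "check-leave-balance",
    "confirm-training-completion", "provide-candidate-feedback",
    "request-leave", "request-training", "review-contract",
    "submit-expense"]})
f1 = {name: 1.0 for name in support}  # every per-intent F1 is 1.000
print(weighted_f1(f1, support))
```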
## Usage

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

# Load model and tokenizer
tokenizer = AutoTokenizer.from_pretrained("SantmanKT/intent-classifier-v2")
model = AutoModelForSequenceClassification.from_pretrained("SantmanKT/intent-classifier-v2")
model.eval()

# Predict intent; queries are formatted as "<query> [context: {...}]"
query = "send offer to John"
context = "{domain: hr}"
input_text = f"{query} [context: {context}]"

inputs = tokenizer(input_text, return_tensors="pt", truncation=True, max_length=128)
with torch.no_grad():
    outputs = model(**inputs)
probs = torch.softmax(outputs.logits, dim=-1)
confidence, predicted_idx = torch.max(probs, dim=-1)

intent = model.config.id2label[predicted_idx.item()]
print(f"Intent: {intent}, Confidence: {confidence.item():.2%}")
```

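Note that `config.json` in this commit maps ids to the generic names `LABEL_0` through `LABEL_12`, so `model.config.id2label` returns a placeholder rather than an intent string. The commit also ships `label_encoder.pkl`; assuming it is a fitted scikit-learn `LabelEncoder` (its alphabetical class order matches `class_names` in `training_results.json`), the id-to-intent mapping can be recovered as sketched below. This is an assumption-based sketch, not part of the published snippet.

```python
import pickle

def id_to_intent(predicted_idx, class_names):
    """Map a predicted class index to its intent string."""
    return class_names[predicted_idx]

# The 13 class names from training_results.json, in alphabetical order
# (the ordering a sklearn LabelEncoder would produce).
CLASS_NAMES = [
    "access-employee-record", "approve-expense", "check-leave-balance",
    "confirm-training-completion", "general-query", "generate-offer",
    "provide-candidate-feedback", "request-leave", "request-training",
    "review-contract", "schedule-interview", "submit-expense",
    "update-employee-profile",
]

# If the repo artifact is available, load the mapping from it instead:
# with open("label_encoder.pkl", "rb") as f:
#     CLASS_NAMES = list(pickle.load(f).classes_)

print(id_to_intent(4, CLASS_NAMES))  # general-query
```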
## Routing Logic

- **High Confidence (≥70%) + Agentic Intent** → Route to Domain Service
- **Low Confidence (<70%) OR general-query** → Route to Q&A Service

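The two rules above can be sketched as a single routing function; the function name and service identifiers are illustrative assumptions, not taken from the DAPA codebase.

```python
# The 12 agentic intents; anything else (i.e. general-query) goes to Q&A.
AGENTIC_INTENTS = {
    "generate-offer", "schedule-interview", "update-employee-profile",
    "access-employee-record", "approve-expense", "check-leave-balance",
    "confirm-training-completion", "provide-candidate-feedback",
    "request-leave", "request-training", "review-contract", "submit-expense",
}
CONFIDENCE_THRESHOLD = 0.70

def route(intent: str, confidence: float) -> str:
    """Send only confident agentic predictions to the domain service."""
    if intent in AGENTIC_INTENTS and confidence >= CONFIDENCE_THRESHOLD:
        return "domain-service"
    return "qa-service"

print(route("generate-offer", 0.95))  # domain-service
print(route("general-query", 0.99))   # qa-service
print(route("submit-expense", 0.40))  # qa-service
```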
## Model Details

- **Base Model:** distilbert-base-uncased
- **Max Sequence Length:** 128 tokens
- **Training Epochs:** 5
- **Batch Size:** 16
- **Learning Rate:** 2e-5
- **Framework:** HuggingFace Transformers

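The checkpoint names in this commit follow from the hyperparameters above: 868 training samples at batch size 16 give ceil(868/16) = 55 optimizer steps per epoch, so epoch 4 ends at step 220 (checkpoint-220) and epoch 5 at step 275 (checkpoint-275, matching `max_steps` in trainer_state.json). A quick arithmetic check:

```python
import math

def steps_per_epoch(num_train_samples, batch_size):
    """Optimizer steps per epoch when the last partial batch is kept."""
    return math.ceil(num_train_samples / batch_size)

spe = steps_per_epoch(868, 16)
print(spe, 4 * spe, 5 * spe)  # 55 220 275
```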
## Limitations

- Optimized for English-language queries only
- Requires context formatting: `[context: {...}]`
- Performance may degrade on queries significantly different from the training data

## Citation

If you use this model, please cite:

```bibtex
@misc{dapa-intent-classifier-v2,
  author    = {SantmanKT},
  title     = {DAPA Intent Classifier v2},
  year      = {2025},
  publisher = {HuggingFace},
  url       = {https://huggingface.co/SantmanKT/intent-classifier-v2}
}
```

## License

Apache 2.0
checkpoint-220/config.json
ADDED
```json
{
  "activation": "gelu",
  "architectures": ["DistilBertForSequenceClassification"],
  "attention_dropout": 0.1,
  "dim": 768,
  "dropout": 0.1,
  "dtype": "float32",
  "hidden_dim": 3072,
  "id2label": {
    "0": "LABEL_0", "1": "LABEL_1", "2": "LABEL_2", "3": "LABEL_3",
    "4": "LABEL_4", "5": "LABEL_5", "6": "LABEL_6", "7": "LABEL_7",
    "8": "LABEL_8", "9": "LABEL_9", "10": "LABEL_10", "11": "LABEL_11",
    "12": "LABEL_12"
  },
  "initializer_range": 0.02,
  "label2id": {
    "LABEL_0": 0, "LABEL_1": 1, "LABEL_2": 2, "LABEL_3": 3, "LABEL_4": 4,
    "LABEL_5": 5, "LABEL_6": 6, "LABEL_7": 7, "LABEL_8": 8, "LABEL_9": 9,
    "LABEL_10": 10, "LABEL_11": 11, "LABEL_12": 12
  },
  "max_position_embeddings": 512,
  "model_type": "distilbert",
  "n_heads": 12,
  "n_layers": 6,
  "pad_token_id": 0,
  "problem_type": "single_label_classification",
  "qa_dropout": 0.1,
  "seq_classif_dropout": 0.2,
  "sinusoidal_pos_embds": false,
  "tie_weights_": true,
  "transformers_version": "4.57.1",
  "vocab_size": 30522
}
```
checkpoint-220/model.safetensors
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:7271f7943cae385cf6983f9b28593d0b455415275ce51c6d3e8d62c888103cf0
size 267866404
```
checkpoint-220/optimizer.pt
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:a881b17de8775bd85d9f400a6d7e46096a50a44b51a59286a041341eb75004dc
size 535796811
```
checkpoint-220/rng_state.pth
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:80b137b2f227cdf353434f493e3c7ba766b954350d79b48a45c8463422ff4eff
size 14645
```
checkpoint-220/scheduler.pt
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:f734eabbfe87eaa3ac495e780d550b3e9bd337494d9a3caea639822a8fea66dc
size 1465
```
checkpoint-220/special_tokens_map.json
ADDED
```json
{
  "cls_token": "[CLS]",
  "mask_token": "[MASK]",
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "unk_token": "[UNK]"
}
```
checkpoint-220/tokenizer.json
ADDED
The diff for this file is too large to render.
checkpoint-220/tokenizer_config.json
ADDED
```json
{
  "added_tokens_decoder": {
    "0":   {"content": "[PAD]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "100": {"content": "[UNK]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "101": {"content": "[CLS]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "102": {"content": "[SEP]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "103": {"content": "[MASK]", "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true}
  },
  "clean_up_tokenization_spaces": false,
  "cls_token": "[CLS]",
  "do_lower_case": true,
  "extra_special_tokens": {},
  "mask_token": "[MASK]",
  "model_max_length": 512,
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "strip_accents": null,
  "tokenize_chinese_chars": true,
  "tokenizer_class": "DistilBertTokenizer",
  "unk_token": "[UNK]"
}
```
checkpoint-220/trainer_state.json
ADDED
```json
{
  "best_global_step": 220,
  "best_metric": 0.9837687666378767,
  "best_model_checkpoint": "intent_classifier_v2/checkpoint-220",
  "epoch": 4.0,
  "eval_steps": 500,
  "global_step": 220,
  "is_hyper_param_search": false,
  "is_local_process_zero": true,
  "is_world_process_zero": true,
  "log_history": [
    {"epoch": 0.9090909090909091, "grad_norm": 1.5899983644485474, "learning_rate": 1.9600000000000003e-06, "loss": 2.5642, "step": 50},
    {"epoch": 1.0, "eval_accuracy": 0.1881720430107527, "eval_f1": 0.10086428092694365, "eval_loss": 2.5428154468536377, "eval_precision": 0.0936228251427814, "eval_recall": 0.1881720430107527, "eval_runtime": 0.1839, "eval_samples_per_second": 1011.503, "eval_steps_per_second": 65.258, "step": 55},
    {"epoch": 1.8181818181818183, "grad_norm": 3.272284984588623, "learning_rate": 3.96e-06, "loss": 2.5118, "step": 100},
    {"epoch": 2.0, "eval_accuracy": 0.26344086021505375, "eval_f1": 0.17357707395658062, "eval_loss": 2.3705923557281494, "eval_precision": 0.23162059134445975, "eval_recall": 0.26344086021505375, "eval_runtime": 0.202, "eval_samples_per_second": 920.577, "eval_steps_per_second": 59.392, "step": 110},
    {"epoch": 2.7272727272727275, "grad_norm": 4.249340057373047, "learning_rate": 5.9600000000000005e-06, "loss": 2.303, "step": 150},
    {"epoch": 3.0, "eval_accuracy": 0.8064516129032258, "eval_f1": 0.7699332427341867, "eval_loss": 1.9005680084228516, "eval_precision": 0.8112076095947064, "eval_recall": 0.8064516129032258, "eval_runtime": 0.18, "eval_samples_per_second": 1033.189, "eval_steps_per_second": 66.657, "step": 165},
    {"epoch": 3.6363636363636362, "grad_norm": 4.2065110206604, "learning_rate": 7.960000000000002e-06, "loss": 1.8712, "step": 200},
    {"epoch": 4.0, "eval_accuracy": 0.9838709677419355, "eval_f1": 0.9837687666378767, "eval_loss": 1.1924996376037598, "eval_precision": 0.984740928604229, "eval_recall": 0.9838709677419355, "eval_runtime": 0.1796, "eval_samples_per_second": 1035.622, "eval_steps_per_second": 66.814, "step": 220}
  ],
  "logging_steps": 50,
  "max_steps": 275,
  "num_input_tokens_seen": 0,
  "num_train_epochs": 5,
  "save_steps": 500,
  "stateful_callbacks": {
    "EarlyStoppingCallback": {
      "args": {"early_stopping_patience": 2, "early_stopping_threshold": 0.0},
      "attributes": {"early_stopping_patience_counter": 0}
    },
    "TrainerControl": {
      "args": {"should_epoch_stop": false, "should_evaluate": false, "should_log": false, "should_save": true, "should_training_stop": false},
      "attributes": {}
    }
  },
  "total_flos": 27852593715744.0,
  "train_batch_size": 16,
  "trial_name": null,
  "trial_params": null
}
```
checkpoint-220/training_args.bin
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:7161c9b09c8630acc22aba2b488c51d515d356f1c8d06aae95857631447c08a5
size 5777
```
checkpoint-220/vocab.txt
ADDED
The diff for this file is too large to render.
checkpoint-275/config.json
ADDED
```json
{
  "activation": "gelu",
  "architectures": ["DistilBertForSequenceClassification"],
  "attention_dropout": 0.1,
  "dim": 768,
  "dropout": 0.1,
  "dtype": "float32",
  "hidden_dim": 3072,
  "id2label": {
    "0": "LABEL_0", "1": "LABEL_1", "2": "LABEL_2", "3": "LABEL_3",
    "4": "LABEL_4", "5": "LABEL_5", "6": "LABEL_6", "7": "LABEL_7",
    "8": "LABEL_8", "9": "LABEL_9", "10": "LABEL_10", "11": "LABEL_11",
    "12": "LABEL_12"
  },
  "initializer_range": 0.02,
  "label2id": {
    "LABEL_0": 0, "LABEL_1": 1, "LABEL_2": 2, "LABEL_3": 3, "LABEL_4": 4,
    "LABEL_5": 5, "LABEL_6": 6, "LABEL_7": 7, "LABEL_8": 8, "LABEL_9": 9,
    "LABEL_10": 10, "LABEL_11": 11, "LABEL_12": 12
  },
  "max_position_embeddings": 512,
  "model_type": "distilbert",
  "n_heads": 12,
  "n_layers": 6,
  "pad_token_id": 0,
  "problem_type": "single_label_classification",
  "qa_dropout": 0.1,
  "seq_classif_dropout": 0.2,
  "sinusoidal_pos_embds": false,
  "tie_weights_": true,
  "transformers_version": "4.57.1",
  "vocab_size": 30522
}
```
checkpoint-275/model.safetensors
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:bf565fcda041fac56c09c0e315084bcb47682925c0c57096a43a29342e94bdce
size 267866404
```
checkpoint-275/optimizer.pt
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:0f7b1d1db774ff8d88a88ffbaf3ffca6ad7a3a952031832762c71d415b791e69
size 535796811
```
checkpoint-275/rng_state.pth
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:85a3f5aa2ce3b77be772ceec4ad0d0619c7aef5028d836b01431f1a826218479
size 14645
```
checkpoint-275/scheduler.pt
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:e29603040e49cd81de77c77d5b6453279762d8f6af23c4e143b794082920649e
size 1465
```
checkpoint-275/special_tokens_map.json
ADDED
```json
{
  "cls_token": "[CLS]",
  "mask_token": "[MASK]",
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "unk_token": "[UNK]"
}
```
checkpoint-275/tokenizer.json
ADDED
The diff for this file is too large to render.
checkpoint-275/tokenizer_config.json
ADDED
```json
{
  "added_tokens_decoder": {
    "0":   {"content": "[PAD]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "100": {"content": "[UNK]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "101": {"content": "[CLS]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "102": {"content": "[SEP]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "103": {"content": "[MASK]", "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true}
  },
  "clean_up_tokenization_spaces": false,
  "cls_token": "[CLS]",
  "do_lower_case": true,
  "extra_special_tokens": {},
  "mask_token": "[MASK]",
  "model_max_length": 512,
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "strip_accents": null,
  "tokenize_chinese_chars": true,
  "tokenizer_class": "DistilBertTokenizer",
  "unk_token": "[UNK]"
}
```
checkpoint-275/trainer_state.json
ADDED
```json
{
  "best_global_step": 275,
  "best_metric": 1.0,
  "best_model_checkpoint": "intent_classifier_v2/checkpoint-275",
  "epoch": 5.0,
  "eval_steps": 500,
  "global_step": 275,
  "is_hyper_param_search": false,
  "is_local_process_zero": true,
  "is_world_process_zero": true,
  "log_history": [
    {"epoch": 0.9090909090909091, "grad_norm": 1.5899983644485474, "learning_rate": 1.9600000000000003e-06, "loss": 2.5642, "step": 50},
    {"epoch": 1.0, "eval_accuracy": 0.1881720430107527, "eval_f1": 0.10086428092694365, "eval_loss": 2.5428154468536377, "eval_precision": 0.0936228251427814, "eval_recall": 0.1881720430107527, "eval_runtime": 0.1839, "eval_samples_per_second": 1011.503, "eval_steps_per_second": 65.258, "step": 55},
    {"epoch": 1.8181818181818183, "grad_norm": 3.272284984588623, "learning_rate": 3.96e-06, "loss": 2.5118, "step": 100},
    {"epoch": 2.0, "eval_accuracy": 0.26344086021505375, "eval_f1": 0.17357707395658062, "eval_loss": 2.3705923557281494, "eval_precision": 0.23162059134445975, "eval_recall": 0.26344086021505375, "eval_runtime": 0.202, "eval_samples_per_second": 920.577, "eval_steps_per_second": 59.392, "step": 110},
    {"epoch": 2.7272727272727275, "grad_norm": 4.249340057373047, "learning_rate": 5.9600000000000005e-06, "loss": 2.303, "step": 150},
    {"epoch": 3.0, "eval_accuracy": 0.8064516129032258, "eval_f1": 0.7699332427341867, "eval_loss": 1.9005680084228516, "eval_precision": 0.8112076095947064, "eval_recall": 0.8064516129032258, "eval_runtime": 0.18, "eval_samples_per_second": 1033.189, "eval_steps_per_second": 66.657, "step": 165},
    {"epoch": 3.6363636363636362, "grad_norm": 4.2065110206604, "learning_rate": 7.960000000000002e-06, "loss": 1.8712, "step": 200},
    {"epoch": 4.0, "eval_accuracy": 0.9838709677419355, "eval_f1": 0.9837687666378767, "eval_loss": 1.1924996376037598, "eval_precision": 0.984740928604229, "eval_recall": 0.9838709677419355, "eval_runtime": 0.1796, "eval_samples_per_second": 1035.622, "eval_steps_per_second": 66.814, "step": 220},
    {"epoch": 4.545454545454545, "grad_norm": 3.407564163208008, "learning_rate": 9.960000000000001e-06, "loss": 1.2426, "step": 250},
    {"epoch": 5.0, "eval_accuracy": 1.0, "eval_f1": 1.0, "eval_loss": 0.5706155300140381, "eval_precision": 1.0, "eval_recall": 1.0, "eval_runtime": 0.1958, "eval_samples_per_second": 950.01, "eval_steps_per_second": 61.291, "step": 275}
  ],
  "logging_steps": 50,
  "max_steps": 275,
  "num_input_tokens_seen": 0,
  "num_train_epochs": 5,
  "save_steps": 500,
  "stateful_callbacks": {
    "EarlyStoppingCallback": {
      "args": {"early_stopping_patience": 2, "early_stopping_threshold": 0.0},
      "attributes": {"early_stopping_patience_counter": 0}
    },
    "TrainerControl": {
      "args": {"should_epoch_stop": false, "should_evaluate": false, "should_log": false, "should_save": true, "should_training_stop": true},
      "attributes": {}
    }
  },
  "total_flos": 34815742144680.0,
  "train_batch_size": 16,
  "trial_name": null,
  "trial_params": null
}
```
checkpoint-275/training_args.bin
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:7161c9b09c8630acc22aba2b488c51d515d356f1c8d06aae95857631447c08a5
size 5777
```
checkpoint-275/vocab.txt
ADDED
The diff for this file is too large to render.
config.json
ADDED
```json
{
  "activation": "gelu",
  "architectures": ["DistilBertForSequenceClassification"],
  "attention_dropout": 0.1,
  "dim": 768,
  "dropout": 0.1,
  "dtype": "float32",
  "hidden_dim": 3072,
  "id2label": {
    "0": "LABEL_0", "1": "LABEL_1", "2": "LABEL_2", "3": "LABEL_3",
    "4": "LABEL_4", "5": "LABEL_5", "6": "LABEL_6", "7": "LABEL_7",
    "8": "LABEL_8", "9": "LABEL_9", "10": "LABEL_10", "11": "LABEL_11",
    "12": "LABEL_12"
  },
  "initializer_range": 0.02,
  "label2id": {
    "LABEL_0": 0, "LABEL_1": 1, "LABEL_2": 2, "LABEL_3": 3, "LABEL_4": 4,
    "LABEL_5": 5, "LABEL_6": 6, "LABEL_7": 7, "LABEL_8": 8, "LABEL_9": 9,
    "LABEL_10": 10, "LABEL_11": 11, "LABEL_12": 12
  },
  "max_position_embeddings": 512,
  "model_type": "distilbert",
  "n_heads": 12,
  "n_layers": 6,
  "pad_token_id": 0,
  "problem_type": "single_label_classification",
  "qa_dropout": 0.1,
  "seq_classif_dropout": 0.2,
  "sinusoidal_pos_embds": false,
  "tie_weights_": true,
  "transformers_version": "4.57.1",
  "vocab_size": 30522
}
```
label_encoder.pkl
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:549d2b82bcefa38e87aa8a183c4936ab261263b64c055ba903b32b819049dc6d
size 517
```
model.safetensors
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:bf565fcda041fac56c09c0e315084bcb47682925c0c57096a43a29342e94bdce
size 267866404
```
special_tokens_map.json
ADDED
```json
{
  "cls_token": "[CLS]",
  "mask_token": "[MASK]",
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "unk_token": "[UNK]"
}
```
tokenizer.json
ADDED
The diff for this file is too large to render.
tokenizer_config.json
ADDED
```json
{
  "added_tokens_decoder": {
    "0":   {"content": "[PAD]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "100": {"content": "[UNK]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "101": {"content": "[CLS]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "102": {"content": "[SEP]",  "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true},
    "103": {"content": "[MASK]", "lstrip": false, "normalized": false, "rstrip": false, "single_word": false, "special": true}
  },
  "clean_up_tokenization_spaces": false,
  "cls_token": "[CLS]",
  "do_lower_case": true,
  "extra_special_tokens": {},
  "mask_token": "[MASK]",
  "model_max_length": 512,
  "pad_token": "[PAD]",
  "sep_token": "[SEP]",
  "strip_accents": null,
  "tokenize_chinese_chars": true,
  "tokenizer_class": "DistilBertTokenizer",
  "unk_token": "[UNK]"
}
```
training_args.bin
ADDED
```
version https://git-lfs.github.com/spec/v1
oid sha256:7161c9b09c8630acc22aba2b488c51d515d356f1c8d06aae95857631447c08a5
size 5777
```
training_results.json
ADDED
```json
{
  "timestamp": "2025-10-25T19:48:08.305153",
  "model_name": "distilbert-base-uncased",
  "num_classes": 13,
  "class_names": [
    "access-employee-record", "approve-expense", "check-leave-balance",
    "confirm-training-completion", "general-query", "generate-offer",
    "provide-candidate-feedback", "request-leave", "request-training",
    "review-contract", "schedule-interview", "submit-expense",
    "update-employee-profile"
  ],
  "test_accuracy": 1.0,
  "test_weighted_f1": 1.0,
  "per_class_f1": {
    "access-employee-record": 1.0, "approve-expense": 1.0,
    "check-leave-balance": 1.0, "confirm-training-completion": 1.0,
    "general-query": 1.0, "generate-offer": 1.0,
    "provide-candidate-feedback": 1.0, "request-leave": 1.0,
    "request-training": 1.0, "review-contract": 1.0,
    "schedule-interview": 1.0, "submit-expense": 1.0,
    "update-employee-profile": 1.0
  }
}
```
vocab.txt
ADDED
The diff for this file is too large to render.