MartinRodrigo committed on Oct 24, 2025

Commit

2fdb9ef

verified

1 Parent(s): 5b66e28

Upload folder using huggingface_hub

Files changed (33)

.gitattributes +1 -0
README.md +163 -0
checkpoint-1000/config.json +24 -0
checkpoint-1000/model.safetensors +3 -0
checkpoint-1000/optimizer.pt +3 -0
checkpoint-1000/rng_state.pth +3 -0
checkpoint-1000/scheduler.pt +3 -0
checkpoint-1000/special_tokens_map.json +7 -0
checkpoint-1000/tokenizer.json +0 -0
checkpoint-1000/tokenizer_config.json +56 -0
checkpoint-1000/trainer_state.json +133 -0
checkpoint-1000/training_args.bin +3 -0
checkpoint-1000/vocab.txt +0 -0
checkpoint-1500/config.json +24 -0
checkpoint-1500/model.safetensors +3 -0
checkpoint-1500/optimizer.pt +3 -0
checkpoint-1500/rng_state.pth +3 -0
checkpoint-1500/scheduler.pt +3 -0
checkpoint-1500/special_tokens_map.json +7 -0
checkpoint-1500/tokenizer.json +0 -0
checkpoint-1500/tokenizer_config.json +56 -0
checkpoint-1500/trainer_state.json +178 -0
checkpoint-1500/training_args.bin +3 -0
checkpoint-1500/vocab.txt +0 -0
config.json +24 -0
model.safetensors +3 -0
model_info.json +52 -0
special_tokens_map.json +7 -0
tokenizer.json +0 -0
tokenizer_config.json +56 -0
training_args.bin +3 -0
training_history.png +3 -0
vocab.txt +0 -0

.gitattributes CHANGED

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

+training_history.png filter=lfs diff=lfs merge=lfs -text

README.md ADDED

	@@ -0,0 +1,163 @@

+---
+language: en
+license: apache-2.0
+tags:
+- sentiment-analysis
+- transformers
+- unknown
+- text-classification
+datasets:
+- unknown
+metrics:
+- accuracy
+- f1
+- precision
+- recall
+model-index:
+- name: unknown-sentiment
+  results:
+  - task:
+      type: text-classification
+      name: Sentiment Analysis
+    dataset:
+      name: UNKNOWN
+      type: unknown
+    metrics:
+    - type: accuracy
+      value: 0.0000
+      name: Test Accuracy
+    - type: f1
+      value: 0.0000
+      name: F1 Score
+    - type: precision
+      value: 0.0000
+      name: Precision
+    - type: recall
+      value: 0.0000
+      name: Recall
+---
+# UNKNOWN Fine-tuned for Sentiment Analysis
+## 📊 Model Description
+This model is a fine-tuned version of `unknown` for sentiment analysis on the UNKNOWN dataset.
+**Model Architecture:** unknown
+**Task:** Binary Sentiment Classification (Positive/Negative)
+**Language:** English
+**Training Date:** N/A
+## 🎯 Performance Metrics
+| Metric | Score |
+|--------|-------|
+| **Accuracy** | 0.0000 |
+| **F1 Score** | 0.0000 |
+| **Precision** | 0.0000 |
+| **Recall** | 0.0000 |
+| **Loss** | 0.0000 |
+## 🔧 Training Details
+### Hyperparameters
+```json
+{}
+```
+### Dataset
+- **Training samples:** N/A
+- **Validation samples:** N/A
+- **Test samples:** N/A
+## 🚀 Usage
+### With Transformers Pipeline
+```python
+from transformers import pipeline
+# Load the model
+classifier = pipeline("sentiment-analysis", model="YOUR_USERNAME/YOUR_MODEL_NAME")
+# Predict
+result = classifier("I love this movie!")
+print(result)
+# [{'label': 'POSITIVE', 'score': 0.9998}]
+```
+### Manual Usage
+```python
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import torch
+# Load model and tokenizer
+model_name = "YOUR_USERNAME/YOUR_MODEL_NAME"
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForSequenceClassification.from_pretrained(model_name)
+# Prepare input
+text = "This is an amazing product!"
+inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
+# Predict
+with torch.no_grad():
+    outputs = model(**inputs)
+    predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
+# Get result
+label_id = torch.argmax(predictions).item()
+score = predictions[0][label_id].item()
+labels = ["NEGATIVE", "POSITIVE"]
+print(f"Label: {labels[label_id]}, Score: {score:.4f}")
+```
+## 📈 Training Curves
+Training history visualization is available in the model files.
+## 🏷️ Label Mapping
+```
+0: NEGATIVE
+1: POSITIVE
+```
+## ⚙️ Model Configuration
+```json
+{}
+```
+## 📝 Citation
+If you use this model, please cite:
+```bibtex
+@misc{sentiment-model-unknown,
+  author = {Your Name},
+  title = {unknown Fine-tuned for Sentiment Analysis},
+  year = {2025},
+  publisher = {Hugging Face},
+  howpublished = {\url{https://huggingface.co/YOUR_USERNAME/YOUR_MODEL_NAME}}
+}
+```
+## 🤝 Contact
+For questions or feedback, please open an issue in the repository.
+## 📄 License
+Apache 2.0
+## 🔗 Related Models
+- [unknown](https://huggingface.co/unknown)
+---
+**Generated with MLflow tracking** 🚀
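
Review note: the `config.json` files in this commit carry no `id2label` entry, so a pipeline loaded from this repository would likely report generic `LABEL_0`/`LABEL_1` names rather than the `POSITIVE` shown in the README example. A minimal sketch of a post-hoc rename, assuming the README's label mapping (0: NEGATIVE, 1: POSITIVE) and the generic `LABEL_<id>` naming; `rename_labels` is a hypothetical helper, not part of the commit:

```python
# Label names taken from the README's "Label Mapping" section.
ID2LABEL = {0: "NEGATIVE", 1: "POSITIVE"}

def rename_labels(results):
    """Map generic 'LABEL_<id>' names in pipeline-style output to readable names."""
    renamed = []
    for item in results:
        label = item["label"]
        if label.startswith("LABEL_"):
            # 'LABEL_1' -> 1 -> 'POSITIVE'
            label = ID2LABEL[int(label.split("_", 1)[1])]
        renamed.append({"label": label, "score": item["score"]})
    return renamed

# Illustrative input in the shape a text-classification pipeline returns.
print(rename_labels([{"label": "LABEL_1", "score": 0.98}]))
```

Storing the mapping in `config.json` itself would avoid the rename step; the sketch only covers the files as committed.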

checkpoint-1000/config.json ADDED

	@@ -0,0 +1,24 @@

+{
+  "activation": "gelu",
+  "architectures": [
+    "DistilBertForSequenceClassification"
+  ],
+  "attention_dropout": 0.1,
+  "dim": 768,
+  "dropout": 0.1,
+  "dtype": "float32",
+  "hidden_dim": 3072,
+  "initializer_range": 0.02,
+  "max_position_embeddings": 512,
+  "model_type": "distilbert",
+  "n_heads": 12,
+  "n_layers": 6,
+  "pad_token_id": 0,
+  "problem_type": "single_label_classification",
+  "qa_dropout": 0.1,
+  "seq_classif_dropout": 0.2,
+  "sinusoidal_pos_embds": false,
+  "tie_weights_": true,
+  "transformers_version": "4.57.1",
+  "vocab_size": 30522
+}
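
Review note: the config above is enough to estimate the parameter count and compare it with the `model.safetensors` size listed in this commit. The sketch below assumes the standard DistilBERT layout (word and position embeddings with a LayerNorm, six blocks with four attention projections and a two-layer feed-forward network, then a `pre_classifier` and `classifier` head) and two labels, as stated in `model_info.json`; the helper names are illustrative:

```python
# Values copied from config.json in this commit.
CFG = {
    "vocab_size": 30522,
    "max_position_embeddings": 512,
    "dim": 768,
    "hidden_dim": 3072,
    "n_layers": 6,
}
NUM_LABELS = 2  # "num_labels" in model_info.json

def linear(n_in, n_out):
    """Weights plus bias of a dense layer."""
    return n_in * n_out + n_out

def layer_norm(d):
    """Scale plus shift vectors."""
    return 2 * d

def count_parameters(cfg, num_labels):
    d, h = cfg["dim"], cfg["hidden_dim"]
    embeddings = cfg["vocab_size"] * d + cfg["max_position_embeddings"] * d + layer_norm(d)
    # Per block: q, k, v, out projections, LayerNorm, two FFN layers, LayerNorm.
    block = 4 * linear(d, d) + layer_norm(d) + linear(d, h) + linear(h, d) + layer_norm(d)
    head = linear(d, d) + linear(d, num_labels)
    return embeddings + cfg["n_layers"] * block + head

params = count_parameters(CFG, NUM_LABELS)
print(params)      # parameter count
print(params * 4)  # bytes at float32, before the safetensors header
```

Under these assumptions the float32 payload comes to 267,820,040 bytes, about 12.5 kB below the 267,832,560-byte pointer size, which is plausible for the file header. The `optimizer.pt` size of roughly twice that would be consistent with an optimizer keeping two moment tensors per parameter, though the commit does not state which optimizer was used.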

checkpoint-1000/model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3964c1d4be890e85123c158baa9cac253288570c2c5083634890be87c5bf04c1
+size 267832560

checkpoint-1000/optimizer.pt ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6adcb6bf2c8c9793579925b306eb5cc34fb6c11b37f65f16b410924a3f49a942
+size 535724875

checkpoint-1000/rng_state.pth ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9f1e0d31acc437fbbd16411d0d11d500c5f4dbbc7561671dab7dbf23eb0f2c43
+size 14455

checkpoint-1000/scheduler.pt ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:130cffffc7979a0d2b8ed9eb9e4e3acd2a313e096a319925fba2ed8345492f55
+size 1465

checkpoint-1000/special_tokens_map.json ADDED

	@@ -0,0 +1,7 @@

+{
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}

checkpoint-1000/tokenizer.json ADDED

The diff for this file is too large to render. See raw diff

checkpoint-1000/tokenizer_config.json ADDED

	@@ -0,0 +1,56 @@

+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "100": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "101": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "102": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "103": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "clean_up_tokenization_spaces": false,
+  "cls_token": "[CLS]",
+  "do_lower_case": true,
+  "extra_special_tokens": {},
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "DistilBertTokenizer",
+  "unk_token": "[UNK]"
+}

checkpoint-1000/trainer_state.json ADDED

	@@ -0,0 +1,133 @@

+{
+  "best_global_step": 1000,
+  "best_metric": 0.877,
+  "best_model_checkpoint": "./trained_model/checkpoint-1000",
+  "epoch": 2.0,
+  "eval_steps": 500,
+  "global_step": 1000,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 0.2,
+      "grad_norm": 19.170202255249023,
+      "learning_rate": 1.8680000000000004e-05,
+      "loss": 0.4772,
+      "step": 100
+    },
+    {
+      "epoch": 0.4,
+      "grad_norm": 18.27229881286621,
+      "learning_rate": 1.7346666666666668e-05,
+      "loss": 0.3614,
+      "step": 200
+    },
+    {
+      "epoch": 0.6,
+      "grad_norm": 3.4699323177337646,
+      "learning_rate": 1.6013333333333335e-05,
+      "loss": 0.3736,
+      "step": 300
+    },
+    {
+      "epoch": 0.8,
+      "grad_norm": 17.60018539428711,
+      "learning_rate": 1.4680000000000002e-05,
+      "loss": 0.3418,
+      "step": 400
+    },
+    {
+      "epoch": 1.0,
+      "grad_norm": 27.09922981262207,
+      "learning_rate": 1.3346666666666667e-05,
+      "loss": 0.3215,
+      "step": 500
+    },
+    {
+      "epoch": 1.0,
+      "eval_accuracy": 0.82,
+      "eval_f1": 0.8161193592140474,
+      "eval_loss": 0.5520513653755188,
+      "eval_runtime": 50.3487,
+      "eval_samples_per_second": 19.861,
+      "eval_steps_per_second": 1.251,
+      "step": 500
+    },
+    {
+      "epoch": 1.2,
+      "grad_norm": 29.522594451904297,
+      "learning_rate": 1.2013333333333334e-05,
+      "loss": 0.2891,
+      "step": 600
+    },
+    {
+      "epoch": 1.4,
+      "grad_norm": 50.437538146972656,
+      "learning_rate": 1.0680000000000001e-05,
+      "loss": 0.206,
+      "step": 700
+    },
+    {
+      "epoch": 1.6,
+      "grad_norm": 0.3681044578552246,
+      "learning_rate": 9.346666666666666e-06,
+      "loss": 0.2191,
+      "step": 800
+    },
+    {
+      "epoch": 1.8,
+      "grad_norm": 4.929210662841797,
+      "learning_rate": 8.013333333333333e-06,
+      "loss": 0.1967,
+      "step": 900
+    },
+    {
+      "epoch": 2.0,
+      "grad_norm": 0.25087302923202515,
+      "learning_rate": 6.680000000000001e-06,
+      "loss": 0.1687,
+      "step": 1000
+    },
+    {
+      "epoch": 2.0,
+      "eval_accuracy": 0.877,
+      "eval_f1": 0.8765589700871793,
+      "eval_loss": 0.429619163274765,
+      "eval_runtime": 49.6341,
+      "eval_samples_per_second": 20.147,
+      "eval_steps_per_second": 1.269,
+      "step": 1000
+    }
+  ],
+  "logging_steps": 100,
+  "max_steps": 1500,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 3,
+  "save_steps": 500,
+  "stateful_callbacks": {
+    "EarlyStoppingCallback": {
+      "args": {
+        "early_stopping_patience": 2,
+        "early_stopping_threshold": 0.0
+      },
+      "attributes": {
+        "early_stopping_patience_counter": 0
+      }
+    },
+    "TrainerControl": {
+      "args": {
+        "should_epoch_stop": false,
+        "should_evaluate": false,
+        "should_log": false,
+        "should_save": true,
+        "should_training_stop": false
+      },
+      "attributes": {}
+    }
+  },
+  "total_flos": 1059739189248000.0,
+  "train_batch_size": 8,
+  "trial_name": null,
+  "trial_params": null
+}
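
Review note: the logged `learning_rate` values are consistent with a linear decay from the base rate of 2e-05 (from `model_info.json`) to zero over `max_steps` = 1500 with no warmup, where each log entry reports the rate after one fewer completed scheduler step than the logged step. That reading is inferred from the numbers, not stated in the commit. A small check under that assumption:

```python
BASE_LR = 2e-05   # "learning_rate" in model_info.json
MAX_STEPS = 1500  # "max_steps" in trainer_state.json

def linear_lr(completed_steps):
    """Linear decay to zero with no warmup (assumed schedule)."""
    return BASE_LR * (MAX_STEPS - completed_steps) / MAX_STEPS

# (logged step, logged learning_rate) pairs copied from log_history.
logged = [
    (100, 1.8680000000000004e-05),
    (500, 1.3346666666666667e-05),
    (1000, 6.680000000000001e-06),
]

for step, lr in logged:
    # The logged value matches the schedule evaluated at step - 1.
    print(step, linear_lr(step - 1), lr)
```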

checkpoint-1000/training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bdc171376ed6a59d58c310ea00b74472cce43086596094b233f1856d9bc55a1b
+size 5841

checkpoint-1000/vocab.txt ADDED

The diff for this file is too large to render. See raw diff

checkpoint-1500/config.json ADDED

	@@ -0,0 +1,24 @@

+{
+  "activation": "gelu",
+  "architectures": [
+    "DistilBertForSequenceClassification"
+  ],
+  "attention_dropout": 0.1,
+  "dim": 768,
+  "dropout": 0.1,
+  "dtype": "float32",
+  "hidden_dim": 3072,
+  "initializer_range": 0.02,
+  "max_position_embeddings": 512,
+  "model_type": "distilbert",
+  "n_heads": 12,
+  "n_layers": 6,
+  "pad_token_id": 0,
+  "problem_type": "single_label_classification",
+  "qa_dropout": 0.1,
+  "seq_classif_dropout": 0.2,
+  "sinusoidal_pos_embds": false,
+  "tie_weights_": true,
+  "transformers_version": "4.57.1",
+  "vocab_size": 30522
+}

checkpoint-1500/model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c34c0866fe1f54735459b6f599366d927e8ead1fe24f0a2315ff493f5f7dbe3a
+size 267832560

checkpoint-1500/optimizer.pt ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:dd954bbf1225cce904630a28e0278e2b2bbb863b7ee50158ad84daee23995e8e
+size 535724875

checkpoint-1500/rng_state.pth ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:391d01d3aeb4a35151817d446e4ba0b9c8a04084ae1b1b66eda188a30729da0a
+size 14455

checkpoint-1500/scheduler.pt ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:748cd50395dac6ec1a3d9e7b31c2867f4867e0ff24162e6ca580fe4a3fabef19
+size 1465

checkpoint-1500/special_tokens_map.json ADDED

	@@ -0,0 +1,7 @@

+{
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}

checkpoint-1500/tokenizer.json ADDED

The diff for this file is too large to render. See raw diff

checkpoint-1500/tokenizer_config.json ADDED

	@@ -0,0 +1,56 @@

+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "100": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "101": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "102": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "103": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "clean_up_tokenization_spaces": false,
+  "cls_token": "[CLS]",
+  "do_lower_case": true,
+  "extra_special_tokens": {},
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "DistilBertTokenizer",
+  "unk_token": "[UNK]"
+}

checkpoint-1500/trainer_state.json ADDED

	@@ -0,0 +1,178 @@

+{
+  "best_global_step": 1500,
+  "best_metric": 0.898,
+  "best_model_checkpoint": "./trained_model/checkpoint-1500",
+  "epoch": 3.0,
+  "eval_steps": 500,
+  "global_step": 1500,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 0.2,
+      "grad_norm": 19.170202255249023,
+      "learning_rate": 1.8680000000000004e-05,
+      "loss": 0.4772,
+      "step": 100
+    },
+    {
+      "epoch": 0.4,
+      "grad_norm": 18.27229881286621,
+      "learning_rate": 1.7346666666666668e-05,
+      "loss": 0.3614,
+      "step": 200
+    },
+    {
+      "epoch": 0.6,
+      "grad_norm": 3.4699323177337646,
+      "learning_rate": 1.6013333333333335e-05,
+      "loss": 0.3736,
+      "step": 300
+    },
+    {
+      "epoch": 0.8,
+      "grad_norm": 17.60018539428711,
+      "learning_rate": 1.4680000000000002e-05,
+      "loss": 0.3418,
+      "step": 400
+    },
+    {
+      "epoch": 1.0,
+      "grad_norm": 27.09922981262207,
+      "learning_rate": 1.3346666666666667e-05,
+      "loss": 0.3215,
+      "step": 500
+    },
+    {
+      "epoch": 1.0,
+      "eval_accuracy": 0.82,
+      "eval_f1": 0.8161193592140474,
+      "eval_loss": 0.5520513653755188,
+      "eval_runtime": 50.3487,
+      "eval_samples_per_second": 19.861,
+      "eval_steps_per_second": 1.251,
+      "step": 500
+    },
+    {
+      "epoch": 1.2,
+      "grad_norm": 29.522594451904297,
+      "learning_rate": 1.2013333333333334e-05,
+      "loss": 0.2891,
+      "step": 600
+    },
+    {
+      "epoch": 1.4,
+      "grad_norm": 50.437538146972656,
+      "learning_rate": 1.0680000000000001e-05,
+      "loss": 0.206,
+      "step": 700
+    },
+    {
+      "epoch": 1.6,
+      "grad_norm": 0.3681044578552246,
+      "learning_rate": 9.346666666666666e-06,
+      "loss": 0.2191,
+      "step": 800
+    },
+    {
+      "epoch": 1.8,
+      "grad_norm": 4.929210662841797,
+      "learning_rate": 8.013333333333333e-06,
+      "loss": 0.1967,
+      "step": 900
+    },
+    {
+      "epoch": 2.0,
+      "grad_norm": 0.25087302923202515,
+      "learning_rate": 6.680000000000001e-06,
+      "loss": 0.1687,
+      "step": 1000
+    },
+    {
+      "epoch": 2.0,
+      "eval_accuracy": 0.877,
+      "eval_f1": 0.8765589700871793,
+      "eval_loss": 0.429619163274765,
+      "eval_runtime": 49.6341,
+      "eval_samples_per_second": 20.147,
+      "eval_steps_per_second": 1.269,
+      "step": 1000
+    },
+    {
+      "epoch": 2.2,
+      "grad_norm": 0.0790085569024086,
+      "learning_rate": 5.346666666666667e-06,
+      "loss": 0.0971,
+      "step": 1100
+    },
+    {
+      "epoch": 2.4,
+      "grad_norm": 11.744353294372559,
+      "learning_rate": 4.013333333333334e-06,
+      "loss": 0.0952,
+      "step": 1200
+    },
+    {
+      "epoch": 2.6,
+      "grad_norm": 0.05501548573374748,
+      "learning_rate": 2.68e-06,
+      "loss": 0.1343,
+      "step": 1300
+    },
+    {
+      "epoch": 2.8,
+      "grad_norm": 0.061339035630226135,
+      "learning_rate": 1.3466666666666668e-06,
+      "loss": 0.0971,
+      "step": 1400
+    },
+    {
+      "epoch": 3.0,
+      "grad_norm": 0.04910397157073021,
+      "learning_rate": 1.3333333333333334e-08,
+      "loss": 0.1523,
+      "step": 1500
+    },
+    {
+      "epoch": 3.0,
+      "eval_accuracy": 0.898,
+      "eval_f1": 0.897973886328725,
+      "eval_loss": 0.46152323484420776,
+      "eval_runtime": 33.7071,
+      "eval_samples_per_second": 29.667,
+      "eval_steps_per_second": 1.869,
+      "step": 1500
+    }
+  ],
+  "logging_steps": 100,
+  "max_steps": 1500,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 3,
+  "save_steps": 500,
+  "stateful_callbacks": {
+    "EarlyStoppingCallback": {
+      "args": {
+        "early_stopping_patience": 2,
+        "early_stopping_threshold": 0.0
+      },
+      "attributes": {
+        "early_stopping_patience_counter": 0
+      }
+    },
+    "TrainerControl": {
+      "args": {
+        "should_epoch_stop": false,
+        "should_evaluate": false,
+        "should_log": false,
+        "should_save": true,
+        "should_training_stop": true
+      },
+      "attributes": {}
+    }
+  },
+  "total_flos": 1589608783872000.0,
+  "train_batch_size": 8,
+  "trial_name": null,
+  "trial_params": null
+}
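
Review note: the step counts in this state file line up with the data sizes recorded in `model_info.json`. A minimal sketch of the arithmetic, assuming a single device and no gradient accumulation (neither is stated in the commit); `planned_steps` is an illustrative helper:

```python
import math

def planned_steps(train_size, batch_size, epochs, grad_accum=1):
    """Return (optimizer steps per epoch, total optimizer steps)."""
    steps_per_epoch = math.ceil(train_size / (batch_size * grad_accum))
    return steps_per_epoch, steps_per_epoch * epochs

# train_size=4000 and num_train_epochs=3 from model_info.json,
# train_batch_size=8 from trainer_state.json.
print(planned_steps(4000, 8, 3))
```

With 500 steps per epoch, `eval_steps` and `save_steps` of 500 coincide with the `"epoch"` evaluation and save strategy recorded in `model_info.json`, and `max_steps` of 1500 corresponds to three full epochs.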

checkpoint-1500/training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bdc171376ed6a59d58c310ea00b74472cce43086596094b233f1856d9bc55a1b
+size 5841

checkpoint-1500/vocab.txt ADDED

The diff for this file is too large to render. See raw diff

config.json ADDED

	@@ -0,0 +1,24 @@

+{
+  "activation": "gelu",
+  "architectures": [
+    "DistilBertForSequenceClassification"
+  ],
+  "attention_dropout": 0.1,
+  "dim": 768,
+  "dropout": 0.1,
+  "dtype": "float32",
+  "hidden_dim": 3072,
+  "initializer_range": 0.02,
+  "max_position_embeddings": 512,
+  "model_type": "distilbert",
+  "n_heads": 12,
+  "n_layers": 6,
+  "pad_token_id": 0,
+  "problem_type": "single_label_classification",
+  "qa_dropout": 0.1,
+  "seq_classif_dropout": 0.2,
+  "sinusoidal_pos_embds": false,
+  "tie_weights_": true,
+  "transformers_version": "4.57.1",
+  "vocab_size": 30522
+}

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c34c0866fe1f54735459b6f599366d927e8ead1fe24f0a2315ff493f5f7dbe3a
+size 267832560
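
Review note: this pointer carries the same `oid` as `checkpoint-1500/model.safetensors`, so the root weights appear to be the step-1500 checkpoint. A downloaded file can be checked against a Git LFS pointer by comparing its byte size and SHA-256 digest. A minimal sketch using only the standard library; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names:

```python
import hashlib

def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}

def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]

POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:c34c0866fe1f54735459b6f599366d927e8ead1fe24f0a2315ff493f5f7dbe3a
size 267832560
"""
pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("model.safetensors", pointer)` on a local copy would then confirm the download; the path is a placeholder for wherever the file was saved.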

model_info.json ADDED

	@@ -0,0 +1,52 @@

+{
+  "model_config": {
+    "model": {
+      "name": "distilbert-base-uncased",
+      "num_labels": 2,
+      "max_length": 512
+    },
+    "training": {
+      "output_dir": "./trained_model",
+      "learning_rate": 2e-05,
+      "per_device_train_batch_size": 8,
+      "per_device_eval_batch_size": 16,
+      "num_train_epochs": 3,
+      "weight_decay": 0.01,
+      "eval_strategy": "epoch",
+      "save_strategy": "epoch",
+      "logging_steps": 100,
+      "save_total_limit": 2,
+      "load_best_model_at_end": true,
+      "metric_for_best_model": "eval_accuracy",
+      "greater_is_better": true
+    },
+    "data": {
+      "dataset_name": "imdb",
+      "train_size": 4000,
+      "eval_size": 1000,
+      "test_size": 500
+    },
+    "mlflow": {
+      "enabled": true,
+      "tracking_uri": null,
+      "experiment_name": "sentiment-analysis-training",
+      "artifact_location": null,
+      "registered_model_prefix": "sentiment-model"
+    },
+    "api": {
+      "host": "0.0.0.0",
+      "port": 8000,
+      "max_batch_size": 32
+    }
+  },
+  "training_metrics": {
+    "eval_loss": 0.3413584530353546,
+    "eval_accuracy": 0.92,
+    "eval_f1": 0.919953893442623,
+    "eval_runtime": 19.0635,
+    "eval_samples_per_second": 26.228,
+    "eval_steps_per_second": 1.679,
+    "epoch": 3.0
+  },
+  "model_path": "./trained_model"
+}
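
Review note: the `training_metrics` block reports 0.92 accuracy, which differs from the 0.898 logged at step 1500 in `trainer_state.json`. Multiplying runtime by throughput suggests the two numbers come from different splits: the checkpoint evaluations cover about 1000 samples (matching `eval_size`), while `training_metrics` covers about 500 (matching `test_size`). This is an inference from the logged figures, not something the commit states. The arithmetic, with `samples_evaluated` as an illustrative helper:

```python
def samples_evaluated(runtime_s, samples_per_s):
    """Approximate number of evaluated samples from runtime and throughput."""
    return round(runtime_s * samples_per_s)

# eval_runtime and eval_samples_per_second copied from the commit.
print(samples_evaluated(50.3487, 19.861))  # trainer_state.json, step 500
print(samples_evaluated(33.7071, 29.667))  # trainer_state.json, step 1500
print(samples_evaluated(19.0635, 26.228))  # model_info.json, training_metrics
```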

special_tokens_map.json ADDED

	@@ -0,0 +1,7 @@

+{
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}

tokenizer.json ADDED

The diff for this file is too large to render. See raw diff

tokenizer_config.json ADDED

	@@ -0,0 +1,56 @@

+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "100": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "101": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "102": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "103": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "clean_up_tokenization_spaces": false,
+  "cls_token": "[CLS]",
+  "do_lower_case": true,
+  "extra_special_tokens": {},
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "DistilBertTokenizer",
+  "unk_token": "[UNK]"
+}

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:bdc171376ed6a59d58c310ea00b74472cce43086596094b233f1856d9bc55a1b
+size 5841

training_history.png ADDED

Git LFS Details

SHA256: 1ffa4fa0f0b9b6ada6f71f10d3671a8c3e17a7c1c08fb47e1141ef4e4c04198d
Pointer size: 131 Bytes
Size of remote file: 201 kB

vocab.txt ADDED

The diff for this file is too large to render. See raw diff