---
license: apache-2.0
base_model: distilbert-base-uncased
tags:
  - generated_from_keras_callback
model-index:
  - name: Mukalingam0813/intent-classifier
    results: []
---

# Mukalingam0813/intent-classifier

This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on a custom intent-classification dataset (see Training and evaluation data below). It achieves the following results at the final training epoch (a quick-start inference sketch follows the list):

- Train Loss: 0.0309
- Validation Loss: 0.1017
- Train Accuracy: 0.9726
- Epoch: 2
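
A minimal inference sketch, assuming the checkpoint on the Hub ships standard TensorFlow sequence-classification weights; the example utterance is made up, and the predicted label names depend on the `id2label` mapping stored in the model's config:

```python
import tensorflow as tf
from transformers import AutoTokenizer, TFAutoModelForSequenceClassification

model_id = "Mukalingam0813/intent-classifier"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = TFAutoModelForSequenceClassification.from_pretrained(model_id)

# Hypothetical utterance; the intent label set comes from the private dataset.
inputs = tokenizer("Can you book me a flight to Chennai?", return_tensors="tf")
logits = model(**inputs).logits
predicted_id = int(tf.argmax(logits, axis=-1)[0])
print(model.config.id2label.get(predicted_id, predicted_id))
```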

## Model description

Overview

DistilBERT was proposed in the blog post [Smaller, faster, cheaper, lighter: Introducing DistilBERT, a distilled version of BERT](https://medium.com/huggingface/distilbert-8cf3380435b5) and the paper [DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter](https://arxiv.org/abs/1910.01108). DistilBERT is a small, fast, cheap and light Transformer model trained by distilling BERT base. It has 40% fewer parameters than bert-base-uncased and runs 60% faster, while preserving over 95% of BERT's performance as measured on the GLUE language understanding benchmark.

## Training and evaluation data

We used our own dataset for classifying the intent of a given input text.

## Training hyperparameters

The following hyperparameters were used during training:

- optimizer: {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': False, 'is_legacy_optimizer': False, 'learning_rate': {'module': 'keras.optimizers.schedules', 'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 25548, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'registered_name': None}, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False} (see the reconstruction sketch after this list)
- training_precision: float32
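
The serialized optimizer above is a Keras Adam with a `PolynomialDecay` learning-rate schedule; since `power=1.0`, the rate decays linearly from 2e-05 to 0 over the 25548 training steps. A sketch of the equivalent construction in TensorFlow 2.15:

```python
import tensorflow as tf

# power=1.0 makes PolynomialDecay a straight line from 2e-5 down to 0
# over the 25548 training steps recorded in the config.
lr_schedule = tf.keras.optimizers.schedules.PolynomialDecay(
    initial_learning_rate=2e-05,
    decay_steps=25548,
    end_learning_rate=0.0,
    power=1.0,
    cycle=False,
)

# Adam with the moment/epsilon settings from the config above.
optimizer = tf.keras.optimizers.Adam(
    learning_rate=lr_schedule,
    beta_1=0.9,
    beta_2=0.999,
    epsilon=1e-08,
    amsgrad=False,
)
```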

## Training results

| Train Loss | Validation Loss | Train Accuracy | Epoch |
|:----------:|:---------------:|:--------------:|:-----:|
| 0.1562     | 0.1044          | 0.9679         | 0     |
| 0.0654     | 0.0950          | 0.9723         | 1     |
| 0.0309     | 0.1017          | 0.9726         | 2     |

## Framework versions

- Transformers 4.36.2
- TensorFlow 2.15.0
- Datasets 2.16.0
- Tokenizers 0.15.0