Instructions to use Sayan01/TinyBert4_sst2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Sayan01/TinyBert4_sst2 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="Sayan01/TinyBert4_sst2")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("Sayan01/TinyBert4_sst2") model = AutoModelForSequenceClassification.from_pretrained("Sayan01/TinyBert4_sst2") - Notebooks
- Google Colab
- Kaggle
TinyBERT paper results for SST2
#1
by Dipti - opened
Hello Sayantan Dasgupta,
I'm a student trying to reproduce the results of the task specific distillation of TinyBert for SST2. I use the general distilled model from HuggingFace (Acc 58% on SST as per the glue script). But when I perform the intermediate and prediction layer distillations, the accuracy drops to 50%.
I use the Fine tuned BERT from https://huggingface.co/JeremiahZ/bert-base-uncased-sst2 (accuracy ~92%) and the training commands as per the TinyBERT repository. Are there any differences in the training pipeline of your models like the use of a learning rate scheduler or a better BERT model to distill from?
Any advice would be very helpful.
Best,
Dipti