PhoBERT: Pre-trained language models for Vietnamese
Paper: arXiv:2003.00744
How to use PaulTran/vietnamese_essay_identify with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="PaulTran/vietnamese_essay_identify")
# Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("PaulTran/vietnamese_essay_identify")
model = AutoModelForSequenceClassification.from_pretrained("PaulTran/vietnamese_essay_identify")

This is a fine-tuned PhoBERT model that classifies Vietnamese essays by category.
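The loading snippet stops before any prediction is made. The sketch below shows one way to turn the model's raw logits into a category label. The helper names (`softmax`, `top_label`, `classify_essay`) and the `max_length=256` truncation are assumptions made for illustration and are not part of this model card. The label names are read from the checkpoint's own config, because the card does not list them.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_label(logits, id2label):
    """Return (label, probability) for the highest-scoring class."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return id2label[best], probs[best]


def classify_essay(text, model_id="PaulTran/vietnamese_essay_identify"):
    """Run the checkpoint on one string; downloads weights on first call."""
    # Imported lazily so the helpers above work without transformers installed.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()
    # max_length=256 is an assumed cap for PhoBERT-style inputs; adjust if needed.
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=256)
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    # Label names come from the checkpoint's config, not from this sketch.
    return top_label(logits, model.config.id2label)
```

Calling `classify_essay("...")` with a Vietnamese essay returns a `(label, probability)` pair. The PhoBERT authors recommend word-segmented input for the base model. This card does not say whether the fine-tuning data was segmented, so check that before relying on the predictions.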
The general architecture and experimental results of PhoBERT are described in the Findings of EMNLP 2020 paper:
@article{phobert,
title = {{PhoBERT: Pre-trained language models for Vietnamese}},
author = {Dat Quoc Nguyen and Anh Tuan Nguyen},
journal = {Findings of EMNLP},
year = {2020}
}