BERTimbau-large Edu Classifier

BERTimbau-large Edu Classifier is a BERT based model that can be used for judging the educational value of a given Portuguese text string. This model was trained on the Portuguese Educational Qwen Annotations dataset.

Details

For training, we added a classification head with a single regression output to neuralmind/bert-large-portuguese-cased. Only the classification head was trained, i.e., the rest of the model was frozen.

This repository has the source code used to train this model.

Evaluation Results

Confusion Matrix

1 2 3 4 5
1 5658 1535 32 2 0
2 1080 5664 848 7 0
3 17 1201 2460 230 0
4 0 35 590 627 2
5 0 0 1 10 1
  • Precision: 0.6368
  • Recall: 0.5482
  • F1 Macro: 0.5731
  • Accuracy: 0.7205

Usage

Here's an example of how to use the BERTimbau-large Edu Classifier:

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

tokenizer = AutoTokenizer.from_pretrained("Polygl0t/portuguese-bertimbau-large-edu-classifier")
model = AutoModelForSequenceClassification.from_pretrained("Polygl0t/portuguese-bertimbau-large-edu-classifier")
model.to(device)

text = "Coloque aqui o seu texto ..."
encoded_input  =  tokenizer(text, return_tensors="pt", padding="longest", truncation=True).to(device)

with  torch.no_grad():
    model_output  =  model(**encoded_input)
    logits  =  model_output.logits.squeeze(-1).float().cpu().numpy()

# scores are produced in the range [0, 4]. To convert to the range [1, 5], we can simply add 1 to the score.
score = [x + 1 for x in logits.tolist()][0]

print({
 "text": text,
 "score": score,
 "int_score": [int(round(max(0, min(score, 4)))) + 1 for score in logits][0],
})

Cite as πŸ€—

@misc{correa2026tucano2cool,
      title={{Tucano 2 Cool: Better Open Source LLMs for Portuguese}}, 
      author={Nicholas Kluge Corr{\^e}a and Aniket Sen and Shiza Fatimah and Sophia Falk and Lennard Landgraf and Julia Kastner and Lucie Flek},
      year={2026},
      eprint={2603.03543},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2603.03543}, 
}

Aknowlegments

Polyglot is a project funded by the Federal Ministry of Education and Research (BMBF) and the Ministry of Culture and Science of the State of North Rhine-Westphalia (MWK) as part of TRA Sustainable Futures (University of Bonn) and the Excellence Strategy of the federal and state governments.

We also gratefully acknowledge the granted access to the Marvin cluster hosted by University of Bonn along with the support provided by its High Performance Computing & Analytics Lab.

License

BERTimbau-large Edu Classifier is licensed under the Apache License, Version 2.0. For more details, see the LICENSE file.

Downloads last month
14
Safetensors
Model size
0.3B params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for Polygl0t/portuguese-bertimbau-large-edu-classifier

Finetuned
(59)
this model

Dataset used to train Polygl0t/portuguese-bertimbau-large-edu-classifier

Collection including Polygl0t/portuguese-bertimbau-large-edu-classifier

Paper for Polygl0t/portuguese-bertimbau-large-edu-classifier