Feature Extraction
Transformers
PyTorch
Safetensors
multitask_modernbert
Generated from Trainer
custom_code
Instructions to use SociauxLing/modernbert-CGEdit-AAE_gold_final with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use SociauxLing/modernbert-CGEdit-AAE_gold_final with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="SociauxLing/modernbert-CGEdit-AAE_gold_final", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("SociauxLing/modernbert-CGEdit-AAE_gold_final", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
modernbert-CGEdit-AAE_gold_final
This model is a fine-tuned version of on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 0.9271
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 3e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 4
- total_train_batch_size: 32
- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 0.1
- num_epochs: 40
Training results
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 3.7290 | 1.0 | 104 | 0.9340 |
| 3.7026 | 2.0 | 208 | 0.9303 |
| 3.7027 | 3.0 | 312 | 0.9273 |
| 3.7066 | 4.0 | 416 | 0.9259 |
| 3.4770 | 5.0 | 520 | 0.9284 |
| 3.6642 | 6.0 | 624 | 0.9267 |
| 3.6335 | 7.0 | 728 | 0.9271 |
| 3.6158 | 8.0 | 832 | 0.9260 |
| 3.5904 | 9.0 | 936 | 0.9271 |
Framework versions
- Transformers 5.0.0
- Pytorch 2.5.1+cu121
- Tokenizers 0.22.1
- Downloads last month
- 27