End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -1,4 +1,5 @@
 ---
 license: mit
 base_model: FacebookAI/xlm-roberta-large
 tags:
@@ -9,46 +10,48 @@ model-index:
 - name: roberta-large-ner-qlorafinetune
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# roberta-large-ner-qlorafinetune
-This model is a fine-tuned version of [FacebookAI/xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) on the conll2002 dataset.
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- learning_rate: 0.0004
-- train_batch_size: 32
-- eval_batch_size: 32
-- seed: 42
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
-- training_steps: 1820
-### Training results
-### Framework versions
-- Transformers 4.31.0
-- Pytorch 2.5.1
-- Datasets 3.1.0
-- Tokenizers 0.13.3

 ---
+library_name: peft
 license: mit
 base_model: FacebookAI/xlm-roberta-large
 tags:
 - name: roberta-large-ner-qlorafinetune
   results: []
 ---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# roberta-large-ner-qlorafinetune
+This model is a fine-tuned version of [FacebookAI/xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) on the conll2002 dataset.
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0004
+- train_batch_size: 32
+- eval_batch_size: 32
+- seed: 42
+- optimizer: Use paged_adamw_8bit with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- training_steps: 1820
+- mixed_precision_training: Native AMP
+### Training results
+### Framework versions
+- PEFT 0.14.0
+- Transformers 4.49.0
+- Pytorch 2.6.0+cu118
+- Datasets 3.2.0
+- Tokenizers 0.21.0

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2cce779ee8e23f5a638ec3fef6ed3968eadfac9dae4d59e0e0ce04538f63eca5
 size 453064692

 version https://git-lfs.github.com/spec/v1
+oid sha256:eedd1dcc71efa9192f2a4dfe0c616b275584ad485f8901548a02f84e847b71fe
 size 453064692

runs/Mar17_15-08-59_DESKTOP-P79TL96/events.out.tfevents.1742242301.DESKTOP-P79TL96.2284.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fe6b5f9204c8ef86fc26f4b300a31664a900b8eae41a30b9af6360746213a030
-size 24991

 version https://git-lfs.github.com/spec/v1
+oid sha256:4b9f9227f10596cbd152d90fddf61ef42c7d2239d16daa66dd0dcc3474a27780
+size 25556