Update README.md
README.md CHANGED

@@ -82,7 +82,7 @@ The following hyperparameters were used during training:
 trainable params: 25,165,824 || all params: 326,801,408 || trainable%: 7.700647360735974
 > The model is loaded in 8-bit precision. To train this model you need to add additional modules inside the model such as adapters using `peft` library and freeze the model weights. Please check the examples in https://github.com/huggingface/peft for more details.
 
-- Num examples =
+- Num examples = 1_000_000
 - Num Epochs = 5
 - Instantaneous batch size per device = 6
 - Total train batch size (w. parallel, distributed & accumulation) = 144
@@ -93,7 +93,6 @@ The following hyperparameters were used during training:
 ### Framework versions
 
 - PEFT 0.4.0
-- PEFT 0.14.0
 - Transformers 4.47.0
 - Pytorch 2.5.1+cu121
 - Datasets 3.3.1
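The numbers in the training log above are internally consistent, and a quick check makes that visible. The trainable% is simply `100 * trainable / all`, and the 24× multiplier between the per-device batch size (6) and the total train batch size (144) is the product of the number of devices and the gradient-accumulation steps — the log does not state how that factor splits between the two:

```python
# Sanity-check the figures quoted in the training log above.
trainable, total = 25_165_824, 326_801_408
pct = 100 * trainable / total
print(pct)  # 7.700647360735974, matching the trainable% in the log

per_device, total_batch = 6, 144
multiplier = total_batch // per_device
print(multiplier)  # 24 = num devices x gradient accumulation steps (split not stated)
```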
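The blockquoted note in the diff describes the standard PEFT workflow for an 8-bit base model: freeze the quantized weights and train only injected adapters. A minimal sketch of that setup, assuming LoRA adapters — the function name, the `"base-model-id"` placeholder, and the LoRA hyperparameters (`r=16`, `lora_alpha=32`, `lora_dropout=0.05`) are illustrative, not taken from this card:

```python
def build_8bit_lora_model(model_id: str = "base-model-id"):
    """Load a causal LM in 8-bit, freeze its weights, and attach LoRA adapters.

    Sketch only: "base-model-id" is a placeholder, and the LoRA
    hyperparameters below are illustrative assumptions.
    """
    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

    model = AutoModelForCausalLM.from_pretrained(model_id, load_in_8bit=True)
    # Freezes the quantized base weights and prepares norms/outputs for training.
    model = prepare_model_for_kbit_training(model)

    lora = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                      task_type="CAUSAL_LM")
    model = get_peft_model(model, lora)
    # Prints a line of the form "trainable params: ... || all params: ... || trainable%: ..."
    model.print_trainable_parameters()
    return model
```

Calling `build_8bit_lora_model()` with a real checkpoint id is what produces a "trainable params / all params / trainable%" line like the one recorded at the top of the diff.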