Upload MBartForConditionalGeneration

Files changed (3) hide show

README.md CHANGED Viewed

@@ -1,12 +1,13 @@
 ---
-library_name: transformers
 license: mit
 datasets:
 - skypro1111/ubertext-2-news-verbalized
-language:
-- uk
 widget:
- - text: "Очікувалось, що цей застосунок буде запущено о 11 ранку 22.08.2025, але розробники затягнули святкування і запуск був відкладений на 2 тижні."
 ---
 # Model Card for mbart-large-50-verbalization
@@ -18,11 +19,11 @@ widget:
 This model is based on the [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) architecture, renowned for its effectiveness in translation and text generation tasks across numerous languages.
 ## Training Data
-The model was fine-tuned on a subset of 457,610 sentences from the Ubertext dataset, focusing on news content. The verbalized equivalents were created using Google Gemini Pro, providing a rich basis for learning text transformation tasks.
 Dataset [skypro1111/ubertext-2-news-verbalized](https://huggingface.co/datasets/skypro1111/ubertext-2-news-verbalized)
 ## Training Procedure
-The model underwent 410,000 training steps.
 ```python
 from transformers import MBartForConditionalGeneration, AutoTokenizer, Trainer, TrainingArguments

 ---
+language:
+- uk
 license: mit
+library_name: transformers
 datasets:
 - skypro1111/ubertext-2-news-verbalized
 widget:
+- text: Очікувалось, що цей застосунок буде запущено о 11 ранку 22.08.2025, але розробники
+    затягнули святкування і запуск був відкладений на 2 тижні.
 ---
 # Model Card for mbart-large-50-verbalization
 This model is based on the [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) architecture, renowned for its effectiveness in translation and text generation tasks across numerous languages.
 ## Training Data
+The model was fine-tuned on a subset of 96,780 sentences from the Ubertext dataset, focusing on news content. The verbalized equivalents were created using Google Gemini Pro, providing a rich basis for learning text transformation tasks.
 Dataset [skypro1111/ubertext-2-news-verbalized](https://huggingface.co/datasets/skypro1111/ubertext-2-news-verbalized)
 ## Training Procedure
+The model underwent 70,000 training steps, which is almost 2 epochs, with further training the results degraded.
 ```python
 from transformers import MBartForConditionalGeneration, AutoTokenizer, Trainer, TrainingArguments

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "./tmp/checkpoint-70000",
   "_num_labels": 3,
   "activation_dropout": 0.0,
   "activation_function": "gelu",

 {
+  "_name_or_path": "./results/facebook/mbart-large-50-verbalization/checkpoint-410000",
   "_num_labels": 3,
   "activation_dropout": 0.0,
   "activation_function": "gelu",

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:22512659311a9a194a00c2ec2d33a13b34e09de219e267cd34adbb23842b9664
 size 2444578688

 version https://git-lfs.github.com/spec/v1
+oid sha256:4c5509ecd391d8d5f39318b468ef05878e270df2366c9e66f296890495c95720
 size 2444578688