meliksahturker committed (verified) · Commit 020ea4e · Parent(s): 18f3d78

Update README.md

Files changed (1): README.md (+2, -3)
README.md CHANGED
@@ -31,12 +31,11 @@ VBART-XLarge improves the results compared to VBART-Large albeit in small margin
 
 ### Pre-training Data
 The base model is pre-trained on [vngrs-web-corpus](https://huggingface.co/datasets/vngrs-ai/vngrs-web-corpus). It is curated by cleaning and filtering Turkish parts of [OSCAR-2201](https://huggingface.co/datasets/oscar-corpus/OSCAR-2201) and [mC4](https://huggingface.co/datasets/mc4) datasets. These datasets consist of documents of unstructured web crawl data. More information about the dataset can be found on their respective pages. Data is filtered using a set of heuristics and certain rules, explained in the appendix of our [paper](https://arxiv.org/abs/2403.01308).
-#### Hardware
-- **GPUs**: 8 x Nvidia A100-80 GB
 #### Software
 - TensorFlow
 #### Pre-training Setting
-- **Duration**: Pre-trained for 30 days.
+- **Duration**: Pre-trained for 8 days.
+- **GPUs**: 8 x Nvidia A100-80 GB
 - **Training tokens**: 84B
 - **Context Length**: 1024 for both encoder and decoder
 - **Training regime:** fp16 mixed precision
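
For context on the "fp16 mixed precision" regime named in the updated Pre-training Setting: in TensorFlow (the software listed above), mixed precision is typically switched on through a global Keras policy plus loss scaling. The sketch below is illustrative only; the toy model and optimizer are assumptions, not the actual VBART training code.

```python
import tensorflow as tf

# Run compute in float16 while keeping variables in float32,
# i.e. the usual fp16 mixed-precision regime in Keras.
tf.keras.mixed_precision.set_global_policy("mixed_float16")

# Toy stand-in network; the real VBART encoder-decoder is not shown here.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(128, activation="relu"),
    # Keep the final output in float32 for numerical stability.
    tf.keras.layers.Dense(10, dtype="float32"),
])

# Loss scaling guards against fp16 gradient underflow.
optimizer = tf.keras.mixed_precision.LossScaleOptimizer(tf.keras.optimizers.Adam())
model.compile(optimizer=optimizer, loss="sparse_categorical_crossentropy")
```

Under the `mixed_float16` policy Keras wraps optimizers in a loss-scale optimizer automatically; the explicit wrapper above just makes that step visible.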