Update README.md
README.md CHANGED
@@ -62,7 +62,6 @@ language:
 - et
 - fi
 - hu
-
 pipeline_tag: text-generation
 tags:
 - multilingual
@@ -75,7 +74,7 @@ tags:
 datasets:
 - mc4
 - wikipedia
-thumbnail:
+thumbnail: https://github.com/sberbank-ai/mgpt
 ---
 
 # Multilingual GPT model
@@ -140,4 +139,4 @@ Languages:
 ## Details
 The model was trained with sequence length 512 using the Megatron and DeepSpeed libraries by the [SberDevices](https://sberdevices.ru/) team on a dataset of 600 GB of texts in 61 languages. The model has seen 440 billion BPE tokens in total.
 
-Total training time was around 14 days on 256 Nvidia V100 GPUs.
+Total training time was around 14 days on 256 Nvidia V100 GPUs.
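
As a back-of-envelope check on the training figures in the Details section (not stated in the source): 440 billion tokens over 14 days on 256 GPUs works out to 440e9 / (256 × 14 × 86400 s) ≈ 1.4k tokens per second per GPU.

The `pipeline_tag: text-generation` metadata means the checkpoint is driven as an ordinary causal LM. A minimal sketch follows, assuming the checkpoint is published on the Hub as `sberbank-ai/mGPT` — an id inferred from the thumbnail URL in the diff, not confirmed there:

```python
# Minimal text-generation sketch for the mGPT checkpoint described above.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "sberbank-ai/mGPT"  # assumed Hub id, inferred from the repo URL in the diff

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Plain autoregressive sampling; keep the total length within the
# 512-token training context mentioned in the Details section.
inputs = tokenizer("The history of the Estonian language", return_tensors="pt")
output_ids = model.generate(**inputs, do_sample=True, top_p=0.95, max_new_tokens=64)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```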