kazzand
/

ru-longformer-base-4096

Model card Files Files and versions

kazzand commited on Jul 12, 2023

Commit

a54ae14

·

1 Parent(s): 02c87f7

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -3,9 +3,14 @@ language:
 - ru
 ---
-This is a base version of Russian Longformer model created from [blinoff/roberta-base-russian-v0](https://huggingface.co/blinoff/roberta-base-russian-v0) weights with the length of context expanded to 4096 tokens.
-The model was fine-tuned on russian books dataset but also supports English as its source model.
-For a more comprehensive overview, please refer to this Habr post, which is available in Russian.
 The model can be used as-is to produce text embeddings or it can be further fine-tuned for a specific downstream task.

 - ru
 ---
+This is a base Longformer model designed for Russian language.
+It was initialized from [blinoff/roberta-base-russian-v0](https://huggingface.co/blinoff/roberta-base-russian-v0) weights and has been modified to support a context length of up to 4096 tokens.
+We fine-tuned it on a dataset of Russian books. For a detailed information check out our post on Habr.
+Model attributes:
+* 12 attention heads
+* 12 hidden layers
+* 4096 tokens length of context
 The model can be used as-is to produce text embeddings or it can be further fine-tuned for a specific downstream task.