| | --- |
| | tags: |
| | - model_hub_mixin |
| | - pytorch_model_hub_mixin |
| | datasets: |
| | - IgorVolochay/russian_jokes |
| | language: |
| | - ru |
| | pipeline_tag: text-generation |
| | --- |
| | |
| | First homework for the NLP course at MSU from VK. |
| | Causal Transformer with ALiBi bias and a SwiGLU MLP. |
| | It can generate jokes in Russian. Only the very small "nano" model, with just 0.51 million parameters and a vocabulary size of 1024, was implemented because of limited computing resources. |
| |
|
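To make the two architectural terms above concrete, here is a minimal, framework-free sketch of a causal ALiBi bias and a SwiGLU feed-forward step. It is an illustration under stated assumptions, not the code used to train this model: the function names (`alibi_slopes`, `alibi_bias`, `swiglu_mlp`), the power-of-two head count, the absence of bias terms, and the plain nested-list representation are all hypothetical choices made for readability.

```python
import math


def alibi_slopes(num_heads):
    """Per-head ALiBi slopes; this sketch assumes a power-of-two head count."""
    if num_heads <= 0 or num_heads & (num_heads - 1):
        raise ValueError("this sketch assumes num_heads is a power of two")
    # Geometric sequence 2^(-8/n), 2^(-16/n), ..., 2^(-8).
    return [2.0 ** (-8.0 * (h + 1) / num_heads) for h in range(num_heads)]


def alibi_bias(num_heads, seq_len):
    """Causal ALiBi bias as nested lists indexed [head][query][key].

    Each entry is added to the attention logit before softmax:
    -slope * (query - key) for visible keys, -inf for future keys.
    """
    return [
        [
            [-slope * (i - j) if j <= i else -math.inf for j in range(seq_len)]
            for i in range(seq_len)
        ]
        for slope in alibi_slopes(num_heads)
    ]


def silu(x):
    """SiLU (swish) activation x * sigmoid(x), written to avoid overflow."""
    if x >= 0:
        return x / (1.0 + math.exp(-x))
    e = math.exp(x)
    return x * e / (1.0 + e)


def matvec(w, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in w]


def swiglu_mlp(x, w_gate, w_up, w_down):
    """SwiGLU feed-forward: w_down @ (silu(w_gate @ x) * (w_up @ x))."""
    gate = [silu(g) for g in matvec(w_gate, x)]
    up = matvec(w_up, x)
    hidden = [g * u for g, u in zip(gate, up)]
    return matvec(w_down, hidden)
```

With ALiBi, no positional embeddings are added to the token embeddings; the linear distance penalty on the attention logits carries the position signal instead. In a PyTorch model the same quantities would normally be built as tensors and broadcast over the batch dimension.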
| | This model has been pushed to the Hub using the [PyTorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration: |