---
title: README
emoji: 🏢
colorFrom: gray
colorTo: indigo
sdk: static
pinned: false
short_description: Description page of AIaLT-IICT organization
---
# Artificial Intelligence and Language Technologies Department at IICT-BAS
Welcome to the Hugging Face organization page of the **Artificial Intelligence and Language Technologies Department** at the **Institute of Information and Communication Technologies, Bulgarian Academy of Sciences**!
The department works on language resources, theoretical machine learning, information retrieval, speech recognition and generation, and language model development for Bulgarian NLP applications.
## This repository offers openly available pre-trained language models designed for the Bulgarian language:
- ModernBERT-based models
  - base (149M) and large (395M) variants with a context length of 8192 tokens
- BERT-based models
  - base (124M) and large (355M) variants, each in uncased and cased versions
  - extra-large variant (859M)
- T5-based models
  - 403M and 1.1B variants
  - 470M variant with character-level tokenization, suitable for spelling correction tasks