Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,61 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
| 4 |
+
|
| 5 |
+
This repo contains YugoGPT - the best open-source base 7B LLM for BCS (Bosnian, Croatian, Serbian) languages developed by Aleksa Gordić.
|
| 6 |
+
|
| 7 |
+
You can access more powerful iterations of YugoGPT already through the recently announced [RunaAI's API platform](https://dev.runaai.com/)!
|
| 8 |
+
|
| 9 |
+
Serbian LLM eval results:
|
| 10 |
+

|
| 11 |
+
|
| 12 |
+
Eval was computed using https://github.com/gordicaleksa/serbian-llm-eval
|
| 13 |
+
|
| 14 |
+
It was trained on tens of billions of BCS tokens and is based off of [Mistral 7B](https://huggingface.co/mistralai/Mistral-7B-v0.1).
|
| 15 |
+
|
| 16 |
+
## Notes
|
| 17 |
+
|
| 18 |
+
1) YugoGPT is a base model and therefore does not have any moderation mechanisms.
|
| 19 |
+
|
| 20 |
+
2) Since it's a base model it won't follow your instructions as it's just a powerful autocomplete engine.
|
| 21 |
+
|
| 22 |
+
3) If you want an access to much more powerful BCS LLMs (some of which are powering [yugochat](https://www.yugochat.com/)) - you can access the models through [RunaAI's API](https://dev.runaai.com/)
|
| 23 |
+
|
| 24 |
+
# Credits
|
| 25 |
+
|
| 26 |
+
The data for the project was obtained with the help of [Nikola Ljubešić](https://nljubesi.github.io/), [CLARIN.SI](https://www.clarin.si), and [CLASSLA](https://www.clarin.si/info/k-centre/).
|
| 27 |
+
|
| 28 |
+
# Project Sponsors
|
| 29 |
+
|
| 30 |
+
A big thank you to the project sponsors!
|
| 31 |
+
|
| 32 |
+
## Platinum sponsors 🌟
|
| 33 |
+
* <b>Ivan</b> (anon)
|
| 34 |
+
* [**Things Solver**](https://thingsolver.com/)
|
| 35 |
+
|
| 36 |
+
## Gold sponsors 🟡
|
| 37 |
+
* **qq** (anon)
|
| 38 |
+
* [**Adam Sofronijevic**](https://www.linkedin.com/in/adam-sofronijevic-685b911/)
|
| 39 |
+
* [**Yanado**](https://yanado.com/)
|
| 40 |
+
* [**Mitar Perovic**](https://www.linkedin.com/in/perovicmitar/)
|
| 41 |
+
* [**Nikola Ivancevic**](https://www.linkedin.com/in/nivancevic/)
|
| 42 |
+
* **Rational Development DOO**
|
| 43 |
+
* [**Ivan**](https://www.linkedin.com/in/ivan-kokic-258262175/) i [**Natalija Kokić**](https://www.linkedin.com/in/natalija-kokic-19a458131/)
|
| 44 |
+
|
| 45 |
+
## Silver sponsors ⚪
|
| 46 |
+
[**psk.rs**](https://psk.rs/), [**OmniStreak**](https://omnistreak.com/), [**Luka Važić**](https://www.linkedin.com/in/vazic/), [**Miloš Durković**](https://www.linkedin.com/in/milo%C5%A1-d-684b99188/), [**Marjan Radeski**](https://www.linkedin.com/in/marjanradeski/), **Marjan Stankovic**, [**Nikola Stojiljkovic**](https://www.linkedin.com/in/nikola-stojiljkovic-10469239/), [**Mihailo Tomić**](https://www.linkedin.com/in/mihailotomic/), [**Bojan Jevtic**](https://www.linkedin.com/in/bojanjevtic/), [**Jelena Jovanović**](https://www.linkedin.com/in/eldumo/), [**Nenad Davidović**](https://www.linkedin.com/in/nenad-davidovic-662ab749/), [**Mika Tasich**](https://www.linkedin.com/in/mikatasich/), [**TRENCH-NS**](https://www.linkedin.com/in/milorad-vukadinovic-64639926/), [**Nemanja Grujičić**](https://twitter.com/nemanjagrujicic), [**tim011**](https://knjigovodja.in.rs/sh)
|
| 47 |
+
|
| 48 |
+
**Also a big thank you to the following individuals:**
|
| 49 |
+
- [**Slobodan Marković**](https://www.linkedin.com/in/smarkovic/) - for spreading the word! :)
|
| 50 |
+
- [**Aleksander Segedi**](https://www.linkedin.com/in/aleksander-segedi-08430936/) - for help around bookkeeping!
|
| 51 |
+
|
| 52 |
+
## Citation
|
| 53 |
+
|
| 54 |
+
```
|
| 55 |
+
@article{YugoGPT,
|
| 56 |
+
author = "Gordić Aleksa",
|
| 57 |
+
title = "YugoGPT - an open-source LLM for Serbian, Bosnian, and Croatian languages",
|
| 58 |
+
year = "2024"
|
| 59 |
+
howpublished = {\url{https://huggingface.co/gordicaleksa/YugoGPT}},
|
| 60 |
+
}
|
| 61 |
+
```
|