KlusAI
Where AI research meets real-world impact
---
## ๐ What We're About
KlusAI bridges the gap between cutting-edge AI research and production systems. We publish our datasets and models openly to advance the field โ **9M+ synthetic training examples** and counting.
**Research Themes:**
- ๐งฌ **Synthetic Data Generation** โ Large-scale training data without privacy concerns
- โก **Efficient AI Systems** โ Models that run on consumer hardware
- ๐ **Multilingual NLP** โ With deep Romanian language expertise
---
## ๐ Featured Publication
### Synthetic Data Generation Using Large Language Models
*Advances in Text and Code* โ **IEEE Access, 2025**
Our comprehensive survey on generating training data using LLMs. How enterprises can generate training data at scale โ reducing annotation costs, addressing data scarcity, and enabling fine-tuning without exposing sensitive data.
๐ [Read on IEEE Xplore](https://ieeexplore.ieee.org/abstract/document/11080380) ยท ๐ [arXiv Preprint](https://arxiv.org/abs/2503.14023)
---
## ๐ฌ Flagship Project: TinyFabulist
**TinyFabulist** is our open research programme on large-scale synthetic narrative generation. We demonstrate that small, efficient models can produce high-quality training data at scale.
| Release | Description | Size |
|---------|-------------|------|
| **TinyFabulist v1** | Synthetic English Fables | ~3M examples |
| *Upcoming* | Multilingual extensions, evaluation benchmarks | โ |
**Key principles:**
- ๐ **Scale** โ 9M+ synthetic training examples generated
- ๐ง **Efficiency** โ All content produced with โค8B parameter models
- ๐ **Openness** โ Generation scripts, pipelines, and methodology shared publicly
๐ [Paper (arXiv)](https://arxiv.org/abs/2504.20605) ยท ๐ป [Code (GitHub)](https://github.com/klusai/tinyfabulist)
---
## ๐ฆ What You'll Find Here
- **Datasets** โ Large-scale synthetic training corpora for fine-tuning and research
- **Models** โ Efficient, instruction-tuned models optimized for specific tasks
- **Evaluation** โ Benchmarks and tooling for synthetic data quality assessment
---
## ๐ค Work With Us
Beyond open research, we offer enterprise AI services:
| Service | Description |
|---------|-------------|
| **AI Strategy** | Define your AI roadmap and implementation plan |
| **Custom Development** | Bespoke AI solutions tailored to your domain |
| **Model Training** | Fine-tuning and deploying models for your use case |
| **MLOps & Infrastructure** | Scalable pipelines and production deployment |
**Need custom synthetic data or domain-specific models?** We partner with organizations on applied research challenges.
---
## ๐ซ Get in Touch
| Purpose | Contact |
|---------|---------|
| Research collaboration | [research@klusai.com](mailto:research@klusai.com) |
| Enterprise services | [services@klusai.com](mailto:services@klusai.com) |
| General inquiries | [hello@klusai.com](mailto:hello@klusai.com) |
> **Technical questions?** Open an issue on the relevant dataset or model repository.
---
Applied Research ยท AI Services ยท Ventures
klusai.com ยท GitHub ยท X