KlusAI
Where AI research meets real-world impact

--- ## 🔍 What We're About KlusAI bridges the gap between cutting-edge AI research and production systems. We publish our datasets and models openly to advance the field — **9M+ synthetic training examples** and counting. **Research Themes:** - 🧬 **Synthetic Data Generation** — Large-scale training data without privacy concerns - ⚡ **Efficient AI Systems** — Models that run on consumer hardware - 🌍 **Multilingual NLP** — With deep Romanian language expertise --- ## 📄 Featured Publication ### Synthetic Data Generation Using Large Language Models *Advances in Text and Code* — **IEEE Access, 2025** Our comprehensive survey on generating training data using LLMs. How enterprises can generate training data at scale — reducing annotation costs, addressing data scarcity, and enabling fine-tuning without exposing sensitive data. 📖 [Read on IEEE Xplore](https://ieeexplore.ieee.org/abstract/document/11080380) · 📝 [arXiv Preprint](https://arxiv.org/abs/2503.14023) --- ## 🔬 Flagship Project: TinyFabulist **TinyFabulist** is our open research programme on large-scale synthetic narrative generation. We demonstrate that small, efficient models can produce high-quality training data at scale. | Release | Description | Size | |---------|-------------|------| | **TinyFabulist v1** | Synthetic English Fables | ~3M examples | | *Upcoming* | Multilingual extensions, evaluation benchmarks | — | **Key principles:** - 📊 **Scale** — 9M+ synthetic training examples generated - 🔧 **Efficiency** — All content produced with ≤8B parameter models - 🔓 **Openness** — Generation scripts, pipelines, and methodology shared publicly 📄 [Paper (arXiv)](https://arxiv.org/abs/2504.20605) · 💻 [Code (GitHub)](https://github.com/klusai/tinyfabulist) --- ## 📦 What You'll Find Here - **Datasets** — Large-scale synthetic training corpora for fine-tuning and research - **Models** — Efficient, instruction-tuned models optimized for specific tasks - **Evaluation** — Benchmarks and tooling for synthetic data quality assessment --- ## 🤝 Work With Us Beyond open research, we offer enterprise AI services: | Service | Description | |---------|-------------| | **AI Strategy** | Define your AI roadmap and implementation plan | | **Custom Development** | Bespoke AI solutions tailored to your domain | | **Model Training** | Fine-tuning and deploying models for your use case | | **MLOps & Infrastructure** | Scalable pipelines and production deployment | **Need custom synthetic data or domain-specific models?** We partner with organizations on applied research challenges. --- ## 📫 Get in Touch | Purpose | Contact | |---------|---------| | Research collaboration | [research@klusai.com](mailto:research@klusai.com) | | Enterprise services | [services@klusai.com](mailto:services@klusai.com) | | General inquiries | [hello@klusai.com](mailto:hello@klusai.com) | > **Technical questions?** Open an issue on the relevant dataset or model repository. ---

Applied Research · AI Services · Ventures
klusai.com · GitHub · X