KlusAI
Where AI research meets real-world impact

Website GitHub X Research

--- ## ๐Ÿ” What We're About KlusAI bridges the gap between cutting-edge AI research and production systems. We publish our datasets and models openly to advance the field โ€” **9M+ synthetic training examples** and counting. **Research Themes:** - ๐Ÿงฌ **Synthetic Data Generation** โ€” Large-scale training data without privacy concerns - โšก **Efficient AI Systems** โ€” Models that run on consumer hardware - ๐ŸŒ **Multilingual NLP** โ€” With deep Romanian language expertise --- ## ๐Ÿ“„ Featured Publication ### Synthetic Data Generation Using Large Language Models *Advances in Text and Code* โ€” **IEEE Access, 2025** Our comprehensive survey on generating training data using LLMs. How enterprises can generate training data at scale โ€” reducing annotation costs, addressing data scarcity, and enabling fine-tuning without exposing sensitive data. ๐Ÿ“– [Read on IEEE Xplore](https://ieeexplore.ieee.org/abstract/document/11080380) ยท ๐Ÿ“ [arXiv Preprint](https://arxiv.org/abs/2503.14023) --- ## ๐Ÿ”ฌ Flagship Project: TinyFabulist **TinyFabulist** is our open research programme on large-scale synthetic narrative generation. We demonstrate that small, efficient models can produce high-quality training data at scale. | Release | Description | Size | |---------|-------------|------| | **TinyFabulist v1** | Synthetic English Fables | ~3M examples | | *Upcoming* | Multilingual extensions, evaluation benchmarks | โ€” | **Key principles:** - ๐Ÿ“Š **Scale** โ€” 9M+ synthetic training examples generated - ๐Ÿ”ง **Efficiency** โ€” All content produced with โ‰ค8B parameter models - ๐Ÿ”“ **Openness** โ€” Generation scripts, pipelines, and methodology shared publicly ๐Ÿ“„ [Paper (arXiv)](https://arxiv.org/abs/2504.20605) ยท ๐Ÿ’ป [Code (GitHub)](https://github.com/klusai/tinyfabulist) --- ## ๐Ÿ“ฆ What You'll Find Here - **Datasets** โ€” Large-scale synthetic training corpora for fine-tuning and research - **Models** โ€” Efficient, instruction-tuned models optimized for specific tasks - **Evaluation** โ€” Benchmarks and tooling for synthetic data quality assessment --- ## ๐Ÿค Work With Us Beyond open research, we offer enterprise AI services: | Service | Description | |---------|-------------| | **AI Strategy** | Define your AI roadmap and implementation plan | | **Custom Development** | Bespoke AI solutions tailored to your domain | | **Model Training** | Fine-tuning and deploying models for your use case | | **MLOps & Infrastructure** | Scalable pipelines and production deployment | **Need custom synthetic data or domain-specific models?** We partner with organizations on applied research challenges. --- ## ๐Ÿ“ซ Get in Touch | Purpose | Contact | |---------|---------| | Research collaboration | [research@klusai.com](mailto:research@klusai.com) | | Enterprise services | [services@klusai.com](mailto:services@klusai.com) | | General inquiries | [hello@klusai.com](mailto:hello@klusai.com) | > **Technical questions?** Open an issue on the relevant dataset or model repository. ---

Applied Research ยท AI Services ยท Ventures
klusai.com ยท GitHub ยท X