mihainadas commited on
Commit
26899ce
Β·
verified Β·
1 Parent(s): 2b33a1e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +70 -28
README.md CHANGED
@@ -1,8 +1,8 @@
1
  <!-- KlusAI β€’ Hugging Face Org Card -->
2
 
3
  <p align="center">
4
- <strong>KlusAI Labs</strong><br>
5
- <em>Applied AI β€’ Open Research β€’ Romanian Craftsmanship</em>
6
  </p>
7
 
8
  <p align="center">
@@ -12,50 +12,92 @@
12
  <a href="https://github.com/klusai">
13
  <img src="https://img.shields.io/badge/GitHub-@klusai-black?logo=github" alt="GitHub">
14
  </a>
15
- <a href="https://twitter.com/klusai">
16
- <img src="https://img.shields.io/badge/Twitter-@klusai-1DA1F2?logo=twitter&logoColor=white" alt="Twitter">
17
  </a>
18
- <a href="https://www.klusai.com/contact">
19
- <img src="https://img.shields.io/badge/Contact-us-brightgreen?logo=minutemailer&logoColor=white" alt="Contact us">
20
  </a>
21
  </p>
22
 
23
  ---
24
 
25
- ## πŸ” What we’re about
26
- KlusAI builds **compact, production-ready language models** and shares the research openly.
27
- Our goal: make enterprise-grade NLP accessible without the giant hardware bill.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
28
 
29
  ---
30
 
31
- ## πŸ”¬ Research spotlight
 
 
 
 
 
 
 
32
 
33
- ### TinyFabulist Project
34
- **TinyFabulist** is our flagship *open research programme* on generative narrative AI. Rather than a single dataset, it is a **living corpus** that explores how *small language models (SLMs)* can craft moral fables at scale, providing a fertile playground for studying story structure, controllable generation, and cultural adaptation.
 
 
35
 
36
- - **Growing corpus** – Public releases begin with **TinyFabulist v1** (~3 M English fables). Upcoming versions will add new languages, richer metadata, and evaluation benchmarks.
37
- - **Compact-model focus** – All content is produced with lightweight, instruction-tuned models (≀ 8 B params), showing that compelling long-form text is possible without gigantic, closed models.
38
- - **Open tooling** – We share generation scripts, evaluation pipelines, and annotation guides so the community can reproduce, critique, and extend our work.
39
 
40
  ---
41
 
42
- ## πŸ’Ό What we do for partners
43
 
44
- | Service | What you get |
45
- | --- | --- |
46
- | **Custom AI consulting & delivery** | End-to-end design, build, and deployment of tailor-made ML/NLP systems. |
47
- | **Business-process automation** | AI workflows that cut manual effort and boost operational speed. |
48
- | **Data architecture & analytics** | Data pipelines and dashboards that transform raw data into decisions. |
49
- | **AI training & upskilling** | Workshops, bootcamps, and mentorship for teams and academia. |
50
 
51
  ---
52
 
53
- ## 🀝 Let’s collaborate
54
 
55
- We love open dialogue!
56
 
57
- - **Website & contact form:** <https://www.klusai.com/contact/>
58
- - **GitHub & Twitter:** `@klusai`
59
- - **Hugging Face issues:** Open an issue on this profile for technical questions or requests.
 
 
 
60
 
61
- > Working on something exciting in Romanian AI? **Drop us a line** – we’re always keen to collaborate.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  <!-- KlusAI β€’ Hugging Face Org Card -->
2
 
3
  <p align="center">
4
+ <strong>KlusAI</strong><br>
5
+ <em>Where AI research meets real-world impact</em>
6
  </p>
7
 
8
  <p align="center">
 
12
  <a href="https://github.com/klusai">
13
  <img src="https://img.shields.io/badge/GitHub-@klusai-black?logo=github" alt="GitHub">
14
  </a>
15
+ <a href="https://x.com/klusai">
16
+ <img src="https://img.shields.io/badge/X-@klusai-black?logo=x&logoColor=white" alt="X">
17
  </a>
18
+ <a href="https://www.klusai.com/research/">
19
+ <img src="https://img.shields.io/badge/Research-klusai.com-brightgreen?logo=beaker&logoColor=white" alt="Research">
20
  </a>
21
  </p>
22
 
23
  ---
24
 
25
+ ## πŸ” What We're About
26
+
27
+ KlusAI bridges the gap between cutting-edge AI research and production systems. We publish our datasets and models openly to advance the field β€” **9M+ synthetic training examples** and counting.
28
+
29
+ **Research Themes:**
30
+ - 🧬 **Synthetic Data Generation** β€” Large-scale training data without privacy concerns
31
+ - ⚑ **Efficient AI Systems** β€” Models that run on consumer hardware
32
+ - 🌍 **Multilingual NLP** β€” With deep Romanian language expertise
33
+
34
+ ---
35
+
36
+ ## πŸ“„ Featured Publication
37
+
38
+ ### Synthetic Data Generation Using Large Language Models
39
+ *Advances in Text and Code* β€” **IEEE Access, 2025**
40
+
41
+ Our comprehensive survey on generating training data using LLMs. How enterprises can generate training data at scale β€” reducing annotation costs, addressing data scarcity, and enabling fine-tuning without exposing sensitive data.
42
+
43
+ πŸ“– [Read on IEEE Xplore](https://ieeexplore.ieee.org/abstract/document/11080380) Β· πŸ“ [arXiv Preprint](https://arxiv.org/abs/2503.14023)
44
 
45
  ---
46
 
47
+ ## πŸ”¬ Flagship Project: TinyFabulist
48
+
49
+ **TinyFabulist** is our open research programme on large-scale synthetic narrative generation. We demonstrate that small, efficient models can produce high-quality training data at scale.
50
+
51
+ | Release | Description | Size |
52
+ |---------|-------------|------|
53
+ | **TinyFabulist v1** | Synthetic English Fables | ~3M examples |
54
+ | *Upcoming* | Multilingual extensions, evaluation benchmarks | β€” |
55
 
56
+ **Key principles:**
57
+ - πŸ“Š **Scale** β€” 9M+ synthetic training examples generated
58
+ - πŸ”§ **Efficiency** β€” All content produced with ≀8B parameter models
59
+ - πŸ”“ **Openness** β€” Generation scripts, pipelines, and methodology shared publicly
60
 
61
+ πŸ“„ [Paper (arXiv)](https://arxiv.org/abs/2504.20605) Β· πŸ’» [Code (GitHub)](https://github.com/klusai/tinyfabulist)
 
 
62
 
63
  ---
64
 
65
+ ## πŸ“¦ What You'll Find Here
66
 
67
+ - **Datasets** β€” Large-scale synthetic training corpora for fine-tuning and research
68
+ - **Models** β€” Efficient, instruction-tuned models optimized for specific tasks
69
+ - **Evaluation** β€” Benchmarks and tooling for synthetic data quality assessment
 
 
 
70
 
71
  ---
72
 
73
+ ## 🀝 Work With Us
74
 
75
+ Beyond open research, we offer enterprise AI services:
76
 
77
+ | Service | Description |
78
+ |---------|-------------|
79
+ | **AI Strategy** | Define your AI roadmap and implementation plan |
80
+ | **Custom Development** | Bespoke AI solutions tailored to your domain |
81
+ | **Model Training** | Fine-tuning and deploying models for your use case |
82
+ | **MLOps & Infrastructure** | Scalable pipelines and production deployment |
83
 
84
+ **Need custom synthetic data or domain-specific models?** We partner with organizations on applied research challenges.
85
+
86
+ ---
87
+
88
+ ## πŸ“« Get in Touch
89
+
90
+ | Purpose | Contact |
91
+ |---------|---------|
92
+ | Research collaboration | [research@klusai.com](mailto:research@klusai.com) |
93
+ | Enterprise services | [services@klusai.com](mailto:services@klusai.com) |
94
+ | General inquiries | [hello@klusai.com](mailto:hello@klusai.com) |
95
+
96
+ > **Technical questions?** Open an issue on the relevant dataset or model repository.
97
+
98
+ ---
99
+
100
+ <p align="center">
101
+ <strong>Applied Research Β· AI Services Β· Ventures</strong><br>
102
+ <a href="https://klusai.com">klusai.com</a> Β· <a href="https://github.com/klusai">GitHub</a> Β· <a href="https://x.com/klusai">X</a>
103
+ </p>