---
license: other
license_name: test
license_link: LICENSE
language:
- en
- fr
- de
- es
- pt
metrics:
- accuracy
- cer
pipeline_tag: automatic-speech-recognition
homepage: https://kroko.ai/
---
# Welcome to Kroko 👋
## **Open-source speech recognition built for developers.**
>
> Our engine is fully open-source, and you choose how to deploy models: use our **CC-BY-SA licensed community models** or upgrade to **commercial models** with premium performance. We focus on building **fast, high-quality production models** and providing **examples that take the guesswork out** of integration.
## Why Kroko ASR?
- ⚡ **Fast & lightweight** – optimized Zipformer models (Whisper- and Parakeet-style models coming).
- 🧩 **Flexible licensing** – use **fully open-source CC-BY-SA community models** or integrate **commercial/OEM models** for premium accuracy.
- 🌍 **Runs anywhere** – cross-platform, with support for many programming languages.
- 📱 **Mobile & web ready** – works on Android (iOS coming soon), in the browser via WASM, and with WebSockets for streaming.
- 🧰 **Production focus** – we prioritize real-world performance, stability, and examples.
- 🤝 **Customizable** – bring your own model, fine-tune for domain-specific vocabularies, or commission us.
> Our mission: **fast, high-quality ASR with licensing that works for both open-source and closed-source projects.**
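
Streaming over WebSockets, as mentioned in the list above, usually means sending audio in small fixed-duration chunks. The following is a minimal sketch of that client-side chunking step only. The 16 kHz sample rate, the 100 ms chunk length, the 16-bit PCM encoding, and the helper names `float_to_pcm16` and `iter_chunks` are illustrative assumptions, not part of the Kroko API; check the audio format your server expects before relying on any of them.

```python
import struct
from typing import Iterator, Sequence

SAMPLE_RATE = 16_000  # assumption: 16 kHz mono audio
CHUNK_MS = 100        # assumption: 100 ms of audio per message


def float_to_pcm16(samples: Sequence[float]) -> bytes:
    """Convert float samples in [-1.0, 1.0] to little-endian 16-bit PCM bytes."""
    # Clip out-of-range values so they cannot overflow the 16-bit range.
    ints = [int(round(max(-1.0, min(1.0, s)) * 32767)) for s in samples]
    return struct.pack(f"<{len(ints)}h", *ints)


def iter_chunks(
    samples: Sequence[float],
    sample_rate: int = SAMPLE_RATE,
    chunk_ms: int = CHUNK_MS,
) -> Iterator[bytes]:
    """Yield fixed-duration PCM chunks; the final chunk may be shorter."""
    step = sample_rate * chunk_ms // 1000  # samples per chunk
    if step <= 0:
        raise ValueError("sample_rate and chunk_ms must give at least one sample per chunk")
    for start in range(0, len(samples), step):
        yield float_to_pcm16(samples[start:start + step])
```

Each yielded `bytes` object could then be sent as one binary WebSocket message by whichever client library you use.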
## Demos
### ▶️ Android App
Run speech recognition **natively on your phone** using ONNX Runtime.
- [Kroko ASR Model Explorer](https://play.google.com/store/apps/details?id=com.krokoasr.demo&hl=en)
### 🌐 Browser (WASM)
Experience transcription **directly in your browser**, no server required.
- [Hugging Face Spaces Demo](https://huggingface.co/spaces/Banafo/Kroko-Streaming-ASR-Wasm)
## Models
Kroko ASR follows a **dual-model strategy** (community and commercial), with a third option to bring your own model:
### 1. Community Models (free, open-source)
- Licensed under **CC-BY-SA**.
- Low-latency, lightweight models.
- Perfect for hobby projects, research, or free tiers.
- Faster and smaller than Whisper/Parakeet in many scenarios.
### 2. Commercial & OEM Models
- Premium accuracy and robustness.
- Licensed for professional and production products.
- Designed for SaaS, dev tools, and enterprise integration.
### 3. Bring, Train, or Commission Your Own
- **DIY:** Use our training guides to build and distribute your own models.
- **Professional services:** Work with us to create fine-tuned models for accents, jargon, or specialized domains.
> This gives you **full freedom**: start free, scale commercially, or roll your own.
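
When choosing between community and commercial models, it can help to measure character error rate (CER, one of the metrics listed in this model card's metadata) on your own transcripts. The sketch below computes CER as the Levenshtein edit distance between a reference and a hypothesis, divided by the reference length. The function names are hypothetical and this is not Kroko's evaluation tooling; it also applies no text normalization (casing, punctuation), which a real evaluation would likely need.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference character
                curr[j - 1] + 1,     # insert a hypothesis character
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference transcript must not be empty")
    return edit_distance(ref, hyp) / len(ref)
```

For example, `cer("hello", "hallo")` is one substitution over five reference characters, that is, 0.2.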
## Our Community
Join the Kroko community to learn, share, and contribute:
- 💬 **[Discord](https://discord.gg/JT7wdtnK79)** – chat with developers, ask questions, and share projects.
- 📢 **[Reddit](https://www.reddit.com/r/kroko_ai/)** – join discussions, showcase your integrations, and follow updates.
- 🤗 **[Hugging Face](https://huggingface.co/Banafo/Kroko-ASR)** – explore our models, try live demos, and contribute feedback.
## Contributing
PRs welcome! Run `ruff`, `black`, and `pytest` before submitting.
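
As a convenience, the three checks can be run from one small helper. This is a sketch under assumptions: the exact invocations (`ruff check .`, `black --check .`, and a bare `pytest`) are common defaults rather than settings confirmed by this repository, and `run_checks` is a hypothetical name.

```python
import subprocess
from typing import Callable, Sequence

# Assumption: default invocations; the repository may configure other options.
CHECKS: list[list[str]] = [
    ["ruff", "check", "."],
    ["black", "--check", "."],
    ["pytest"],
]


def _run(cmd: Sequence[str]) -> int:
    """Run one command and return its exit code."""
    return subprocess.run(list(cmd)).returncode


def run_checks(run: Callable[[Sequence[str]], int] = _run) -> bool:
    """Run every check, even after a failure, and report whether all passed."""
    ok = True
    for cmd in CHECKS:
        print("$", " ".join(cmd))
        if run(cmd) != 0:
            ok = False
    return ok
```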
---
## License
The engine is licensed under Apache-2.0. Models are licensed separately (CC-BY-SA for community models, commercial/OEM terms for premium models).
---
## Credits
Kroko ASR is built on top of [**Sherpa-ONNX**](https://k2-fsa.github.io/sherpa/).
⚠️ **Note:** Kroko ASR is an independent project and is **not affiliated with Sherpa-ONNX**. We build on their excellent open-source engine, but our models, demos, and packaging are developed and maintained separately.
---