File size: 3,571 Bytes
612e6d1 2b25c1b 612e6d1 859102d 76e3e66 56eee5c d613389 56eee5c d613389 de5691b d613389 de5691b d613389 ecc0002 d613389 fc16586 d613389 ecc0002 afa5c74 d613389 56eee5c afa5c74 56eee5c fc16586 d613389 c683ee7 d613389 c683ee7 d613389 56eee5c d613389 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 |
---
license: other
license_name: test
license_link: LICENSE
language:
- en
- fr
- de
- es
- pt
metrics:
- accuracy
- cer
pipeline_tag: automatic-speech-recognition
homepage: https://kroko.ai/
---
# Welcome to Kroko π
## **Open-source speech recognition built for developers.**
>
> Our engine is fully open-source, and you choose how to deploy models: use our **CC-BY-SA licensed community models** or upgrade to **commercial models** with premium performance. We focus on building **fast, high-quality production models** and providing **examples that take the guesswork out** of integration.
## Why Kroko ASR?
- β‘ **Fast & lightweight** β optimized Zipformer models (Whisper and parakeet style coming).
- π§© **Flexible licensing** β use **fully open-source CC-BY-SA community models** or integrate **commercial/OEM models** for premium accuracy.
- π **Runs anywhere** β cross-platform and with support for many programming languages.
- π± **Mobile & web ready** β works on Android, (iOS coming soon) in the browser via WASM, and with WebSockets for streaming.
- π§° **Production focus** β we prioritize real-world performance, stability, and examples.
- π€ **Customizable** β bring your own model, fine-tune for domain-specific vocabularies, or commission us.
> Our mission: **fast, high-quality ASR with licensing that works for both open-source and closed-source projects.**
## Demos
### βΆοΈ Android App
Run speech recognition **natively on your phone** using ONNX Runtime.
- [Kroko ASR Model Explorer](https://play.google.com/store/apps/details?id=com.krokoasr.demo&hl=en)
### π Browser (WASM)
Experience transcription **directly in your browser**, no server required.
- [Hugging Face Spaces Demo](https://huggingface.co/spaces/Banafo/Kroko-Streaming-ASR-Wasm)
## Models
Kroko ASR follows a **unique dual-model strategy**:
### 1. Community Models (free, open-source)
- Licensed under **CC-BY-SA**.
- Low-latency, lightweight models.
- Perfect for hobby projects, research, or free tiers.
- Faster and smaller than Whisper/Parakeet in many scenarios.
### 2. Commercial & OEM Models
- Premium accuracy and robustness.
- Licensed for professional and production products.
- Designed for SaaS, dev tools, and enterprise integration.
### 3. Bring, Train, or Commission Your Own
- **DIY:** Use our training guides to build and distribute your own models.
- **Professional services:** Work with us to create fine-tuned models for accents, jargon, or specialized domains.
> This gives you **full freedom**: start free, scale commercially, or roll your own.
## Our Community
Join the Kroko community to learn, share, and contribute:
- π¬ **[Discord](https://discord.gg/JT7wdtnK79)** β chat with developers, ask questions, and share projects.
- π’ **[Reddit](https://www.reddit.com/r/kroko_ai/)** β join discussions, showcase your integrations, and follow updates.
- π€ **[Hugging Face](https://huggingface.co/Banafo/Kroko-ASR)** β explore our models, try live demos, and contribute feedback.
## Contributing
PRs welcome! Run `ruff`, `black`, and `pytest` before submitting.
---
## License
Apache-2.0 engine. Models licensed separately (CC-BY-SA community or commercial OEM).
---
## Credits
Kroko ASR is built on top of [**Sherpa-ONNX**](https://k2-fsa.github.io/sherpa/).
β οΈ **Note:** Kroko ASR is an independent project and is **not affiliated with Sherpa-ONNX**. We build on their excellent open-source engine, but our models, demos, and packaging are developed and maintained separately.
---
|