Banafo/Kroko-ASR

Automatic Speech Recognition

Banafo committed on Sep 29, 2025

Commit d613389 (verified) · 1 Parent(s): 4ade228

Update README.md

README.md +66 -20

@@ -13,36 +13,82 @@ metrics:
 - cer
 pipeline_tag: automatic-speech-recognition
 ---
-# Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
 >
->
-**( update september 2025 - CC-BY-SA models were just uploaded, the new ones (with .data extension ) are CC-BY-SA licensed, the .onnx are still non-commercial only. Github and readme updates coming soon. )**
-## Overview
-This is a family of low-latency streaming models designed for use on edge devices.
-**Goal**: Provide faster or higher-quality performance compared to similarly sized Whisper and other models.
-- **Languages**: English, French, German (7 more languages coming).
 ## Demos
-- [**Browser Demo (CPU)**](https://huggingface.co/spaces/Banafo/Kroko-Streaming-ASR-Wasm)
-  *(Runs entirely in the browser using CPU.)*
-- [**Gradio / Python Demo**](https://huggingface.co/spaces/Banafo/Kroko-Streaming-ASR-Python)
 ## License
-The license is still under consideration (likely Coqui). The model is intended to be **dual-licensed**:
-- **Free for non-commercial use**.
-- **Affordable license for commercial use**.
-## Training
-- Training is done with a modified k2/Icefall pipeline.
-- Inference can be performed with the standard Sherpa project.
-- Silence padding and volume normalization may help produce better results.
-## Acknowledgements
-Special thanks to the [Lhotse](https://github.com/lhotse-speech/lhotse), [Sherpa](https://github.com/k2-fsa/sherpa), [k2](https://github.com/k2-fsa/k2), and [Icefall](https://github.com/k2-fsa/icefall) teams for their support and tools.

 - cer
 pipeline_tag: automatic-speech-recognition
 ---
+# Welcome to Kroko 👋
+## **Open-source speech recognition built for developers.**
 >
+> Our engine is fully open-source, and you choose how to deploy models: use our **CC-BY-SA licensed community models** or upgrade to **commercial models** with premium performance. We focus on building **fast, high-quality production models** and providing **examples that take the guesswork out** of integration.
+## Why Kroko ASR?
+- ⚡ **Fast & lightweight** – optimized Zipformer models (Whisper- and Parakeet-style models coming).
+- 🧩 **Flexible licensing** – use **fully open-source CC-BY-SA community models** or integrate **commercial/OEM models** for premium accuracy.
+- 🌍 **Runs anywhere** – cross-platform, with support for many programming languages.
+- 📱 **Mobile & web ready** – works on Android (iOS coming soon), in the browser via WASM, and with WebSockets for streaming.
+- 🧰 **Production focus** – we prioritize real-world performance, stability, and examples.
+- 🤝 **Customizable** – bring your own model, fine-tune for domain-specific vocabularies, or commission us.
+> Our mission: **fast, high-quality ASR with licensing that works for both open-source and closed-source projects.**
 ## Demos
+### ▶️ Android App
+Run speech recognition **natively on your phone** using ONNX Runtime.
+- [Kroko ASR Model Explorer](https://play.google.com/store/apps/details?id=com.krokoasr.demo&hl=en)
+### 🌐 Browser (WASM)
+Experience transcription **directly in your browser**, no server required.
+- [Hugging Face Spaces Demo](https://huggingface.co/spaces/Banafo/Kroko-Streaming-ASR-Wasm)
+## Models
+Kroko ASR follows a **dual-model strategy**, with a third option to bring your own model:
+### 1. Community Models (free, open-source)
+- Licensed under **CC-BY-SA**.
+- Low-latency, lightweight models.
+- Perfect for hobby projects, research, or free tiers.
+- Faster and smaller than Whisper/Parakeet in many scenarios.
+### 2. Commercial & OEM Models
+- Premium accuracy and robustness.
+- Licensed for professional and production products.
+- Designed for SaaS, dev tools, and enterprise integration.
+### 3. Bring, Train, or Commission Your Own
+- **DIY:** Use our training guides to build and distribute your own models.
+- **Professional services:** Work with us to create fine-tuned models for accents, jargon, or specialized domains.
+> This gives you **full freedom**: start free, scale commercially, or roll your own.
+## Our Community
+Join the Kroko community to learn, share, and contribute:
+- 💬 **[Discord](https://discord.gg/JT7wdtnK79)** – chat with developers, ask questions, and share projects.
+- 📢 **[Reddit](https://www.reddit.com/r/kroko_ai/)** – join discussions, showcase your integrations, and follow updates.
+- 🤗 **[Hugging Face](https://huggingface.co/Banafo/Kroko-ASR)** – explore our models, try live demos, and contribute feedback.
+## Contributing
+PRs welcome! Run `ruff`, `black`, and `pytest` before submitting.
+---
 ## License
+The engine is licensed under Apache-2.0. Models are licensed separately (CC-BY-SA community models or commercial/OEM models).
+---
+## Credits
+Kroko ASR is built on top of [**Sherpa-ONNX**](https://k2-fsa.github.io/sherpa/).
+⚠️ **Note:** Kroko ASR is an independent project and is **not affiliated with Sherpa-ONNX**. We build on their excellent open-source engine, but our models, demos, and packaging are developed and maintained separately.
+---
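
The Training section removed in this commit noted that silence padding and volume normalization may help produce better results. A minimal sketch of that preprocessing in plain Python might look like the following. The function names, the 16 kHz sample rate, the 0.5 s pad length, and the 0.9 peak target are illustrative assumptions, not values taken from the Kroko or Sherpa documentation.

```python
from typing import List


def normalize_peak(samples: List[float], target_peak: float = 0.9) -> List[float]:
    """Scale samples so the loudest one has magnitude target_peak (assumed value)."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)  # all-silent or empty input: nothing to scale
    gain = target_peak / peak
    return [s * gain for s in samples]


def pad_silence(samples: List[float], sample_rate: int,
                pad_seconds: float = 0.5) -> List[float]:
    """Add pad_seconds of zero-valued samples before and after the audio."""
    pad = [0.0] * int(sample_rate * pad_seconds)
    return pad + list(samples) + pad


def preprocess(samples: List[float], sample_rate: int = 16000) -> List[float]:
    """Hypothetical helper: normalize volume first, then pad with silence."""
    return pad_silence(normalize_peak(samples), sample_rate)
```

The samples are assumed to be mono floats in the range [-1.0, 1.0]. The output of `preprocess` would then be handed to whatever recognizer is in use. Whether either step improves accuracy for a given model is something to verify empirically.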