assemsabry commited on
Commit
f7050f2
·
verified ·
1 Parent(s): c0274af

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -3
README.md CHANGED
@@ -1,3 +1,36 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # ListenX Medium (GGUF) by Token AI
2
+
3
+ **ListenX Medium** is an advanced speech recognition model developed by **Token AI**, based on the Whisper Medium architecture.
4
+ It is optimized for accurate multilingual transcription and translation, supporting high-quality performance across various speech recognition tasks.
5
+
6
+ ## Model Overview
7
+
8
+ - **Model Name:** ListenX Medium
9
+ - **Developer:** Token AI
10
+ - **Format:** GGUF
11
+ - **Architecture:** Whisper Medium (modified and optimized by Token AI)
12
+ - **Primary Use:** Speech-to-text and audio transcription
13
+ - **Supported Languages:** English, Arabic, and multiple others
14
+ - **Release Year:** 2025
15
+
16
+ This model was designed to achieve high transcription accuracy even in noisy environments, making it suitable for research, AI assistants, automated captioning, and call center systems.
17
+
18
+ ## Technical Details
19
+
20
+ | Attribute | Description |
21
+ |------------|-------------|
22
+ | Model Type | Encoder-decoder Transformer |
23
+ | Quantization | GGUF format for optimized CPU and GPU inference |
24
+ | Input | 16kHz mono audio waveform |
25
+ | Output | Transcribed or translated text |
26
+ | Training Data | Multilingual and domain-diverse speech datasets |
27
+ | Framework Compatibility | whisper.cpp, ctransformers, llama.cpp, and compatible backends |
28
+
29
+ ## Usage
30
+
31
+ ### 1. Using `whisper.cpp`
32
+
33
+ Download the model file (`model.gguf`) and run:
34
+
35
+ ```bash
36
+ ./main -m models/listenx-medium/model.gguf -f samples/audio.wav -otxt