ListenX-Medium / README.md

assemsabry

Update README.md

42be5c8 verified 4 months ago

preview code

raw

history blame contribute delete

1.91 kB

metadata

language:
  - en
  - ar
tags:
  - gguf
  - speech-recognition
  - audio
  - whisper
  - tokenai
license: apache-2.0
datasets:
  - multilingual-speech
model-index:
  - name: ListenX Medium
    results:
      - task:
          type: automatic-speech-recognition
          name: Automatic Speech Recognition
        dataset:
          name: multilingual-speech
          type: audio
        metrics:
          - type: word-error-rate
            value: 5.2

ListenX Logo

ListenX Medium (GGUF) by Token AI

ListenX Medium is an advanced speech recognition model developed by Token AI, based on the Whisper Medium architecture.
It is optimized for accurate multilingual transcription and translation, supporting high-quality performance across various speech recognition tasks.

Model Overview

Model Name: ListenX Medium
Developer: Token AI
Format: GGUF
Architecture: Whisper Medium (modified and optimized by Token AI)
Primary Use: Speech-to-text and audio transcription
Supported Languages: English, Arabic, and multiple others
Release Year: 2025

This model was designed to achieve high transcription accuracy even in noisy environments, making it suitable for research, AI assistants, automated captioning, and call center systems.

Technical Details

Attribute	Description
Model Type	Encoder-decoder Transformer
Quantization	GGUF format for optimized CPU and GPU inference
Input	16kHz mono audio waveform
Output	Transcribed or translated text
Training Data	Multilingual and domain-diverse speech datasets
Framework Compatibility	whisper.cpp, ctransformers, llama.cpp, and compatible backends

Usage

1. Using `whisper.cpp`

Download the model file (model.gguf) and run:

./main -m models/listenx-medium/model.gguf -f samples/audio.wav -otxt

ListenX Medium (GGUF) by Token AI

Model Overview

Technical Details

Usage

1. Using whisper.cpp

1. Using `whisper.cpp`