cohere-asr-ja-v0.1 - GGUF

GGUF conversion of efwkjn/cohere-asr-ja-v0.1, a Japanese fine-tune of CohereLabs/cohere-transcribe-03-2026.

These files target the Cohere Transcribe runtime in CrispASR. The GGUF architecture metadata is cohere-transcribe.

Files

File Type Notes
cohere-asr-ja-v0.1.gguf F16 Full-precision GGUF export
cohere-asr-ja-v0.1-q8_0.gguf Q8_0 Quantized
cohere-asr-ja-v0.1-q6_k.gguf Q6_K Quantized
cohere-asr-ja-v0.1-q5_1.gguf Q5_1 Quantized
cohere-asr-ja-v0.1-q5_0.gguf Q5_0 Quantized
cohere-asr-ja-v0.1-q4_k.gguf Q4_K Quantized

Usage

./build/bin/crispasr \
  --backend cohere \
  -m cohere-asr-ja-v0.1-q4_k.gguf \
  -f audio-ja.wav \
  -l ja

For lower memory startup, CrispASR can mmap GGUF weights:

CRISPASR_GGUF_MMAP=1 ./build/bin/crispasr --backend cohere \
  -m cohere-asr-ja-v0.1-q4_k.gguf -f audio-ja.wav -l ja

Conversion

Converted with CrispASR's Cohere exporter:

python models/convert-cohere-asr-to-gguf.py \
  --model-dir /path/to/efwkjn/cohere-asr-ja-v0.1 \
  --output cohere-asr-ja-v0.1.gguf

./build/bin/crispasr-quantize cohere-asr-ja-v0.1.gguf cohere-asr-ja-v0.1-q4_k.gguf q4_k

The variant set mirrors cstr/cohere-transcribe-03-2026-GGUF: F16 plus Q8_0, Q6_K, Q5_1, Q5_0, and Q4_K.

Smoke Test

All six files were smoke-tested locally with CrispASR on a Japanese WAV. Output for the sample was:

おはよう。今日も一日頑張ってね

License

The base Cohere Transcribe model is Apache-2.0. The fine-tune repository does not declare an explicit license field at the time of conversion; check the source model card before redistribution or commercial use.

Downloads last month
201
GGUF
Model size
2B params
Architecture
cohere-transcribe
Hardware compatibility
Log In to add your hardware

5-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for TransWithAI/cohere-transcribe-ja-v0.1-GGUF

Quantized
(1)
this model