Punctuate-All (GGUF)

GGUF conversion of kredor/punctuate-all for use with CrispASR.

Adds punctuation to unpunctuated ASR output. 12 languages with ASCII punctuation output. Smaller and faster alternative to fullstop-punc-multilang (base vs large).

Model Details

  • Architecture: XLM-RoBERTa-base โ€” 12L, d=768, 12 heads, d_ffn=3072, GELU
  • Parameters: ~278M
  • Classifier: Linear(768, 6) โ€” 6 punctuation classes
  • Labels: none, . (period), , (comma), ? (question), - (dash), : (colon)
  • Vocabulary: SentencePiece (250,002 tokens)
  • Max sequence: 512 tokens (auto-chunked)
  • Languages: en, de, fr, es, bg, it, pl, nl, cs, pt, sk, sl
  • License: MIT

Usage with CrispASR

crispasr --backend wav2vec2 -m wav2vec2.gguf --punc-model punctuate-all-q4_k.gguf -f audio.wav

Available Files

File Quant Size Description
punctuate-all-f16.gguf F16 901 MB Half precision
punctuate-all-q4_k.gguf Q4_K 154 MB Recommended

Original Model

Downloads last month
65
GGUF
Model size
0.3B params
Architecture
fireredpunc
Hardware compatibility
Log In to add your hardware

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for cstr/punctuate-all-GGUF

Quantized
(3)
this model