cstr
/

fireredpunc-GGUF

Token Classification

text-processing

Model card Files Files and versions

fireredpunc-GGUF / README.md

cstr's picture

Add README

ffe0b1f verified about 2 months ago

|

history blame contribute delete

2.46 kB

	---
	license: apache-2.0
	language:
	- zh
	- en
	tags:
	- gguf
	- ggml
	- punctuation
	- text-processing
	- bert
	- nlp
	base_model: FireRedTeam/FireRedPunc
	pipeline_tag: token-classification
	---

	# FireRedPunc (GGUF)

	GGUF conversion of [FireRedTeam/FireRedPunc](https://huggingface.co/FireRedTeam/FireRedPunc) for use with [CrispASR](https://github.com/CrispStrobe/CrispASR).

	Adds punctuation to unpunctuated ASR output using a BERT-based token classifier.

	## Model Details

	- Architecture: BERT-base (LERT) — 12L, d=768, 12 heads, d_ffn=3072, GELU
	- Parameters: ~102M
	- Classifier: Linear(768, 5) — 5 punctuation classes
	- Labels: space, ，(comma), 。(period), ？(question), ！(exclamation)
	- Vocabulary: chinese-bert-wwm-ext (21,128 tokens, WordPiece)
	- Max sequence: 512 tokens (longer texts are automatically chunked)
	- Languages: Chinese + English
	- License: Apache 2.0

	## Usage with CrispASR

	```bash
	# Add punctuation to any ASR backend's output
	crispasr --backend wav2vec2 -m wav2vec2.gguf --punc-model fireredpunc-q8_0.gguf -f audio.wav

	# Works with all backends
	crispasr --backend omniasr -m omniasr-ctc-1b-q4_k.gguf --punc-model fireredpunc-q8_0.gguf -f audio.wav
	```

	## Available Files

	\| File \| Quant \| Size \| Description \|
	\|------\|-------\|------\|-------------\|
	\| `fireredpunc.gguf` \| F16 \| 195 MB \| Full precision \|
	\| `fireredpunc-q8_0.gguf` \| Q8_0 \| 104 MB \| Recommended — lossless quality \|
	\| `fireredpunc-q4_k.gguf` \| Q4_K \| 56 MB \| Smaller but may miss some punctuation \|

	Recommendation: Use Q8_0. Token classification is more sensitive to quantization than language modeling — Q4_K occasionally drops commas that Q8_0 and F16 correctly predict.

	## Example

	Input (from CTC ASR):
	```
	and so my fellow americans ask not what your country can do for you ask what you can do for your country
	```

	Output (with FireRedPunc):
	```
	And so my fellow americans， ask not what your country can do for you， ask what you can do for your country？
	```

	## Conversion

	```bash
	python models/convert-fireredpunc-to-gguf.py \
	--input /path/to/FireRedPunc \
	--output fireredpunc.gguf
	```

	## Original Model

	- Source: [FireRedTeam/FireRedPunc](https://huggingface.co/FireRedTeam/FireRedPunc)
	- Paper: [FireRedASR2S](https://arxiv.org/abs/2603.10420)
	- Base: [LERT (Linguistically-motivated BERT)](https://github.com/ymcui/LERT)
	- Training: 18.57B Chinese characters + 2.20B English words