moinbach7 committed
Commit a7e2179 · verified · 1 Parent(s): 5d4c68e

Update README.md

Files changed (1)
  1. README.md +57 -54
README.md CHANGED
@@ -1,54 +1,57 @@
- ---
- license: gpl-3.0
- language:
- - en
- pipeline_tag: text2text-generation
- tags:
- - code
- - asr
- - inverse text normalization
- datasets:
- - text-normalization-challenge-english-language
- ---
-
- ---
-
- ---
-
- # asr_inverse_text_normalization
-
- Finetuned a facebook/bart-base Pretrained model on the ASR inverse text normalization dataset by treating it as a seq2seq task. Other approaches which may be considered is by considering it as a TokenClassification task and the one mentioned here https://machinelearning.apple.com/research/inverse-text-normal.
-
- ## Model description
-
- BART (Bidirectional and Auto-Regressive Transformers) is a pre-trained transformer-based neural network model developed by Facebook AI Research (FAIR) for various natural language processing (NLP) tasks
-
- The BART architecture is based on the Transformer model, which is a type of neural network architecture that processes sequential input data, such as text, by applying self-attention mechanisms to capture the relationships between different words in the input sequence.
- BART includes both auto-regressive and bidirectional encoder-decoder transformer architectures, which enable it to perform both generation and prediction tasks
-
- BART was trained on a diverse range of NLP tasks, including machine translation, summarization, and question answering, and has shown strong performance across multiple benchmarks.
- Its training process involves corrupting text with different types of noise and training the model to reconstruct the original text, which has been shown to improve the model's ability to generalize to new tasks and outperform other pre-trained language models like GPT and BERT
-
- The model flavour which was chosen is that of "facebook/bart-base" and columns "after" is used as the source while "before" column is used as the targets.
-
- ## Intended uses & limitations
-
- This model can be used as an out-of-the-box solution to the invesrse text normalization which can convert ASR generated un-normalized text such as
- "my c v v for my card is five six seven and it expires on november twenty three" -> "my CVV for my card is 567 and it expires on November 23"
-
- The model needs to be explored for various min and max length setting at the time of generation for your specific usecase
-
- ### How to use
-
- ```python
-
- >>> from transformers import pipeline
- >>> generator = pipeline(model="pavanBuduguppa/asr_inverse_text_normalization")
-
- >>> generator("my c v v for my card is five six seven and it expires on november twenty three")
-
- ```
-
- ## Training data
-
- All credits and rights for the training data belongs to Google. The data was merely obtained and processed for this model and the original data can be found here https://www.kaggle.com/competitions/text-normalization-challenge-english-language/data

+ ---
+ license: gpl-3.0
+ language:
+ - en
+ pipeline_tag: text2text-generation
+ tags:
+ - code
+ - asr
+ - inverse text normalization
+ datasets:
+ - text-normalization-challenge-english-language
+ ---
+
+ # ASR Inverse Text Normalization
+
+ This repository provides a **fine-tuned BART model** for the task of **ASR Inverse Text Normalization (ITN)**.
+ The goal is to transform raw, unnormalized ASR transcripts into properly formatted written text.
+
+ ---
+
+ ## Model Overview
+
+ **BART (Bidirectional and Auto-Regressive Transformers)** is a transformer-based model introduced by Facebook AI Research (FAIR).
+ It is designed for both text understanding and generation tasks.
+
+ - **Architecture**: Encoder–decoder Transformer with self-attention.
+ - **Pretraining objective**: Reconstruct the original text from corrupted/noisy versions.
+ - **Applications**: Summarization, machine translation, question answering, and text normalization.
+
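The pretraining objective above (reconstructing text from corrupted versions) can be illustrated with a toy text-infilling sketch. This is a hypothetical, simplified illustration, not BART's actual noising code: one contiguous span of tokens is collapsed into a single mask token, and the model is trained to recover the original sequence.

```python
import random

def infill_noise(tokens, span_len=2, seed=0):
    """Toy version of BART-style text infilling:
    replace one contiguous span of tokens with a single <mask> token."""
    rng = random.Random(seed)
    start = rng.randrange(len(tokens) - span_len + 1)
    return tokens[:start] + ["<mask>"] + tokens[start + span_len:]

tokens = "my cvv expires on november 23".split()
corrupted = infill_noise(tokens)
# During pretraining, the model learns to reconstruct `tokens` from `corrupted`.
```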
+ For this project:
+
+ - Base model: `facebook/bart-base`
+ - Training setup: treated as a **sequence-to-sequence** problem
+ - Dataset: [Text Normalization Challenge - English Language (Kaggle)](https://www.kaggle.com/competitions/text-normalization-challenge-english-language/data)
+ - Columns used:
+   - Input: `"after"` (spoken-form, ASR-like text)
+   - Target: `"before"` (written, normalized text)
+
+ ---
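The column mapping above can be sketched as a small preprocessing step. A minimal sketch, assuming the Kaggle CSV's token-level `sentence_id`/`before`/`after` layout; the rows below are illustrative stand-ins, not real competition data:

```python
import pandas as pd

# Illustrative rows in the Kaggle dataset's token-level layout;
# the real data comes from the competition download.
rows = pd.DataFrame({
    "sentence_id": [0, 0, 0, 1, 1],
    "before":      ["CVV", "is", "567", "November", "23"],
    "after":       ["c v v", "is", "five six seven", "november", "twenty three"],
})

def build_pairs(df):
    """Join each sentence's tokens into (source, target) strings:
    spoken-form "after" is the model input, written-form "before" the target."""
    grouped = df.groupby("sentence_id")
    sources = grouped["after"].apply(" ".join)
    targets = grouped["before"].apply(" ".join)
    return list(zip(sources, targets))

pairs = build_pairs(rows)
# pairs[0] -> ("c v v is five six seven", "CVV is 567")
```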
+
+ ## Intended Use
+
+ The model can be applied directly to **normalize ASR outputs** in speech-to-text pipelines.
+ Generation settings such as minimum and maximum output length may need tuning for your specific use case.
+
+ ---
+
+ ## Quickstart
+
+ ```python
+ from transformers import pipeline
+
+ # Load the inverse text normalization pipeline
+ generator = pipeline(model="pavanBuduguppa/asr_inverse_text_normalization")
+
+ # Run inference
+ result = generator("my c v v for my card is five six seven and it expires on november twenty three")
+ print(result)
+ ```