finetune the decoder on text only data?

by leestevennz - opened Jul 11, 2025

Discussion

leestevennz

Jul 11, 2025

Hi there,

Just wondering if it possible to finetune the decoder on text only data, for domain adaptation?

urroxyz

Aug 12, 2025

Canary's decoder is a Transformer LM conditioned on encoder outputs so it is possible to adapt the decoder of a sequence-to-sequence ASR model like Canary using only text data.

It can be done through shallow or cold/deep fusion, or (what I would recommend) continued pretraining.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment