NVIDIA Parakeet TDT 0.6B V3 (Multilingual), converted to ONNX format for onnx-asr, with some configuration changes for the custom version of Transformers.js used in the Subtitle Anything! browser extension.

  // `pipeline` comes from the custom Transformers.js build mentioned above.
  // `loadAudio` is a helper that is not defined in this snippet; it is assumed
  // to return the decoded audio samples for the given URL.
  const model = 'ae9is/parakeet-tdt-0.6b-v3-onnx'
  const transcriber = await pipeline('automatic-speech-recognition', model)
  const url = 'https://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/french-audio.wav'
  const audioData = await loadAudio(url)
  // Transcribe French
  const output = await transcriber(audioData, { language: 'french', task: 'transcribe' })
  const expected = " J'adore, j'aime, je n'aime pas, je déteste."
  console.log('output', output?.text)
  console.log('expected', expected)
  // Release the resources held by the pipeline
  await transcriber.dispose()
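
The snippet above calls `loadAudio` without showing it. The following is a minimal sketch of what such a helper might look like, not the extension's actual implementation. It assumes the file is an uncompressed 16-bit PCM WAV and that the pipeline accepts mono `Float32Array` samples. `decodePcm16Wav` is a hypothetical name introduced here for illustration. No resampling is done, so the file is assumed to already be at the sample rate the model expects.

```javascript
// Hypothetical helper: decode a 16-bit PCM WAV buffer into mono Float32 samples.
// Assumes a plain RIFF/WAVE layout; other encodings are rejected.
function decodePcm16Wav(arrayBuffer) {
  const view = new DataView(arrayBuffer)
  // Read a four-character chunk tag at the given byte offset.
  const tag = (offset) =>
    String.fromCharCode(
      view.getUint8(offset),
      view.getUint8(offset + 1),
      view.getUint8(offset + 2),
      view.getUint8(offset + 3)
    )
  if (tag(0) !== 'RIFF' || tag(8) !== 'WAVE') {
    throw new Error('Not a RIFF/WAVE file')
  }
  let channels = 0
  let sampleRate = 0
  let offset = 12
  // Walk the chunk list until the "data" chunk is found.
  while (offset + 8 <= view.byteLength) {
    const id = tag(offset)
    const size = view.getUint32(offset + 4, true)
    const body = offset + 8
    if (id === 'fmt ') {
      const format = view.getUint16(body, true)
      channels = view.getUint16(body + 2, true)
      sampleRate = view.getUint32(body + 4, true)
      const bitsPerSample = view.getUint16(body + 14, true)
      if (format !== 1 || bitsPerSample !== 16) {
        throw new Error('Only 16-bit PCM is supported in this sketch')
      }
    } else if (id === 'data') {
      if (channels === 0) throw new Error('Missing fmt chunk before data')
      const available = Math.min(size, view.byteLength - body)
      const frames = Math.floor(available / (2 * channels))
      const samples = new Float32Array(frames)
      for (let i = 0; i < frames; i++) {
        // Average all channels into one, then scale to [-1, 1).
        let sum = 0
        for (let c = 0; c < channels; c++) {
          sum += view.getInt16(body + 2 * (i * channels + c), true)
        }
        samples[i] = sum / channels / 32768
      }
      return { samples, sampleRate }
    }
    // Chunks are padded to an even number of bytes.
    offset = body + size + (size % 2)
  }
  throw new Error('No data chunk found')
}

// Sketch of loadAudio: fetch the file and return mono samples.
// Assumption: the audio is already at the sample rate the model expects.
async function loadAudio(url) {
  const response = await fetch(url)
  if (!response.ok) {
    throw new Error(`Failed to fetch ${url}: ${response.status}`)
  }
  const { samples } = decodePcm16Wav(await response.arrayBuffer())
  return samples
}
```

In a browser, decoding through the Web Audio API is a common alternative that also handles compressed formats and resampling; the manual parser here only keeps the sketch free of dependencies.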