Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

pltobing
/
streaming-speech-translation

Text-to-Speech
ONNX
GGUF
speech-translation
streaming-speech-translation
speech
audio
speech-recognition
automatic-speech-recognition
streaming-asr
ASR
NeMo
ONNX
cache-aware ASR
FastConformer
RNNT
Parakeet
neural-machine-translation
NMT
gemma3
llama-cpp
GGUF
conversational
TTS
xtts
xttsv2
voice-clone
gpt2
hifigan
multilingual
vq
perceiver-encoder
websocket
Model card Files Files and versions
xet
Community
streaming-speech-translation
8.88 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 31 commits
pltobing's picture
pltobing
python_client: Avoid async in playback callback, use thread async for prebuffer, reduce blocksize and latency, keep minimal threshold and gain.
c5831b3 1 day ago
  • audio_ref
    Update README command & sample rate 4 days ago
  • clients
    python_client: Avoid async in playback callback, use thread async for prebuffer, reduce blocksize and latency, keep minimal threshold and gain. 1 day ago
  • examples
    Add the logs examples 2 days ago
  • models
    Add TTS model files 4 days ago
  • src
    Revert TTS pause flush back to 600 ms as it is more natural, target NMT below this. 1 day ago
  • .gitattributes
    36 kB
    Add the logs examples 2 days ago
  • .gitignore
    24 Bytes
    Add the logs examples 2 days ago
  • ARCHITECTURE.md
    10.7 kB
    Update README command & sample rate 4 days ago
  • Dockerfile
    353 Bytes
    Add project files 6 days ago
  • LICENSE
    1.87 kB
    Add project files 6 days ago
  • README.md
    7 kB
    Update README 3 days ago
  • app.py
    6.02 kB
    Formatting black, isort, flake8 5 days ago
  • requirements.txt
    253 Bytes
    Add project files 6 days ago
  • requirements_client.txt
    45 Bytes
    Add audio output file save on client too for checking 2 days ago