Quick Start Guide

Install dependencies (already done):
```
cd parakeet-web-demo
npm install
```
Start development server:
```
npm run dev
```
Open browser:
- Navigate to: http://localhost:3000
- Use a WebGPU-compatible browser (Chrome 113+ or Edge 113+)
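If you are unsure whether your browser qualifies, a quick feature check can tell you before you download the model. The sketch below is illustrative only and is not taken from the demo's source: `hasWebGPU` is a hypothetical helper name. It relies on the standard `navigator.gpu` entry point, which WebGPU-capable browsers expose.

```javascript
// Hypothetical helper (not part of the demo): returns true when the given
// navigator-like object exposes the WebGPU entry point `gpu`.
function hasWebGPU(nav) {
  return Boolean(nav && nav.gpu);
}

// Works in the browser console; the typeof guard keeps the snippet safe
// in environments that have no global `navigator`.
const nav = typeof navigator !== 'undefined' ? navigator : undefined;
console.log(hasWebGPU(nav) ? 'WebGPU is available' : 'WebGPU is not available');
```

A `true` result only means the API is present. The browser can still fail to provide a GPU adapter on unsupported hardware, so treat this as a first check rather than a guarantee.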
Use the demo:
- Click "Load Model" (downloads ~2GB ONNX model, one-time only)
- Wait for model to load (30s-2min depending on connection)
- Click "Start Recording" and grant microphone permissions
- Speak and watch real-time progressive transcriptions!
- Click "Stop Recording" when done

Latency: Time taken to process one audio chunk
RTF (Real-time Factor): Processing time divided by audio duration
- Below 1.0 = faster than real-time ✓
- Above 1.0 = slower than real-time ⚠️
Window State:
- "growing" (0-15s): Accumulating audio for accuracy
- "sliding" (>15s): Smart sentence-aware windowing
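
To make the metrics concrete, here is a minimal sketch of how RTF and the window state could be computed. It is an assumption-based illustration, not the code in src/utils/progressive-streaming.js: the function names and the `GROWING_WINDOW_SECONDS` constant are hypothetical, and the 15-second boundary is taken from the description above.

```javascript
// Assumed boundary from this guide: the window grows for the first 15 s.
const GROWING_WINDOW_SECONDS = 15;

// RTF = time spent processing / duration of the audio that was processed.
// Both arguments use the same unit (milliseconds here).
function realTimeFactor(processingMs, audioMs) {
  if (audioMs <= 0) {
    throw new RangeError('audioMs must be positive');
  }
  return processingMs / audioMs;
}

// An RTF below 1.0 means the chunk was processed faster than real time.
function isFasterThanRealTime(rtf) {
  return rtf < 1.0;
}

// "growing" up to and including 15 s of audio, "sliding" after that.
function windowState(audioSeconds) {
  return audioSeconds <= GROWING_WINDOW_SECONDS ? 'growing' : 'sliding';
}

// Example: 500 ms spent processing a 2000 ms chunk.
const rtf = realTimeFactor(500, 2000);
console.log(rtf, isFasterThanRealTime(rtf), windowState(12));
```

In this example the chunk takes a quarter of its own duration to process, so transcription keeps up with speech with time to spare.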

```
npm run build
npm run preview
```

The build output is written to the dist/ folder.

- src/App.jsx - Main application component
- src/worker.js - Web Worker for model inference
- src/utils/progressive-streaming.js - Smart streaming algorithm (ported from Python)
- src/utils/audio.js - Microphone capture and audio processing
- src/components/TranscriptionDisplay.jsx - Live transcription UI
- src/components/PerformanceMetrics.jsx - Developer metrics dashboard

Enjoy the demo! 🎤