---
title: Whisper Large V3 Turbo
emoji: 🎤
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.46.0
pinned: false
license: mit
short_description: Whisper LV3Turbo for ASR -API
---

# Whisper Large V3 Turbo - Speech Recognition

This Space provides a complete speech recognition solution using OpenAI's Whisper Large V3 Turbo model. Features a beautiful, modern web interface with direct transcription and download capabilities.

## Features

- 🚀 **Fast Inference**: Uses Whisper Large V3 Turbo (reduced from 32 to 4 decoding layers)
- 🎯 **High Accuracy**: State-of-the-art speech recognition
- 🌍 **Multilingual**: Supports 99 languages
- 🎨 **Modern UI**: Beautiful, responsive Gradio interface
- 📥 **Download Support**: Save transcriptions as text files
- 🔄 **API Endpoints**: RESTful API for external integration
- 🔒 **CORS Enabled**: Works with any frontend

## Usage

### Web Interface
Simply visit the Space and use the intuitive web interface:
1. **Upload** an audio/video file or **record** directly
2. Click **Transcribe Audio** to process
3. View results in the scrollable area
4. **Download** the transcription as a text file

### API Endpoints

#### Transcribe Audio
```bash
POST /transcribe
Content-Type: multipart/form-data

# Send audio file as form data
curl -X POST "https://your-space.hf.space/transcribe" \
  -F "file=@audio.mp3"
```

#### Health Check
```bash
GET /health

curl "https://your-space.hf.space/health"
```

## API Endpoints

### POST /transcribe
Transcribe an audio/video file.

**Request:**
- Method: POST
- Content-Type: multipart/form-data
- Body: File upload

**Response:**
```json
{
  "text": "Transcribed text here...",
  "success": true
}
```

### GET /health
Check API health status.

**Response:**
```json
{
  "status": "healthy",
  "model_loaded": true
}
```

## Usage Examples

### cURL Example
```bash
curl -X POST "https://your-space.hf.space/transcribe" \
  -F "file=@audio.mp3"
```

### Python Example
```python
import requests

# Transcribe audio file
with open('audio.mp3', 'rb') as f:
    files = {'file': ('audio.mp3', f, 'audio/mpeg')}
    response = requests.post('https://your-space.hf.space/transcribe', files=files)
    result = response.json()
    
    if result['success']:
        print(result['text'])
    else:
        print(f"Error: {result['error']}")
```

### JavaScript Example
```javascript
const formData = new FormData();
formData.append('file', audioFile);

fetch('https://your-space.hf.space/transcribe', {
    method: 'POST',
    body: formData
})
.then(response => response.json())
.then(result => {
    if (result.success) {
        console.log(result.text);
    } else {
        console.error(result.error);
    }
});
```

## Supported File Formats

### Audio Formats
- MP3, WAV, FLAC, M4A, OGG

### Video Formats  
- MP4, AVI, MOV, MKV

*Note: For video files, only the audio track will be processed.*

## Model Details

- **Model**: Whisper Large V3 Turbo (809M parameters)
- **Speed**: ~4x faster than standard Large V3
- **GPU**: Uses ZeroGPU for efficient inference
- **Languages**: Supports 99 languages
- **Accuracy**: Maintains high accuracy despite speed optimizations

## Example Response

```json
{
  "text": "Hello, this is a transcription of the audio file.",
  "success": true
}
```

## Troubleshooting

```json
{
  "error": "Transcription failed: [error message]",
  "success": false
}
```