File size: 4,545 Bytes
9af49af
4150268
 
9af49af
d3432ac
cf6949f
d3432ac
9af49af
d3432ac
9af49af
d3432ac
9af49af
d3432ac
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
faae234
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
---
title:  Arabic Transcriber Pro
emoji: πŸ—£οΈ
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.48.0
app_file: app.py
pinned: true
---
# πŸŽ™οΈ Arabic Transcriber Pro

> **Convert Arabic speech to text with precision β€” powered by NVIDIA NeMo and Streamlit.**  
> ✨ Live Demo: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)  
> πŸ”— Portfolio: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/)

<div align="center">
  <img src="https://img.shields.io/badge/Python-3.8%2B-blue?style=for-the-badge&logo=python" />
  <img src="https://img.shields.io/badge/Streamlit-1.28.0+-orange?style=for-the-badge&logo=streamlit" />
  <img src="https://img.shields.io/badge/NVIDIA%20NeMo-ASR%20Model-blueviolet?style=for-the-badge&logo=nvidia" />
  <img src="https://img.shields.io/badge/Hugging%20Face-Spaces-FF4B4B?style=for-the-badge&logo=huggingface" />
  <img src="https://img.shields.io/badge/License-MIT-green?style=for-the-badge" />
</div>

<br />

![App Screenshot](https://via.placeholder.com/1200x800/0b132b/5bc0be?text=Arabic+Transcriber+Pro)  
*Screenshot: Gloomy-elegant UI with real-time transcription and audio visualization*

---

## 🌟 Overview

**Arabic Transcriber Pro** is a sleek, AI-powered web application that converts spoken **Arabic audio** into accurate, readable text using **NVIDIA’s state-of-the-art NeMo ASR model**. Designed with a modern, **gloomy-elegant aesthetic**, this tool delivers fast, reliable transcription for podcasts, interviews, lectures, and more β€” all within a user-friendly Streamlit interface hosted on **Hugging Face Spaces**.

Built by **Yahya Alnwsany** β€” AI Engineer, NLP Specialist, and Hugging Face Ambassador β€” this project reflects a deep commitment to advancing Arabic NLP and making AI accessible for real-world applications.

πŸ”— **Live Demo**: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)  
πŸ‘€ **Developer Portfolio**: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/)

---

## πŸ”§ Features

- βœ… **High-Accuracy Arabic ASR** using `nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0`
- 🎧 **Multi-Format Support**: WAV, MP3, OGG, FLAC, M4A
- πŸ”„ **Auto Audio Conversion**: Resamples to 16kHz mono WAV for optimal model input
- ⚑ **Fast Processing** with real-time progress feedback
- πŸ’Ύ **Downloadable Transcripts** in `.txt` format
- 🌐 **Web-Based UI** with Streamlit β€” no installation needed
- 🎨 **Elegant Dark Theme** with RTL-ready Arabic text rendering
- πŸ“Š **Audio Metadata Display**: Duration, sample rate, channels
- πŸš€ **Cached Model Loading** for improved performance

---

## πŸ–ΌοΈ UI Design Highlights

- **Color Palette**: Deep navy (`#0b132b`, `#1c2541`) with teal (`#5bc0be`) and coral (`#e55934`) accents
- **Typography**: Clean, modern sans-serif with RTL support
- **Interactive Elements**: Smooth progress bars, hover effects, and responsive layout
- **Responsive Cards & Gradient Headers** for professional feel

---

## πŸ› οΈ Tech Stack

| Component        | Technology |
|------------------|----------|
| Frontend         | [Streamlit](https://streamlit.io) |
| ASR Engine       | [NVIDIA NeMo](https://github.com/NVIDIA/NeMo) |
| Audio Processing | `pydub`, `soundfile` |
| Styling          | Custom CSS (Dark Theme, RTL Support) |
| Hosting          | [Hugging Face Spaces](https://huggingface.co/spaces) |
| Deployment       | Docker / Streamlit / Git |

---

## ▢️ Try It Live

Visit the live app on Hugging Face:

πŸ‘‰ [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)

No setup required β€” just upload an Arabic audio file and get instant transcription.

---

## πŸ“¦ Project Structure
```python
Arabic-transcriber-pro/
β”‚
β”œβ”€β”€ app.py # Main Streamlit application
β”œβ”€β”€ requirements.txt # Python dependencies
β”œβ”€β”€ README.md # This file
```

---

## πŸ“‚ Supported Audio Formats

| Format | Extension | Notes |
|-------|----------|-------|
| WAV   | `.wav`   | Native support |
| MP3   | `.mp3`   | Requires `ffmpeg` |
| OGG   | `.ogg`   | Vorbis/Opus |
| FLAC  | `.flac`  | Lossless |
| M4A   | `.m4a`   | AAC audio |

> πŸ” All files are automatically converted to **16kHz mono WAV** before transcription.

---

## About the Developer

### πŸ‘€ [Yahya Alnwsany](https://nightprincey.github.io/Portfolio/)"# Arabic-Transcriber-Pro"