Update README.md
Browse files
README.md
CHANGED
|
@@ -2,11 +2,115 @@
|
|
| 2 |
title: Arabic ASR
|
| 3 |
emoji: π»
|
| 4 |
colorFrom: green
|
| 5 |
-
colorTo:
|
| 6 |
sdk: streamlit
|
| 7 |
-
sdk_version:
|
| 8 |
app_file: app.py
|
| 9 |
-
pinned:
|
| 10 |
---
|
|
|
|
| 11 |
|
| 12 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
title: Arabic ASR
|
| 3 |
emoji: π»
|
| 4 |
colorFrom: green
|
| 5 |
+
colorTo: red
|
| 6 |
sdk: streamlit
|
| 7 |
+
sdk_version: 1.48.0
|
| 8 |
app_file: app.py
|
| 9 |
+
pinned: true
|
| 10 |
---
|
| 11 |
+
# ποΈ Arabic Transcriber Pro
|
| 12 |
|
| 13 |
+
> **Convert Arabic speech to text with precision β powered by NVIDIA NeMo and Streamlit.**
|
| 14 |
+
> β¨ Live Demo: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)
|
| 15 |
+
> π Portfolio: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/)
|
| 16 |
+
|
| 17 |
+
<div align="center">
|
| 18 |
+
<img src="https://img.shields.io/badge/Python-3.8%2B-blue?style=for-the-badge&logo=python" />
|
| 19 |
+
<img src="https://img.shields.io/badge/Streamlit-1.28.0+-orange?style=for-the-badge&logo=streamlit" />
|
| 20 |
+
<img src="https://img.shields.io/badge/NVIDIA%20NeMo-ASR%20Model-blueviolet?style=for-the-badge&logo=nvidia" />
|
| 21 |
+
<img src="https://img.shields.io/badge/Hugging%20Face-Spaces-FF4B4B?style=for-the-badge&logo=huggingface" />
|
| 22 |
+
<img src="https://img.shields.io/badge/License-MIT-green?style=for-the-badge" />
|
| 23 |
+
</div>
|
| 24 |
+
|
| 25 |
+
<br />
|
| 26 |
+
|
| 27 |
+

|
| 28 |
+
*Screenshot: Gloomy-elegant UI with real-time transcription and audio visualization*
|
| 29 |
+
|
| 30 |
+
---
|
| 31 |
+
|
| 32 |
+
## π Overview
|
| 33 |
+
|
| 34 |
+
**Arabic Transcriber Pro** is a sleek, AI-powered web application that converts spoken **Arabic audio** into accurate, readable text using **NVIDIAβs state-of-the-art NeMo ASR model**. Designed with a modern, **gloomy-elegant aesthetic**, this tool delivers fast, reliable transcription for podcasts, interviews, lectures, and more β all within a user-friendly Streamlit interface hosted on **Hugging Face Spaces**.
|
| 35 |
+
|
| 36 |
+
Built by **Yahya Alnwsany** β AI Engineer, NLP Specialist, and Hugging Face Ambassador β this project reflects a deep commitment to advancing Arabic NLP and making AI accessible for real-world applications.
|
| 37 |
+
|
| 38 |
+
π **Live Demo**: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)
|
| 39 |
+
π€ **Developer Portfolio**: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/)
|
| 40 |
+
|
| 41 |
+
---
|
| 42 |
+
|
| 43 |
+
## π§ Features
|
| 44 |
+
|
| 45 |
+
- β
**High-Accuracy Arabic ASR** using `nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0`
|
| 46 |
+
- π§ **Multi-Format Support**: WAV, MP3, OGG, FLAC, M4A
|
| 47 |
+
- π **Auto Audio Conversion**: Resamples to 16kHz mono WAV for optimal model input
|
| 48 |
+
- β‘ **Fast Processing** with real-time progress feedback
|
| 49 |
+
- πΎ **Downloadable Transcripts** in `.txt` format
|
| 50 |
+
- π **Web-Based UI** with Streamlit β no installation needed
|
| 51 |
+
- π¨ **Elegant Dark Theme** with RTL-ready Arabic text rendering
|
| 52 |
+
- π **Audio Metadata Display**: Duration, sample rate, channels
|
| 53 |
+
- π **Cached Model Loading** for improved performance
|
| 54 |
+
|
| 55 |
+
---
|
| 56 |
+
|
| 57 |
+
## πΌοΈ UI Design Highlights
|
| 58 |
+
|
| 59 |
+
- **Color Palette**: Deep navy (`#0b132b`, `#1c2541`) with teal (`#5bc0be`) and coral (`#e55934`) accents
|
| 60 |
+
- **Typography**: Clean, modern sans-serif with RTL support
|
| 61 |
+
- **Interactive Elements**: Smooth progress bars, hover effects, and responsive layout
|
| 62 |
+
- **Responsive Cards & Gradient Headers** for professional feel
|
| 63 |
+
|
| 64 |
+
---
|
| 65 |
+
|
| 66 |
+
## π οΈ Tech Stack
|
| 67 |
+
|
| 68 |
+
| Component | Technology |
|
| 69 |
+
|------------------|----------|
|
| 70 |
+
| Frontend | [Streamlit](https://streamlit.io) |
|
| 71 |
+
| ASR Engine | [NVIDIA NeMo](https://github.com/NVIDIA/NeMo) |
|
| 72 |
+
| Audio Processing | `pydub`, `soundfile` |
|
| 73 |
+
| Styling | Custom CSS (Dark Theme, RTL Support) |
|
| 74 |
+
| Hosting | [Hugging Face Spaces](https://huggingface.co/spaces) |
|
| 75 |
+
| Deployment | Docker / Streamlit / Git |
|
| 76 |
+
|
| 77 |
+
---
|
| 78 |
+
|
| 79 |
+
## βΆοΈ Try It Live
|
| 80 |
+
|
| 81 |
+
Visit the live app on Hugging Face:
|
| 82 |
+
|
| 83 |
+
π [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)
|
| 84 |
+
|
| 85 |
+
No setup required β just upload an Arabic audio file and get instant transcription.
|
| 86 |
+
|
| 87 |
+
---
|
| 88 |
+
|
| 89 |
+
## π¦ Project Structure
|
| 90 |
+
```python
|
| 91 |
+
Arabic-transcriber-pro/
|
| 92 |
+
β
|
| 93 |
+
βββ app.py # Main Streamlit application
|
| 94 |
+
βββ requirements.txt # Python dependencies
|
| 95 |
+
βββ README.md # This file
|
| 96 |
+
```
|
| 97 |
+
|
| 98 |
+
---
|
| 99 |
+
|
| 100 |
+
## π Supported Audio Formats
|
| 101 |
+
|
| 102 |
+
| Format | Extension | Notes |
|
| 103 |
+
|-------|----------|-------|
|
| 104 |
+
| WAV | `.wav` | Native support |
|
| 105 |
+
| MP3 | `.mp3` | Requires `ffmpeg` |
|
| 106 |
+
| OGG | `.ogg` | Vorbis/Opus |
|
| 107 |
+
| FLAC | `.flac` | Lossless |
|
| 108 |
+
| M4A | `.m4a` | AAC audio |
|
| 109 |
+
|
| 110 |
+
> π All files are automatically converted to **16kHz mono WAV** before transcription.
|
| 111 |
+
|
| 112 |
+
---
|
| 113 |
+
|
| 114 |
+
## About the Developer
|
| 115 |
+
|
| 116 |
+
### π€ [Yahya Alnwsany](https://nightprincey.github.io/Portfolio/)
|