NightPrince committed on
Commit d3432ac · verified · 1 Parent(s): 30f66dd

Update README.md

Files changed (1)
  1. README.md +108 -4
README.md CHANGED
@@ -2,11 +2,115 @@
  title: Arabic ASR
  emoji: 💻
  colorFrom: green
- colorTo: yellow
  sdk: streamlit
- sdk_version: 1.48.0
  app_file: app.py
- pinned: false
  ---

- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
  title: Arabic ASR
  emoji: 💻
  colorFrom: green
+ colorTo: red
  sdk: streamlit
+ sdk_version: 1.48.0
  app_file: app.py
+ pinned: true
  ---
+ # 🎙️ Arabic Transcriber Pro

+ > **Convert Arabic speech to text with precision, powered by NVIDIA NeMo and Streamlit.**
+ > ✨ Live Demo: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)
+ > 🔗 Portfolio: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/)
+
+ <div align="center">
+ <img src="https://img.shields.io/badge/Python-3.8%2B-blue?style=for-the-badge&logo=python" />
+ <img src="https://img.shields.io/badge/Streamlit-1.28.0+-orange?style=for-the-badge&logo=streamlit" />
+ <img src="https://img.shields.io/badge/NVIDIA%20NeMo-ASR%20Model-blueviolet?style=for-the-badge&logo=nvidia" />
+ <img src="https://img.shields.io/badge/Hugging%20Face-Spaces-FF4B4B?style=for-the-badge&logo=huggingface" />
+ <img src="https://img.shields.io/badge/License-MIT-green?style=for-the-badge" />
+ </div>
+
+ <br />
+
+ ![App Screenshot](https://via.placeholder.com/1200x800/0b132b/5bc0be?text=Arabic+Transcriber+Pro)
+ *Screenshot: Gloomy-elegant UI with real-time transcription and audio visualization*
+
+ ---
+
+ ## 🌟 Overview
+
+ **Arabic Transcriber Pro** is an AI-powered web application that converts spoken **Arabic audio** into readable text using **NVIDIA’s NeMo ASR model**. Designed with a modern, **gloomy-elegant aesthetic**, it transcribes podcasts, interviews, lectures, and more, all within a Streamlit interface hosted on **Hugging Face Spaces**.
+
+ Built by **Yahya Alnwsany** (AI Engineer, NLP Specialist, and Hugging Face Ambassador), this project aims to advance Arabic NLP and make AI accessible for real-world applications.
+
+ 🔗 **Live Demo**: [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)
+ 👀 **Developer Portfolio**: [https://nightprincey.github.io/Portfolio/](https://nightprincey.github.io/Portfolio/)
+
+ ---
+
+ ## 🔧 Features
+
+ - ✅ **High-Accuracy Arabic ASR** using `nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0`
+ - 🎧 **Multi-Format Support**: WAV, MP3, OGG, FLAC, M4A
+ - 🔄 **Auto Audio Conversion**: Resamples to 16 kHz mono WAV for optimal model input
+ - ⚡ **Fast Processing** with real-time progress feedback
+ - 💾 **Downloadable Transcripts** in `.txt` format
+ - 🌐 **Web-Based UI** with Streamlit, no installation needed
+ - 🎨 **Elegant Dark Theme** with RTL-ready Arabic text rendering
+ - 📊 **Audio Metadata Display**: Duration, sample rate, channels
+ - 🚀 **Cached Model Loading** so the model is loaded once and reused
+
+ ---
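
The "Cached Model Loading" item can be pictured with a small stand-in. This is a sketch of the general pattern, not the code in `app.py`: a Streamlit app would normally use the `st.cache_resource` decorator for this, but `functools.lru_cache` is swapped in here so the example runs without Streamlit or NeMo installed. The `load_model` function and the dict it returns are hypothetical placeholders.

```python
from functools import lru_cache

# Model identifier taken from the feature list above.
MODEL_NAME = "nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0"

load_count = 0  # counts how often the expensive loader body actually runs


@lru_cache(maxsize=1)
def load_model(name: str = MODEL_NAME) -> dict:
    """Hypothetical stand-in for an expensive model load.

    A real app would download and build the ASR model here; this
    placeholder returns a dict so the sketch stays dependency-free.
    """
    global load_count
    load_count += 1
    return {"name": name, "ready": True}


first = load_model()
second = load_model()  # served from the cache; the loader body is not re-run
```

Both calls return the same object, which is the behaviour that matters for a script that is re-executed on every user interaction.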
+
+ ## 🖼️ UI Design Highlights
+
+ - **Color Palette**: Deep navy (`#0b132b`, `#1c2541`) with teal (`#5bc0be`) and coral (`#e55934`) accents
+ - **Typography**: Clean, modern sans-serif with RTL support
+ - **Interactive Elements**: Smooth progress bars, hover effects, and responsive layout
+ - **Responsive Cards & Gradient Headers** for a professional feel
+
+ ---
+
+ ## 🛠️ Tech Stack
+
+ | Component | Technology |
+ |------------------|----------|
+ | Frontend | [Streamlit](https://streamlit.io) |
+ | ASR Engine | [NVIDIA NeMo](https://github.com/NVIDIA/NeMo) |
+ | Audio Processing | `pydub`, `soundfile` |
+ | Styling | Custom CSS (Dark Theme, RTL Support) |
+ | Hosting | [Hugging Face Spaces](https://huggingface.co/spaces) |
+ | Deployment | Docker / Streamlit / Git |
+
+ ---
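
The feature list mentions displaying duration, sample rate, and channel count. For a PCM WAV file these values can be read straight from the header. The sketch below is an illustration under assumptions: it uses Python's standard `wave` module instead of the `pydub` and `soundfile` libraries named in the table, so it runs without extra installs, and `wav_metadata` is a hypothetical helper name, not a function from the app.

```python
import wave


def wav_metadata(path: str) -> dict:
    """Return duration (seconds), sample rate (Hz), and channel count of a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "duration_s": frames / rate,  # frames per channel divided by frames per second
            "sample_rate": rate,
            "channels": wav.getnchannels(),
        }
```

Compressed formats such as MP3 or M4A would need a decoder first, which is where a library like `pydub` comes in.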
+
+ ## ▶️ Try It Live
+
+ Visit the live app on Hugging Face:
+
+ 👉 [https://huggingface.co/spaces/NightPrince/Arabic-ASR](https://huggingface.co/spaces/NightPrince/Arabic-ASR)
+
+ No setup required: upload an Arabic audio file and get the transcript.
+
+ ---
+
+ ## 📦 Project Structure
+
+ ```text
+ Arabic-transcriber-pro/
+ │
+ ├── app.py # Main Streamlit application
+ ├── requirements.txt # Python dependencies
+ └── README.md # This file
+ ```
+
+ ---
+
+ ## 📂 Supported Audio Formats
+
+ | Format | Extension | Notes |
+ |-------|----------|-------|
+ | WAV | `.wav` | Native support |
+ | MP3 | `.mp3` | Requires `ffmpeg` |
+ | OGG | `.ogg` | Vorbis/Opus |
+ | FLAC | `.flac` | Lossless |
+ | M4A | `.m4a` | AAC audio |
+
+ > 🔁 All files are automatically converted to **16 kHz mono WAV** before transcription.
+
+ ---
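
To make "16 kHz mono WAV" concrete, here is a minimal sketch of the two operations involved: downmixing to one channel and resampling. It is not the app's implementation. The README lists `pydub` and `soundfile` for audio processing, whereas this example swaps in a standard-library version with a naive linear-interpolation resampler, and it assumes 16-bit PCM WAV input. All function names are hypothetical.

```python
import array
import wave

TARGET_RATE = 16_000  # 16 kHz, the rate named in the README


def to_mono(samples, channels):
    """Average interleaved channel samples into one channel."""
    if channels == 1:
        return list(samples)
    return [
        sum(samples[i:i + channels]) // channels
        for i in range(0, len(samples) - channels + 1, channels)
    ]


def resample_linear(samples, src_rate, dst_rate=TARGET_RATE):
    """Resample by linear interpolation (naive: no anti-aliasing filter)."""
    if src_rate == dst_rate or not samples:
        return list(samples)
    n_out = len(samples) * dst_rate // src_rate
    out = []
    for j in range(n_out):
        pos = j * src_rate / dst_rate  # fractional index into the source
        i = int(pos)
        frac = pos - i
        nxt = samples[min(i + 1, len(samples) - 1)]
        out.append(round(samples[i] * (1 - frac) + nxt * frac))
    return out


def convert_wav(src_path, dst_path):
    """Rewrite a 16-bit PCM WAV file as 16 kHz mono."""
    with wave.open(src_path, "rb") as src:
        if src.getsampwidth() != 2:
            raise ValueError("this sketch handles 16-bit PCM only")
        channels, rate = src.getnchannels(), src.getframerate()
        pcm = array.array("h")  # signed 16-bit samples
        pcm.frombytes(src.readframes(src.getnframes()))
    mono = array.array("h", resample_linear(to_mono(pcm, channels), rate))
    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(2)
        dst.setframerate(TARGET_RATE)
        dst.writeframes(mono.tobytes())
```

A production pipeline would low-pass filter before downsampling to avoid aliasing, which dedicated audio libraries handle; the sketch leaves that out to keep the two steps visible.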
113
+
114
+ ## About the Developer
115
+
116
+ ### πŸ‘€ [Yahya Alnwsany](https://nightprincey.github.io/Portfolio/)