Spaces: N4DerAX20 / Arabic-Transcriber-app

N4DerAX20 committed on Jun 15, 2025

Commit 4951081 (verified) · 1 Parent(s): 4533d63

Upload 3 files

Files changed (3)

README.md +37 -11
requirements.txt +3 -0
streamlit_app.py +53 -0

README.md CHANGED

@@ -1,14 +1,40 @@
 ---
-title: Arabic Transcriber App
-emoji: 🚀
-colorFrom: red
-colorTo: red
-sdk: gradio
-sdk_version: 5.34.0
-app_file: app.py
-pinned: false
-license: mit
-short_description: Arabic Transcription to SRT
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

+# 🎙️ Arabic VO to Subtitle Generator (.srt / .fcpxmld)
+This app takes an Arabic voiceover audio file (MP3 or WAV) and automatically transcribes it using OpenAI Whisper, producing subtitles in SRT format or FCPXMLD format for Final Cut Pro X.
+---
+## 🚀 Features
+- 🧠 Transcription using Whisper (via Faster-Whisper for speed)
+- 📝 Outputs `.srt` subtitle files for editing and broadcast
+- 🎬 Also exports `.fcpxmld` for direct use in Final Cut Pro X
+- 🔠 Custom options for vertical or horizontal layout
+- 🌍 Supports Arabic (RTL) and other languages
+---
+## 📂 How to Use
+1. Upload your Arabic MP3/WAV voiceover
+2. Choose:
+   - Layout: Vertical (mobile) or Horizontal (TV)
+   - Lines per subtitle: 1 or 2
+   - Export format: `.srt` or `.fcpxmld`
+3. Click **Transcribe**
+4. Preview subtitles in the browser
+5. Download the final file
 ---
+## 🖥️ Powered By
+- [Faster Whisper](https://github.com/guillaumekln/faster-whisper)
+- [Streamlit](https://streamlit.io)
+- [Hugging Face Spaces](https://huggingface.co/spaces)
 ---
+## 📜 License
+MIT — use it freely, credit appreciated!

requirements.txt ADDED

@@ -0,0 +1,3 @@

+streamlit
+faster-whisper
+torch

streamlit_app.py ADDED

@@ -0,0 +1,53 @@

+import streamlit as st
+import tempfile
+from faster_whisper import WhisperModel
+import textwrap
+from datetime import timedelta
+st.title("🎙️ Arabic VO to Subtitle Generator (.srt)")
+uploaded_file = st.file_uploader("Upload Arabic MP3 or WAV", type=["mp3", "wav"])
+model_size = st.selectbox("Model Size", ["tiny", "base", "small", "medium"], index=3)
+layout = st.selectbox("Video Layout", ["Horizontal (37 chars)", "Vertical (25 chars)"])
+lines = st.selectbox("Lines per Subtitle", [1, 2], index=1)
+def format_time(seconds):
+    td = timedelta(seconds=seconds)
+    result = str(td)[:11].replace(".", ",")
+    return result if "," in result else result + ",000"
+if uploaded_file:
+    with st.spinner("Transcribing with Whisper..."):
+        with tempfile.NamedTemporaryFile(delete=False) as temp_audio:
+            temp_audio.write(uploaded_file.read())
+            temp_audio.flush()
+            whisper = WhisperModel(model_size, device="cpu", compute_type="int8")
+            segments, _ = whisper.transcribe(temp_audio.name, language="ar")
+        max_chars = 25 if "Vertical" in layout else 37
+        max_lines = int(lines)
+        srt_text = ""
+        count = 1
+        for seg in segments:
+            start = seg.start
+            end = seg.end
+            text = seg.text.strip()
+            lines = textwrap.wrap(text, width=max_chars)
+            grouped = [lines[i:i+max_lines] for i in range(0, len(lines), max_lines)]
+            chunk_count = len(grouped)
+            duration = end - start
+            chunk_duration = duration / chunk_count if chunk_count > 0 else duration
+            for j, chunk in enumerate(grouped):
+                chunk_start = start + j * chunk_duration
+                chunk_end = chunk_start + chunk_duration
+                timestamp = f"{format_time(chunk_start)} --> {format_time(chunk_end)}"
+                content = "\n".join(chunk)
+                srt_text += f"{count}\n{timestamp}\n{content}\n\n"
+                count += 1
+        st.text_area("Preview SRT", value=srt_text, height=300)
+        st.download_button("⬇️ Download .srt", srt_text, file_name="output.srt", mime="text/plain")
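
Note on `format_time` in streamlit_app.py: `str(timedelta(...))` prints the hour field without zero padding, so the function as committed appears to produce timestamps such as `0:00:01,500`, while SRT files conventionally use `HH:MM:SS,mmm` (`00:00:01,500`). Some players tolerate the short form, but stricter parsers may not. The sketch below is one possible replacement that works on integer milliseconds instead of slicing a string. The names `format_srt_time` and `srt_cue` are hypothetical and not part of this commit, and clamping negative inputs to zero is an assumption.

```python
def format_srt_time(seconds: float) -> str:
    """Format an offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    # Work in whole milliseconds; negative inputs are clamped to zero (assumption).
    total_ms = max(0, round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def srt_cue(index: int, start: float, end: float, text: str) -> str:
    """Build one SRT cue: index line, time range line, text, blank line."""
    time_range = f"{format_srt_time(start)} --> {format_srt_time(end)}"
    return f"{index}\n{time_range}\n{text}\n\n"
```

With helpers like these, the inner loop could append `srt_cue(count, chunk_start, chunk_end, content)` to `srt_text` in place of building the timestamp string inline.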