MiakOnline commited on
Commit
78200a0
Β·
verified Β·
1 Parent(s): 09acc01

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -44
README.md CHANGED
@@ -1,68 +1,55 @@
1
- ---
2
- title: RecToTextPro
3
- emoji: πŸš€
4
- colorFrom: red
5
- colorTo: red
6
- sdk: streamlit
7
- app_port: 8501
8
- tags:
9
- - streamlit
10
- pinned: false
11
- short_description: Recording to Text
12
- license: mit
13
- sdk_version: 1.55.0
14
- ---
15
  # 🎀 RecToText Pro – Intelligent Lecture Transcriber
16
 
17
- RecToText Pro is an AI-powered web application that converts mixed Urdu and English lecture recordings into structured text. It supports Roman Urdu and English output formats and allows Excel export.
18
-
19
- ---
20
 
21
  ## πŸš€ Features
22
 
23
- - Upload .mp3, .wav, .m4a files
24
- - Automatic Urdu + English speech detection
25
- - Whisper-based transcription
26
  - Roman Urdu or English output
27
- - Text cleaning & formatting
28
- - Excel export (.xlsx)
29
- - Word count & processing time
30
- - Professional Streamlit UI
31
- - Hugging Face Spaces compatible
32
-
33
- ---
34
 
35
  ## πŸ›  Tech Stack
36
 
37
  - Python
38
  - Streamlit
39
- - OpenAI Whisper
40
  - openpyxl
 
41
  - pydub
42
 
43
- ---
44
 
45
- ## πŸ“¦ Hugging Face Deployment
46
-
47
- 1. Create a new Space
48
- 2. Choose Streamlit SDK
49
- 3. Upload:
50
  - app.py
51
  - requirements.txt
52
  - README.md
53
- 4. Commit changes
54
- 5. Wait for build to complete
55
-
56
- ---
57
 
58
  ## πŸ’» Run Locally
59
 
60
- ```bash
61
- pip install -r requirements.txt
62
  streamlit run app.py
63
- # Welcome to Streamlit!
64
 
65
- Edit `/src/streamlit_app.py` to customize this app to your heart's desire. :heart:
 
 
 
 
 
 
 
66
 
67
- If you have any questions, checkout our [documentation](https://docs.streamlit.io) and [community
68
- forums](https://discuss.streamlit.io).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # 🎀 RecToText Pro – Intelligent Lecture Transcriber
2
 
3
+ RecToText Pro is an AI-powered web application that converts mixed Urdu and English lecture recordings into structured, clean text output.
 
 
4
 
5
  ## πŸš€ Features
6
 
7
+ - Upload MP3, WAV, M4A, AAC files (Up to 200MB)
8
+ - Automatic Urdu + English detection
9
+ - Long audio (30–60 min) supported
10
  - Roman Urdu or English output
11
+ - Clean paragraph formatting
12
+ - Excel export with timestamps
13
+ - Word export with clean story formatting
14
+ - Language detection
15
+ - Word count
16
+ - Processing time display
17
+ - Hugging Face CPU compatible
18
 
19
  ## πŸ›  Tech Stack
20
 
21
  - Python
22
  - Streamlit
23
+ - faster-whisper (Whisper Open-Source Model)
24
  - openpyxl
25
+ - python-docx
26
  - pydub
27
 
28
+ ## πŸ“¦ Deployment on Hugging Face
29
 
30
+ 1. Create new Streamlit Space
31
+ 2. Upload:
 
 
 
32
  - app.py
33
  - requirements.txt
34
  - README.md
35
+ 3. Add a file named `packages.txt` containing:
36
+ ffmpeg
37
+ 4. Commit and wait for build
 
38
 
39
  ## πŸ’» Run Locally
40
 
41
+ pip install -r requirements.txt
 
42
  streamlit run app.py
 
43
 
44
+ ## ⚑ Model Selection
45
+
46
+ - base β†’ Faster, moderate accuracy
47
+ - small β†’ Better accuracy, slightly slower
48
+
49
+ For CPU deployment, base is recommended.
50
+
51
+ ## πŸ“Œ Notes
52
 
53
+ - Supports long lecture recordings.
54
+ - AAC format supported.
55
+ - Optimized for Hugging Face CPU Spaces.