---
title: SyncMaster Enhanced
emoji: ๐
colorFrom: red
colorTo: red
sdk: docker
app_port: 8501
tags:
- streamlit
- ai-translation
- speech-to-text
- multilingual
- education
pinned: false
short_description: AI-powered audio transcription
license: mit
---
# SyncMaster Enhanced - AI-Powered Audio Transcription & Translation
> **New: Enhanced with AI Translation Support for International Students**
SyncMaster is an intelligent audio-text synchronization platform specifically designed for international students in universities. It provides real-time audio recording, AI-powered transcription, and automatic translation to help students better understand and review their lectures.
## Key Features
### Multi-Language Support
- **Full Arabic Interface**: Complete Arabic UI for better accessibility
- **AI-Powered Translation**: Automatic translation to Arabic, English, French, and Spanish
- **Language Detection**: Automatically detects the source language
- **Academic Context**: Specialized translation for academic content
### Enhanced Recording
- **Browser-based Recording**: Record directly from your web browser
- **Real-time Audio Visualization**: Visual feedback during recording
- **Important Markers**: Mark important points during lectures
- **Pause/Resume**: Full control over recording sessions
### AI Technology
- **Gemini AI Integration**: Accurate transcription using Google's Gemini AI
- **Advanced Translation**: Context-aware translation for educational content
- **Parallel Processing**: Fast and efficient audio processing
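The processing pipeline itself is not shown in this README. As a minimal sketch of what parallel processing of a recording could look like, the example below splits a recording into fixed-length chunks and hands them to a thread pool. `transcribe_chunk` is a hypothetical stand-in for the real AI call, and the 30-second chunk length is an assumption.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List, Tuple


def split_into_chunks(duration_s: float, chunk_s: float = 30.0) -> List[Tuple[float, float]]:
    """Return (start, end) time ranges that cover the whole recording."""
    chunks = []
    start = 0.0
    while start < duration_s:
        end = min(start + chunk_s, duration_s)
        chunks.append((start, end))
        start = end
    return chunks


def transcribe_chunk(chunk: Tuple[float, float]) -> str:
    """Hypothetical placeholder: a real version would send this audio range to the AI service."""
    start, end = chunk
    return f"[text for {start:.0f}s-{end:.0f}s]"


def transcribe_parallel(duration_s: float, max_workers: int = 4) -> List[str]:
    """Transcribe all chunks concurrently; pool.map returns results in chunk order."""
    chunks = split_into_chunks(duration_s)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(transcribe_chunk, chunks))


if __name__ == "__main__":
    print(transcribe_parallel(75.0))
```

Threads suit this kind of work because each chunk mostly waits on a network response; joining the returned list in order reproduces the full transcript.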
### Student-Friendly Features
- **Responsive Design**: Works on desktop, tablet, and mobile
- **Keyboard Shortcuts**: Quick access to common functions
- **Accessibility**: Screen reader support and RTL language support
- **Offline Capability**: Process recordings without constant internet
## Quick Start
### For International Students:
1. **Setup**:
```bash
# Clone or download the project
# Install Python 3.8+
python setup_enhanced.py
```
2. **Run**:
```bash
# Windows
start_enhanced.bat
# Linux/Mac
python setup_enhanced.py
```
3. **Configure**:
- Add your Gemini API key to the `.env` file
- Choose your preferred language (Arabic/English)
- Enable translation and select target language
### API Key Setup:
1. Get a free Gemini API key from [Google AI Studio](https://makersuite.google.com/app/apikey)
2. Add it to your `.env` file:
```
GEMINI_API_KEY=your_api_key_here
```
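How the application reads this file is not documented here. The sketch below shows one way to load `GEMINI_API_KEY` using only the standard library; the helper names `read_env_file` and `get_gemini_api_key` are hypothetical, and the real project may rely on a dedicated loader instead.

```python
import os
from pathlib import Path
from typing import Dict


def read_env_file(path: str = ".env") -> Dict[str, str]:
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values: Dict[str, str] = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_gemini_api_key(path: str = ".env") -> str:
    """Prefer a real environment variable, then fall back to the .env file."""
    key = os.environ.get("GEMINI_API_KEY") or read_env_file(path).get("GEMINI_API_KEY", "")
    if not key or key == "your_api_key_here":
        raise RuntimeError("GEMINI_API_KEY is missing or still set to the placeholder value")
    return key
```

Failing early with a clear message when the placeholder is still in place can save a round of debugging later, since a bad key otherwise only surfaces when the first transcription request is rejected.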
## Usage Guide
### Recording Lectures:
1. Go to the **Record Audio** tab
2. Click **Start Recording**
3. Use **Mark Important** for key points
4. Click **Stop** when finished
5. Click **Extract Text** to process
### Translation:
1. Enable translation in settings
2. Select target language
3. Process your audio
4. Review both original and translated text
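To make the translation step more concrete, here is a rough sketch of how a request for one of the four supported target languages might be assembled before it is sent to the model. The language codes, the prompt wording, and the function name `build_translation_prompt` are all assumptions for illustration; the application's actual prompt is not shown in this README.

```python
# Target languages listed under Multi-Language Support; the short codes are an assumption.
SUPPORTED_TARGETS = {"ar": "Arabic", "en": "English", "fr": "French", "es": "Spanish"}


def build_translation_prompt(text: str, target_code: str) -> str:
    """Build a prompt asking for an academic-context translation of a transcript."""
    if target_code not in SUPPORTED_TARGETS:
        raise ValueError(f"Unsupported target language: {target_code!r}")
    language = SUPPORTED_TARGETS[target_code]
    return (
        f"Translate the following lecture transcript into {language}. "
        "Keep technical and academic terms accurate.\n\n"
        f"{text.strip()}"
    )


if __name__ == "__main__":
    print(build_translation_prompt("Today we cover binary search trees.", "fr"))
```

Validating the target language before calling the model keeps an unsupported selection from turning into a confusing downstream error.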
### Export Options:
- Copy text for notes
- Save as files for later review
- Generate synchronized videos (coming soon)
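As an illustration of the "save as files" option, the sketch below writes the original and translated text to two UTF-8 files side by side. The function name, the file naming scheme, and the `lecture` default are hypothetical; the application's own export format may differ.

```python
from pathlib import Path
from typing import Tuple


def export_transcript(original: str, translated: str, out_dir: str,
                      base_name: str = "lecture") -> Tuple[Path, Path]:
    """Write original and translated text to separate UTF-8 files and return their paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    original_path = out / f"{base_name}_original.txt"
    translated_path = out / f"{base_name}_translated.txt"
    # UTF-8 is set explicitly so Arabic and other non-Latin text round-trips correctly.
    original_path.write_text(original, encoding="utf-8")
    translated_path.write_text(translated, encoding="utf-8")
    return original_path, translated_path
```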
## For Students
### Arabic Students:
- Use the Arabic interface for ease of use
- Enable translation to English to understand technical terms
- Mark new concepts during the lecture
### International Students:
- Use translation to your native language for better understanding
- Mark important concepts during lectures
- Review both original and translated text together
## Keyboard Shortcuts
- **Space**: Start/Stop recording
- **M**: Mark important point
- **P**: Pause/Resume
- **R**: Re-record
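The shortcuts above map naturally onto a small dispatch table. The sketch below is a simplified model rather than the application's actual code: `RecordingSession` and `handle_key` are hypothetical names, and treating Re-record as "discard markers, reset the timer, and start again" is an assumption.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class RecordingSession:
    recording: bool = False
    paused: bool = False
    elapsed_s: float = 0.0
    markers: List[float] = field(default_factory=list)

    def toggle_recording(self) -> None:
        """Space: start if stopped, stop if running."""
        self.recording = not self.recording
        self.paused = False

    def mark_important(self) -> None:
        """M: remember the current position, but only while recording."""
        if self.recording:
            self.markers.append(self.elapsed_s)

    def toggle_pause(self) -> None:
        """P: pause or resume an active recording."""
        if self.recording:
            self.paused = not self.paused

    def re_record(self) -> None:
        """R: assumed to discard markers, reset the timer, and start over."""
        self.recording = True
        self.paused = False
        self.elapsed_s = 0.0
        self.markers.clear()


def handle_key(session: RecordingSession, key: str) -> bool:
    """Run the action bound to a key; return False for unbound keys."""
    actions = {
        " ": session.toggle_recording,
        "m": session.mark_important,
        "p": session.toggle_pause,
        "r": session.re_record,
    }
    action = actions.get(key.lower())
    if action is None:
        return False
    action()
    return True
```

Keeping the bindings in one dictionary makes it easy to add or remap shortcuts without touching the session logic.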
## Technical Requirements
### System Requirements:
- Python 3.8 or higher
- Modern web browser (Chrome, Firefox, Safari, Edge)
- Microphone access for recording
- Internet connection for AI processing
### Dependencies:
- Streamlit (Web interface)
- Google Generative AI (Transcription & Translation)
- Flask (Recording server)
- librosa (Audio processing)
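Before launching the app, it can help to confirm that the interpreter and packages listed above are in place. The following is a small sketch of such a check; the import name `google.generativeai` assumes the project uses the `google-generativeai` package, which this README does not confirm, and the helper names are hypothetical.

```python
import importlib.util
import sys
from typing import Dict, Tuple

# Display name -> import name. The Google entry is an assumption about the package in use.
REQUIRED_MODULES = {
    "Streamlit": "streamlit",
    "Google Generative AI": "google.generativeai",
    "Flask": "flask",
    "librosa": "librosa",
}


def python_is_supported(minimum: Tuple[int, int] = (3, 8)) -> bool:
    """True when the running interpreter meets the minimum version."""
    return sys.version_info >= minimum


def module_available(name: str) -> bool:
    """True when the module can be located; a missing parent package counts as unavailable."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        return False


def check_dependencies() -> Dict[str, bool]:
    """Map each required package's display name to whether it is installed."""
    return {label: module_available(mod) for label, mod in REQUIRED_MODULES.items()}


if __name__ == "__main__":
    print("Python OK:", python_is_supported())
    for label, ok in check_dependencies().items():
        print(f"{label}: {'found' if ok else 'missing'}")
```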
## Browser Compatibility
| Browser | Recording | Translation | UI |
|---------|-----------|-------------|----|
| Chrome | ✅ | ✅ | ✅ |
| Firefox | ✅ | ✅ | ✅ |
| Safari | ✅ | ✅ | ✅ |
| Edge | ✅ | ✅ | ✅ |
## Troubleshooting
### Common Issues:
**Microphone not working:**
- Grant microphone permission to your browser
- Check system audio settings
- Try a different browser
**Translation errors:**
- Check internet connection
- Verify Gemini API key
- Try processing again
**Poor transcription quality:**
- Ensure clear audio recording
- Reduce background noise
- Speak clearly and at moderate pace
## Roadmap
### Coming Soon:
- **Smart Content Analysis**: Automatic extraction of key concepts
- **Study Cards**: Generate flashcards from lectures
- **Platform Integration**: Connect with Moodle, Canvas, etc.
- **Collaborative Features**: Share recordings with classmates
- **Advanced Analytics**: Learning progress tracking
## Documentation
- [**Arabic Guide**](README_AR.md) - Guide in Arabic
- [**API Documentation**](docs/api.md) - Technical API reference
- [**Troubleshooting**](docs/troubleshooting.md) - Detailed problem solving
## Contributing
We welcome contributions from the international student community:
1. Fork the repository
2. Create a feature branch
3. Add your improvements
4. Submit a pull request
### Areas for Contribution:
- Additional language support
- UI improvements
- Mobile optimization
- Documentation translation
## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
## Acknowledgments
- Google Gemini AI for transcription and translation
- Streamlit team for the amazing web framework
- International student community for feedback and testing
## Support
For technical support or questions:
- Check the browser console (F12) for error details
- Review log files in the application directory
- Ensure all dependencies are up to date
---
**Made with ❤️ for international students worldwide**
---
### Quick Links:
- [Quick Start Guide](docs/quickstart.md)
- [Arabic Documentation](README_AR.md)
- [Student Guide](docs/student-guide.md)
- [Technical Setup](docs/technical-setup.md)