Spaces:

sematech
/

sema-api

Sleeping

App Files Files Community

sema-api / docs /deploy_to_hf.md

kamau1

update: Fastapi codebase structure with api endpoints

a7d24e3 6 months ago

preview code

raw

history blame contribute delete

4.02 kB

	# Deployment Instructions for HuggingFace Spaces

	## Files Ready for Deployment

	Your HuggingFace Space needs these files (all created and ready):

	1. `sema_translation_api.py` - Main API application
	2. `requirements.txt` - Python dependencies
	3. `Dockerfile` - Container configuration
	4. `README.md` - Space documentation and metadata

	## Deployment Steps

	### Option 1: Using Git (Recommended)

	1. Navigate to your existing HF Space repository:
	```bash
	cd backend/sema-api
	```

	2. The files are ready to deploy as-is:
	```bash
	# All files are ready:
	# - sema_translation_api.py (main application)
	# - requirements.txt
	# - Dockerfile
	# - README.md
	```

	3. Commit and push to HuggingFace:
	```bash
	git add .
	git commit -m "Update to use consolidated sema-utils models with new API"
	git push origin main
	```

	### Option 2: Using HuggingFace Web Interface

	1. Go to your Space: `https://huggingface.co/spaces/sematech/sema-api`
	2. Click on "Files" tab
	3. Upload/replace these files:
	- Upload `sema_translation_api.py`
	- Replace `requirements.txt`
	- Replace `Dockerfile`
	- Replace `README.md`

	## What Happens After Deployment

	1. Automatic Build: HF Spaces will automatically start building your Docker container
	2. Model Download: During build, the app will download models from `sematech/sema-utils`:
	- `spm.model` (SentencePiece tokenizer)
	- `lid218e.bin` (Language detection)
	- `translation_models/sematrans-3.3B/` (Translation model)
	3. API Startup: Once built, your API will be available at the Space URL

	## Testing Your Deployed API

	### 1. Health Check
	```bash
	curl https://sematech-sema-api.hf.space/
	```

	### 2. Translation with Auto-Detection
	```bash
	curl -X POST "https://sematech-sema-api.hf.space/translate" \
	-H "Content-Type: application/json" \
	-d '{
	"text": "Habari ya asubuhi",
	"target_language": "eng_Latn"
	}'
	```

	### 3. Translation with Source Language
	```bash
	curl -X POST "https://sematech-sema-api.hf.space/translate" \
	-H "Content-Type: application/json" \
	-d '{
	"text": "Wĩ mwega?",
	"source_language": "kik_Latn",
	"target_language": "eng_Latn"
	}'
	```

	### 4. Interactive Documentation
	Visit: `https://sematech-sema-api.hf.space/docs`

	## Expected Build Time

	- First build: 10-15 minutes (downloading models ~5GB)
	- Subsequent builds: 2-5 minutes (models cached)

	## Monitoring the Build

	1. Go to your Space page
	2. Click on "Logs" tab to see build progress
	3. Look for these key messages:
	- "📥 Downloading models from sematech/sema-utils..."
	- "✅ All models loaded successfully!"
	- "🎉 API started successfully!"

	## Troubleshooting

	### If Build Fails:
	1. Check the logs for specific error messages
	2. Common issues:
	- Model download timeout (retry build)
	- Memory issues (models are large)
	- Network connectivity issues

	### If API Doesn't Respond:
	1. Check if the Space is "Running" (green status)
	2. Try the health check endpoint first
	3. Check logs for runtime errors

	## Key Improvements in This Version

	1. Consolidated Models: Uses your unified `sema-utils` repository
	2. Better Error Handling: Clear error messages and validation
	3. Performance Monitoring: Tracks inference time
	4. Clean API Design: Follows FastAPI best practices
	5. Automatic Documentation: Built-in OpenAPI docs
	6. Flexible Input: Auto-detection or manual source language

	## Next Steps After Deployment

	1. Test the API with various language pairs
	2. Monitor performance and response times
	3. Update documentation with your actual Space URL
	4. Consider adding rate limiting for production use
	5. Add authentication if needed for private use

	## Important Note About File Structure

	The Dockerfile correctly references `sema_translation_api:app` (not `app:app`) since our main file is `sema_translation_api.py`. No need to rename files - deploy as-is!

	---

	Your new API is ready to deploy! 🚀