Spaces:

Pulastya0
/

Data-Science-Agent

Running

App Files Files Community

Data-Science-Agent / RENDER_DEPLOYMENT.md

Pulastya B

feat: Add Render deployment configuration

c10a976 9 days ago

preview code

raw

history blame contribute delete

6.79 kB

Render Deployment Guide

Prerequisites

A Render account (free tier available)
Your GitHub repository connected to Render
Google Gemini API key

Quick Deploy (Recommended)

Option 1: Using render.yaml (Infrastructure as Code)

Push your code to GitHub (already done)
Create a new Web Service on Render:
- Go to https://dashboard.render.com/
- Click "New +" → "Blueprint"
- Connect your GitHub repository: Pulastya-B/DevSprint-Data-Science-Agent
- Render will automatically detect the render.yaml file
- Click "Apply"
Add Secret Environment Variable:
- Go to your service dashboard
- Navigate to "Environment" tab
- Add your GOOGLE_API_KEY (this is sensitive and not included in render.yaml)
- Click "Save Changes"
Deploy:
- Render will automatically build and deploy your application
- Wait for the build to complete (~5-10 minutes for first deploy)
- Your app will be available at: https://data-science-agent.onrender.com

Option 2: Manual Setup

Create a new Web Service:
- Go to https://dashboard.render.com/
- Click "New +" → "Web Service"
- Connect your GitHub repository
Configure the service:
- Name: data-science-agent
- Region: Oregon (US West)
- Branch: main
- Runtime: Docker
- Plan: Free (or Starter for production)

Add Environment Variables:

LLM_PROVIDER=gemini
GOOGLE_API_KEY=<your-api-key-here>
GEMINI_MODEL=gemini-2.5-flash
REASONING_EFFORT=medium
CACHE_DB_PATH=/tmp/cache_db/cache.db
CACHE_TTL_SECONDS=86400
OUTPUT_DIR=/tmp/outputs
DATA_DIR=/tmp/data
MAX_PARALLEL_TOOLS=5
MAX_RETRIES=3
TIMEOUT_SECONDS=300
PORT=8080
ARTIFACT_BACKEND=local

Configure Health Check:
- Health Check Path: /api/health
Deploy:
- Click "Create Web Service"
- Wait for the build to complete

Important Notes

Free Tier Limitations

Spin down after inactivity: Free tier services spin down after 15 minutes of inactivity
Cold starts: First request after spin-down will take 30-60 seconds
Memory: 512 MB RAM (may be tight for large ML models)
Build time: Free tier has slower build times

Upgrading to Paid Plan

For production use, consider upgrading to at least the Starter plan ($7/month):

No spin-down
Faster builds
More memory (512 MB → 2 GB)
Better performance

Storage Considerations

Render uses ephemeral storage - files are lost on restart
For persistent storage, consider:
- Connecting to external storage (S3, GCS)
- Using Render's persistent disk (paid plans only)
- Storing only temporary analysis results

Performance Optimization

Use caching: The app includes SQLite caching for repeated queries
Monitor memory usage: Large datasets may exceed free tier limits
Optimize docker image: The multi-stage build already optimizes image size
Regional selection: Choose a region close to your users

Deployment Commands

Manual Rebuild (if needed)

# Trigger rebuild via Render Dashboard
# or use Render API
curl -X POST https://api.render.com/v1/services/<service-id>/deploys \
  -H "Authorization: Bearer <your-api-key>"

Check Logs

# View logs in Render Dashboard
# or use Render CLI
render logs -s data-science-agent

Custom Domain (Optional)

Go to your service dashboard
Click "Settings" → "Custom Domain"
Add your domain (e.g., agent.yourdomain.com)
Update your DNS records as instructed
Render automatically provisions SSL certificates

Troubleshooting

Build Fails

Issue: Docker build timeout

Solution: Increase build timeout in Render settings
Alternative: Optimize Dockerfile to reduce build time

Issue: Out of memory during build

Solution: Upgrade to paid plan with more memory
Alternative: Reduce dependencies in requirements.txt

App Crashes on Startup

Issue: Missing environment variables

Solution: Verify all required env vars are set in Render dashboard

Issue: Port binding error

Solution: Ensure app listens on 0.0.0.0 and PORT env variable

Slow Performance

Issue: Cold starts on free tier

Solution: Upgrade to paid plan to prevent spin-down
Workaround: Use a cron job to ping your app every 10 minutes

Issue: Large dataset processing timeout

Solution: Increase TIMEOUT_SECONDS env variable
Consider: Processing large datasets asynchronously

Monitoring

Health Check

Your app exposes a health check endpoint at /api/health:

curl https://data-science-agent.onrender.com/api/health

Logs

View real-time logs in Render Dashboard
Configure log drains for external monitoring (paid plans)

Metrics

Render provides built-in metrics:

CPU usage
Memory usage
Request count
Response time

Security Best Practices

Never commit API keys to Git (use environment variables)
Enable CORS only for trusted domains in production
Use HTTPS (Render provides this automatically)
Rotate API keys regularly
Monitor usage to detect anomalies

Cost Estimation

Free Tier

Cost: $0/month
Best for: Development, testing, hackathons
Limitations: Spin-down, slower builds, 512MB RAM

Starter Plan ($7/month)

No spin-down
512MB RAM → 2GB RAM
Faster builds
Better for: Small production apps

Standard Plan ($25/month)

4GB RAM
High performance
Best for: Production apps with moderate traffic

Deployment Checklist

Code pushed to GitHub
render.yaml committed to repository
Render account created
GitHub repository connected to Render
Blueprint deployed (or manual service created)
GOOGLE_API_KEY added as secret environment variable
Health check endpoint verified
Application accessible at Render URL
Custom domain configured (optional)
Monitoring and alerts set up

Support

Render Documentation: https://render.com/docs
Render Community: https://community.render.com/
GitHub Issues: https://github.com/Pulastya-B/DevSprint-Data-Science-Agent/issues

Next Steps

After successful deployment:

Test the deployment:

curl https://data-science-agent.onrender.com/api/health

Upload a test dataset via the web interface
Monitor logs for any errors
Configure custom domain (optional)
Set up monitoring and alerts
Share your deployed app! 🚀

Your app will be live at: https://data-science-agent.onrender.com

(URL will be different if you choose a different service name)