---
title: Gemini Task Automation
emoji: π€
colorFrom: blue
colorTo: purple
sdk: docker
app_port: 8080
pinned: false
---
# π€ Gemini Task Automation System
**An AI-powered task automation service that receives task descriptions, generates complete web applications using Gemini AI, and automatically deploys them to GitHub Pages.**
## π― What Does This Project Do?
This is an **automated code generation and deployment pipeline** that:
1. **Receives Task Requests** via REST API (POST /ready endpoint)
2. **Generates Code** using Google's Gemini AI based on natural language descriptions
3. **Creates GitHub Repositories** automatically for each task
4. **Deploys to GitHub Pages** making the generated apps instantly accessible
5. **Notifies Completion** by sending deployment details to a callback URL
### π Complete Workflow
```
User sends task request β API validates β Gemini generates code β
Creates GitHub repo β Commits & pushes β Enables GitHub Pages β
Sends notification with live URL
```
## β¨ Key Features
- **Fully Generic** - No hardcoded templates, pure AI-driven generation
- **Background Processing** - Returns HTTP 200 immediately, processes asynchronously
- **Round-based Updates** - Round 1 creates new repos, Round 2+ updates existing ones
- **Attachment Support** - Can include images (logos, mockups, sample data) for AI context
- **Robust Error Handling** - Detailed logging with specific error types
- **JSON Schema Enforcement** - Ensures structured, parseable AI responses
- **Exponential Backoff** - Retries for GitHub API operations
- **Docker Ready** - Production-ready containerization
## π How It Works (Technical Deep Dive)
### 1οΈβ£ Request Reception
```json
POST /ready
{
"email": "user@example.com",
"secret": "auth-token",
"task": "chess-game",
"round": 1,
"brief": "Create a chess game with...",
"checks": ["Has license", "Works in browser"],
"evaluation_url": "https://callback.example.com",
"attachments": []
}
```
### 2οΈβ£ AI Code Generation
- Sends task brief + checks + attachments to **Gemini 2.5 Flash**
- Uses **JSON schema** to enforce structured output
- AI generates all files (HTML, CSS, JS, README, LICENSE)
- Returns: `{"files": [{"path": "index.html", "content": "..."}]}`
### 3οΈβ£ GitHub Repository Setup
- **Round 1:** Creates new repository via GitHub API
- **Round 2+:** Clones existing repo, updates files
- Configures git with user credentials
- Commits with descriptive messages
### 4οΈβ£ Deployment
- Pushes to GitHub with retry logic (5 attempts, exponential backoff)
- Enables GitHub Pages via API
- Waits for Pages to become active
### 5οΈβ£ Notification
- POSTs deployment results to `evaluation_url`:
```json
{
"email": "user@example.com",
"task": "chess-game",
"repo_url": "https://github.com/user/chess-game",
"pages_url": "https://user.github.io/chess-game",
"commit_sha": "abc123..."
}
```
## π Deployment Options
### Option 1: Docker (Recommended)
```bash
docker build -t gemini-automation .
docker run -p 8080:8080 \
-e GEMINI_API_KEY=your_key \
-e GITHUB_TOKEN=your_token \
-e GITHUB_USERNAME=your_username \
-e STUDENT_SECRET=your_secret \
gemini-automation
```
### Option 2: Cloud Platform
Deploy to any platform supporting Docker:
- **Hugging Face Spaces** (includes GPU option)
- **Google Cloud Run** (serverless, auto-scaling)
- **AWS ECS/Fargate** (enterprise-grade)
- **Azure Container Instances** (pay-per-use)
- **DigitalOcean App Platform** (simple, affordable)
### Option 3: Local Development
```bash
# 1. Clone repository
git clone https://github.com/YOUR_USERNAME/GEMINI_TDS_PROJECT1.git
cd GEMINI_TDS_PROJECT1
# 2. Create virtual environment
python -m venv .venv
source .venv/bin/activate # Linux/Mac
# OR
.venv\Scripts\Activate.ps1 # Windows
# 3. Install dependencies
pip install -r requirements.txt
# 4. Configure environment
cp .env.example .env
# Edit .env with your API keys
# 5. Run server
uvicorn main:app --reload --port 8080
```
Access at: `http://localhost:8080`
## π Required API Keys
### 1. Google Gemini API Key
- Go to: https://aistudio.google.com/app/apikey
- Click "Create API Key"
- Copy the key (starts with `AIza...`)
- **Free tier:** 15 requests/minute, 1500 requests/day
### 2. GitHub Personal Access Token
- Go to: GitHub Settings β Developer settings β Personal access tokens β Tokens (classic)
- Click "Generate new token (classic)"
- Select scopes: `repo` (full control of private repositories)
- Generate and copy token (starts with `ghp_...`)
- **Never commit this token!**
### 3. Student Secret (Custom Auth)
- Create your own secret string (e.g., `my-secret-key-12345`)
- Used to authenticate incoming requests
- Can be any string you choose
## βοΈ Environment Variables
Create a `.env` file in the project root:
```env
GEMINI_API_KEY=AIzaSy...your_key_here
GITHUB_TOKEN=ghp_...your_token_here
GITHUB_USERNAME=your_github_username
STUDENT_SECRET=your_custom_secret_string
```
| Variable | Required | Description |
|----------|----------|-------------|
| `GEMINI_API_KEY` | β
Yes | Google Generative AI API key for code generation |
| `GITHUB_TOKEN` | β
Yes | GitHub PAT with `repo` scope for repo operations |
| `GITHUB_USERNAME` | β
Yes | Your GitHub username for repository creation |
| `STUDENT_SECRET` | β
Yes | Shared secret for authenticating incoming requests |
## π Project Architecture
```
βββββββββββββββ ββββββββββββββββ βββββββββββββββ
β Client βββββββΆβ FastAPI βββββββΆβ Gemini AI β
β (Postman) ββββββββ /ready ββββββββ (Code Gen) β
βββββββββββββββ ββββββββββββββββ βββββββββββββββ
β
βΌ
ββββββββββββββββ
β GitPython β
β (Local Ops) β
ββββββββββββββββ
β
βΌ
ββββββββββββββββ βββββββββββββββ
β GitHub API βββββββΆβGitHub Pages β
β (Create Repo)β β (Deploy) β
ββββββββββββββββ βββββββββββββββ
β
βΌ
ββββββββββββββββ
β Callback URL β
β (Notify Done)β
ββββββββββββββββ
```
## π οΈ Technology Stack
| Component | Technology | Purpose |
|-----------|-----------|---------|
| **API Framework** | FastAPI | High-performance REST API |
| **AI Model** | Gemini 2.5 Flash | Code generation from natural language |
| **Validation** | Pydantic | Request/config validation |
| **Git Operations** | GitPython | Local repo management |
| **GitHub Integration** | GitHub REST API | Repo creation, Pages deployment |
| **Async Tasks** | asyncio | Background task processing |
| **HTTP Client** | httpx | Async HTTP requests |
| **Container** | Docker | Production deployment |
## π Project Structure
```
GEMINI_TDS_PROJECT1/
βββ main.py # FastAPI app + orchestration logic
βββ config.py # Environment config with validation
βββ models.py # Pydantic request/response models
βββ requirements.txt # Python dependencies
βββ Dockerfile # Production container definition
βββ .dockerignore # Docker build exclusions
βββ .gitignore # Git exclusions
βββ .env.example # Template for environment variables
βββ LICENSE # MIT license
βββ README.md # This file
```
## π API Documentation
### POST /ready
**Description:** Submit a task for AI-powered code generation and deployment
**Request Body:**
```json
{
"email": "user@example.com",
"secret": "your_student_secret",
"task": "unique-task-id",
"round": 1,
"nonce": "unique-request-id",
"brief": "Detailed description of what to build...",
"checks": ["Requirement 1", "Requirement 2"],
"evaluation_url": "https://webhook.site/your-id",
"attachments": [
{
"name": "logo.png",
"url": "..."
}
]
}
```
**Response:**
```json
{
"message": "Task received successfully!",
"task_id": "unique-task-id"
}
```
**Status Codes:**
- `200 OK` - Task accepted, processing in background
- `403 Forbidden` - Invalid secret
- `422 Unprocessable Entity` - Invalid request format
### Callback Notification
When deployment completes, the API POSTs to your `evaluation_url`:
```json
{
"email": "user@example.com",
"task": "unique-task-id",
"round": 1,
"nonce": "unique-request-id",
"repo_url": "https://github.com/username/unique-task-id",
"commit_sha": "abc123def456...",
"pages_url": "https://username.github.io/unique-task-id"
}
```
## π§ͺ Testing
### Test with Postman / cURL
**1. Get a webhook URL:**
- Go to https://webhook.site
- Copy your unique URL
**2. Send test request:**
```bash
curl -X POST http://localhost:8080/ready \
-H "Content-Type: application/json" \
-d '{
"email": "test@example.com",
"secret": "your_student_secret",
"task": "hello-world-test",
"round": 1,
"nonce": "test-001",
"brief": "Create a simple hello world webpage with a gradient background and centered text saying Hello World!",
"checks": ["Has index.html", "Has MIT license", "Text displays"],
"evaluation_url": "YOUR_WEBHOOK_URL_HERE",
"attachments": []
}'
```
**3. Check results:**
- API returns immediately: `{"message": "Task received successfully!"}`
- Watch webhook.site for completion notification (~30-60 seconds)
- Visit the `pages_url` in notification to see live site
### Example Tasks
Calculator App
```json
{
"email": "test@example.com",
"secret": "your_secret",
"task": "calculator-app",
"round": 1,
"nonce": "calc-001",
"brief": "Create a calculator with: 1) Basic operations (+, -, Γ, Γ·), 2) Clear button, 3) Decimal support, 4) Keyboard input, 5) Responsive design with Tailwind CSS",
"checks": [
"Has MIT license",
"README explains usage",
"Calculator performs addition",
"Calculator performs subtraction",
"Has clear button",
"Responsive design"
],
"evaluation_url": "https://webhook.site/your-id",
"attachments": []
}
```
Todo List
```json
{
"email": "test@example.com",
"secret": "your_secret",
"task": "todo-list-app",
"round": 1,
"nonce": "todo-001",
"brief": "Create a todo list with: 1) Add new tasks, 2) Mark tasks as complete, 3) Delete tasks, 4) LocalStorage persistence, 5) Filter by All/Active/Completed, 6) Task counter, 7) Beautiful UI with animations",
"checks": [
"Can add tasks",
"Can mark complete",
"Can delete tasks",
"Tasks persist on refresh",
"Has filter buttons",
"Shows task count"
],
"evaluation_url": "https://webhook.site/your-id",
"attachments": []
}
```
Chess Game (With Attachments)
```json
{
"email": "test@example.com",
"secret": "your_secret",
"task": "chess-game-pro",
"round": 1,
"nonce": "chess-001",
"brief": "Create a chess game with: 1) Full chess rules, 2) Drag-and-drop pieces, 3) Move validation, 4) Check/Checkmate detection, 5) Timed modes (Blitz 5min, Rapid 10min), 6) Move history, 7) Captured pieces display",
"checks": [
"All pieces move correctly",
"Check detection works",
"Checkmate ends game",
"Timer counts down",
"Move history displays"
],
"evaluation_url": "https://webhook.site/your-id",
"attachments": []
}
```
## π Troubleshooting
### Common Issues
**Problem:** `403 Forbidden` response
- **Solution:** Check that `secret` in request matches `STUDENT_SECRET` env var
**Problem:** Task accepted but no notification received
- **Solution:** Check Hugging Face Space logs or local console for errors. Common causes:
- Invalid GitHub token or insufficient permissions
- Gemini API quota exceeded
- Invalid evaluation_url
**Problem:** GitHub API errors (403, 404)
- **Solution:** Verify GitHub token has `repo` scope:
```bash
curl -H "Authorization: token YOUR_TOKEN" https://api.github.com/user
```
**Problem:** Gemini AI returns invalid JSON
- **Solution:** Check logs for response. The system now has improved error handling with specific error messages.
**Problem:** Pages deployment times out
- **Solution:** GitHub Pages can take 1-2 minutes to activate. The system retries 5 times with exponential backoff.
### Debug Mode
Enable detailed logging:
```python
# In main.py, add at top:
import logging
logging.basicConfig(level=logging.DEBUG)
```
Or set environment variable:
```bash
export LOG_LEVEL=DEBUG # Linux/Mac
$env:LOG_LEVEL="DEBUG" # Windows PowerShell
```
### Viewing Logs
**Docker:**
```bash
docker logs -f CONTAINER_ID
```
**Hugging Face Space:**
Go to Space β "Logs" tab
## π Security Best Practices
1. **Never commit `.env` file** - Already in `.gitignore`
2. **Rotate API keys regularly** - Every 90 days recommended
3. **Use environment-specific secrets** - Different keys for dev/prod
4. **Limit GitHub token scope** - Only `repo` or `public_repo` needed
5. **Validate incoming requests** - `secret` field prevents unauthorized access
6. **Monitor API usage** - Check Gemini and GitHub API quotas
## π Performance & Limits
| Metric | Value | Notes |
|--------|-------|-------|
| Average task duration | 30-60s | Depends on complexity |
| Gemini API rate limit | 15/min | Free tier |
| GitHub API rate limit | 5000/hour | Authenticated |
| Max attachment size | ~10MB | Base64 encoding adds 33% |
| Concurrent tasks | Unlimited | Background processing |
## π€ Contributing
Contributions welcome! Areas for improvement:
- [ ] Add support for GitLab/Bitbucket deployment
- [ ] Implement task queue with Redis
- [ ] Add progress tracking API
- [ ] Support multiple AI models (Claude, GPT-4)
- [ ] Add unit tests
- [ ] Implement rate limiting
- [ ] Add metrics/monitoring
## π License
MIT License - see [LICENSE](LICENSE) file for details
## π Acknowledgments
- **Google Gemini AI** - Code generation capabilities
- **FastAPI** - Modern Python web framework
- **GitHub** - Repository hosting and Pages deployment
- **Hugging Face** - Spaces platform for easy deployment
---
**Built for TDS Project 1** - Automated task generation and deployment system