Spaces:

realruneet
/

Campus-AI

Sleeping

App Files Files Community

Campus-AI / docs /SETUP.md

realruneett

Final Release: CampusGen AI Pipeline & Compositor

a8aea21 about 2 months ago

preview code

raw

history blame contribute delete

5.41 kB

A newer version of the Gradio SDK is available: 6.12.0

Upgrade

CampusGen AI – Setup Guide

Prerequisites

OS: Windows 10/11 or Ubuntu 22.04+
Python: 3.11+
GPU: NVIDIA GPU with 12GB+ VRAM (RTX 5070 Ti used for development)
CUDA: 12.1+ with matching drivers
Disk: 100GB+ free space
Chrome: Latest version (for Pinterest scraping)

1. Environment Setup

# Create conda environment
conda create -n campus-ai python=3.11 -y
conda activate campus-ai

# Install dependencies
pip install -r requirements.txt

# Verify GPU
python -c "import torch; print(f'CUDA: {torch.cuda.is_available()}, GPU: {torch.cuda.get_device_name(0) if torch.cuda.is_available() else \"N/A\"}')"

2. Configuration

Edit configs/config.yaml:

project:
  creator: "YOUR_NAME"      # ← Change this

deployment:
  hf_username: "YOUR_HF_USERNAME"   # ← Change this

API Keys

Service	Where to Get	Config Key
Kaggle	kaggle.com/settings	`api_keys.kaggle`
Unsplash	unsplash.com/developers	`api_keys.unsplash`
Pexels	pexels.com/api	`api_keys.pexels`
Groq	console.groq.com	Environment: `GROQ_API_KEY`
HuggingFace	huggingface.co/settings/tokens	CLI: `huggingface-cli login`

3. Data Pipeline

Step 1: Scrape Images 🖥️ CPU (~6-12 hours)

python scripts/pinterest_scraper.py
# Or scrape a single category:
python scripts/pinterest_scraper.py
# Or scrape a single category:
python scripts/pinterest_scraper.py --category tech_fest
# Or targeted top-up for specific counts:
python scripts/pinterest_scraper.py --category workshops/coding --target 2800

Output: data/raw/{category}/{subcategory}/ with ~1900 images per theme

Step 2: Quality Filter 🎮 GPU (~5 min)

python scripts/quality_filter.py

Uses GPU-accelerated sharpness detection (Laplacian via PyTorch CUDA) and color analysis. Auto-detects GPU, falls back to CPU.

Output: data/processed/{category}/ with ~1300+ high-quality images per theme

Step 3: Caption Generation 🎮 GPU (~6-12 hours)

python scripts/caption_generator.py

Florence-2 runs in float16 on GPU. Includes campus_ai_poster trigger word and category-aware prefixes.

Output: data/final/{category}/ with image + .txt caption pairs + metadata.json

Step 4: Dataset Split 🖥️ CPU (~1 min)

python scripts/split_dataset.py

Fixed counts: 1000 train / 200 val / 100 test per theme.

Output: data/train/, data/val/, data/test/

4. Training 🎮 GPU (~7.5 hours total)

Install ai-toolkit

git clone https://github.com/ostris/ai-toolkit.git
cd ai-toolkit
pip install -e .
cd ..

Phase 1: Layout Pass (~3 hours)

Generates the initial configuration and trains block-in composition.

python scripts/create_training_config.py
# Outputs: configs/train_sdxl_lora.yaml

cd ai-toolkit
set HF_TOKEN=your_token_here
python run.py ../configs/train_sdxl_lora.yaml
cd ..

Phase 2: Perfection Pass (~4.5 hours)

Uses the static configs/train_sdxl_lora_phase2.yaml (0.1 dropout, 2e-5 LR) to refine micro-details across the entire dataset (train/val/test).

cd ai-toolkit
set HF_TOKEN=your_token_here
python run.py ../configs/train_sdxl_lora_phase2.yaml
cd ..

Monitor

# In a separate terminal
nvidia-smi -l 30

# TensorBoard
tensorboard --logdir logs/tensorboard

Test Checkpoints

python scripts/test_checkpoint.py

5. Deployment 🖥️ CPU → ☁️ Cloud

Upload LoRA to Hugging Face

huggingface-cli login
huggingface-cli upload YOUR_USERNAME/campus-ai-poster-sdxl models/sdxl/checkpoints/campus_ai_poster_sdxl/ .

Create & Deploy HF Space

cd deployment
git init
huggingface-cli repo create campus-ai-poster-generator --type space --space-sdk gradio
git remote add space https://huggingface.co/spaces/YOUR_USERNAME/campus-ai-poster-generator
git add app.py pipelines.py prompt_engine.py requirements.txt README.md
git commit -m "Deploy CampusGen AI"
git push space main

Configure Secrets

In Space Settings → Variables and Secrets:

Secret Name	Value
`HF_USERNAME`	your HF username
`GROQ_API_KEY`	your Groq API key

GPU Usage Summary

Step	Device	Time
Scraping	🖥️ CPU	~6-12h (network-bound)
Quality Filter	🎮 GPU	~5 min
Captioning	🎮 GPU	~6-12h
Split	🖥️ CPU	~1 min
Training (Phase 1)	🎮 GPU	~3h
Training (Phase 2)	🎮 GPU	~4.5h
Upload	🖥️ CPU	~5 min
Live Demo	☁️ Cloud GPU	HF ZeroGPU

Troubleshooting

Issue	Solution
CUDA OOM during training	Set `batch_size: 1` and `gradient_accumulation_steps: 4` in config
Pinterest blocking	Increase sleep time, use VPN, or try alt sources
Blurry outputs	Increase `num_inference_steps` to 40
Slow cold start on HF	Send Space link 24h before demo to warm it up
Groq rate limit	Create multiple accounts, rotate API keys
GPU not detected	Verify CUDA install: `python -c "import torch; print(torch.cuda.is_available())"`