calebhan committed on
Commit
44a2550
·
1 Parent(s): c27ae8d

mvp scope

Browse files
This view is limited to 50 files because it contains too many changes. See raw diff
Files changed (50) hide show
  1. .gitignore +31 -1
  2. README.md +264 -1
  3. backend/.env.example +16 -0
  4. backend/Dockerfile +25 -0
  5. backend/Dockerfile.worker +33 -0
  6. backend/celery_app.py +41 -0
  7. backend/config.py +50 -0
  8. backend/main.py +407 -0
  9. backend/pipeline.py +881 -0
  10. backend/pytest.ini +45 -0
  11. backend/requirements-test.txt +14 -0
  12. backend/requirements.txt +35 -0
  13. backend/scripts/README.md +184 -0
  14. backend/scripts/analyze_transcription.py +175 -0
  15. backend/scripts/diagnose_pipeline.py +307 -0
  16. backend/scripts/test_accuracy.py +277 -0
  17. backend/scripts/test_demucs_models.py +199 -0
  18. backend/scripts/test_e2e.py +106 -0
  19. backend/scripts/test_quick_verify.py +142 -0
  20. backend/tasks.py +205 -0
  21. backend/tests/__init__.py +1 -0
  22. backend/tests/conftest.py +169 -0
  23. backend/tests/test_api.py +369 -0
  24. backend/tests/test_pipeline.py +102 -0
  25. backend/tests/test_tasks.py +243 -0
  26. backend/tests/test_utils.py +147 -0
  27. backend/utils.py +79 -0
  28. docker-compose.yml +79 -0
  29. docs/testing/backend-testing.md +520 -0
  30. docs/testing/baseline-accuracy.md +178 -0
  31. docs/testing/failure-modes.md +216 -0
  32. docs/testing/frontend-testing.md +653 -0
  33. docs/testing/overview.md +315 -0
  34. docs/testing/test-videos.md +371 -0
  35. frontend/.env.example +1 -0
  36. frontend/.gitignore +24 -0
  37. frontend/Dockerfile +19 -0
  38. frontend/README.md +73 -0
  39. frontend/eslint.config.js +23 -0
  40. frontend/index.html +13 -0
  41. frontend/package-lock.json +0 -0
  42. frontend/package.json +44 -0
  43. frontend/public/vite.svg +1 -0
  44. frontend/scripts/debug-parser.cjs +58 -0
  45. frontend/scripts/test-chord-handling.cjs +42 -0
  46. frontend/src/App.css +30 -0
  47. frontend/src/App.tsx +36 -0
  48. frontend/src/api/client.ts +143 -0
  49. frontend/src/assets/react.svg +1 -0
  50. frontend/src/components/JobSubmission.css +83 -0
.gitignore CHANGED
@@ -156,6 +156,7 @@ ENV/
156
  env.bak/
157
  venv.bak/
158
  CLAUDE.md
 
159
 
160
  # Spyder project settings
161
  .spyderproject
@@ -213,4 +214,33 @@ marimo/_lsp/
213
  __marimo__/
214
 
215
  # Streamlit
216
- .streamlit/secrets.toml
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
156
  env.bak/
157
  venv.bak/
158
  CLAUDE.md
159
+ .claude
160
 
161
  # Spyder project settings
162
  .spyderproject
 
214
  __marimo__/
215
 
216
  # Streamlit
217
+ .streamlit/secrets.toml
218
+
219
+ # Rescored specific
220
+ # Backend
221
+ backend/.env
222
+ backend/storage/
223
+ backend/*.musicxml
224
+ backend/*.mid
225
+ backend/*.wav
226
+
227
+ # Frontend
228
+ frontend/node_modules/
229
+ frontend/dist/
230
+ frontend/.env.local
231
+ frontend/.env.production
232
+
233
+ # Storage (contains sensitive cookies)
234
+ storage/*.txt
235
+ storage/*.json
236
+ storage/youtube_cookies*
237
+ !storage/README.txt
238
+ storage/outputs/*
239
+ storage/temp/*
240
+
241
+ # Temp files
242
+ /tmp/
243
+ *.tmp
244
+
245
+ # Docker volumes
246
+ docker-compose.override.yml
README.md CHANGED
@@ -1 +1,264 @@
1
- # rescored
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Rescored - AI Music Transcription
2
+
3
+ Convert YouTube videos into editable sheet music using AI.
4
+
5
+ ## Overview
6
+
7
+ Rescored transcribes YouTube videos to professional-quality music notation:
8
+ 1. **Submit** a YouTube URL
9
+ 2. **AI Processing** extracts audio, separates instruments, and transcribes to MIDI
10
+ 3. **Edit** the notation in an interactive editor
11
+ 4. **Export** as MusicXML or MIDI
12
+
13
+ **Tech Stack**:
14
+ - **Backend**: Python/FastAPI + Celery + Redis
15
+ - **Frontend**: React + VexFlow (notation) + Tone.js (playback)
16
+ - **ML**: Demucs (source separation) + basic-pitch (transcription)
17
+
18
+ ## Quick Start
19
+
20
+ ### Prerequisites
21
+
22
+ - **Docker Desktop** (recommended) OR:
23
+ - Python 3.11+
24
+ - Node.js 18+
25
+ - Redis 7+
26
+ - FFmpeg
27
+ - (Optional) NVIDIA GPU with CUDA for faster processing
28
+
29
+ ### Option 1: Docker Compose (Recommended)
30
+
31
+ ```bash
32
+ # Clone repository
33
+ git clone https://github.com/yourusername/rescored.git
34
+ cd rescored
35
+ ```
36
+
37
+ #### ⚠️ REQUIRED: YouTube Cookies Setup
38
+
39
+ YouTube requires authentication for video downloads (as of December 2024). You **MUST** export your YouTube cookies before the application will work.
40
+
41
+ **Quick Setup (5 minutes):**
42
+
43
+ 1. **Install Browser Extension**
44
+ - Install [Get cookies.txt LOCALLY](https://chrome.google.com/webstore/detail/cclelndahbckbenkjhflpdbgdldlbecc) for Chrome/Edge/Brave
45
+
46
+ 2. **Export Cookies**
47
+ - Open a **NEW private/incognito window** (this is important!)
48
+ - **Sign in to YouTube** with your Google account
49
+ - **Visit any YouTube video page**
50
+ - **Click the extension icon** in your browser toolbar
51
+ - **Click "Export"** or "Download"
52
+ - **Save the file** to your computer
53
+
54
+ 3. **Place Cookie File**
55
+ ```bash
56
+ # Create storage directory
57
+ mkdir -p storage
58
+
59
+ # Move the exported file (adjust path if needed)
60
+ mv ~/Downloads/youtube.com_cookies.txt ./storage/youtube_cookies.txt
61
+
62
+ # OR on Windows:
63
+ # move %USERPROFILE%\Downloads\youtube.com_cookies.txt storage\youtube_cookies.txt
64
+ ```
65
+
66
+ 4. **Start Services**
67
+ ```bash
68
+ docker-compose up
69
+
70
+ # Services will be available at:
71
+ # - Frontend: http://localhost:5173
72
+ # - Backend API: http://localhost:8000
73
+ # - API Docs: http://localhost:8000/docs
74
+ ```
75
+
76
+ **Verification:**
77
+ ```bash
78
+ docker-compose exec worker ls -lh /app/storage/youtube_cookies.txt
79
+ ```
80
+ You should see the file listed.
81
+
82
+ **Troubleshooting:**
83
+
84
+ - **"Please sign in" error**: Make sure you exported from a private/incognito window. Export fresh cookies (don't reuse old ones). Ensure the file is named exactly `youtube_cookies.txt` and isn't empty.
85
+
86
+ - **File format errors**: The first line should be `# Netscape HTTP Cookie File`. If not, use the browser extension method.
87
+
88
+ - **Cookies expire quickly**: Export from a NEW incognito window each time. You may need to re-export periodically.
89
+
90
+ **Security Note:** ⚠️ Never commit `youtube_cookies.txt` to git (it's already in `.gitignore`). Your cookies contain authentication tokens for your Google account—keep them private!
91
+
92
+ **Why Is This Required?** YouTube implemented bot detection in late 2024 that blocks unauthenticated downloads. Even though our tool is for legitimate transcription purposes, YouTube's systems can't distinguish it from scrapers. By providing your cookies, you're proving you're a real user who has agreed to YouTube's terms of service.
93
+
94
+ ### Option 2: Manual Setup
95
+
96
+ **Backend**:
97
+ ```bash
98
+ cd backend
99
+
100
+ # Create virtual environment
101
+ python3 -m venv venv
102
+ source venv/bin/activate # On Windows: venv\Scripts\activate
103
+
104
+ # Install dependencies
105
+ pip install -r requirements.txt
106
+
107
+ # Copy environment file
108
+ cp .env.example .env
109
+
110
+ # Start Redis (in separate terminal)
111
+ redis-server
112
+
113
+ # Start Celery worker (in separate terminal)
114
+ celery -A tasks worker --loglevel=info
115
+
116
+ # Start API server
117
+ python main.py
118
+ ```
119
+
120
+ **Frontend**:
121
+ ```bash
122
+ cd frontend
123
+
124
+ # Install dependencies
125
+ npm install
126
+
127
+ # Start dev server
128
+ npm run dev
129
+ ```
130
+
131
+ ## Usage
132
+
133
+ 1. Open [http://localhost:5173](http://localhost:5173)
134
+ 2. Paste a YouTube URL (piano music recommended for best results)
135
+ 3. Wait 1-2 minutes for transcription (with GPU) or 10-15 minutes (CPU)
136
+ 4. Edit the notation in the interactive editor
137
+ 5. Export as MusicXML or MIDI
138
+
139
+ ## MVP Features
140
+
141
+ ✅ YouTube URL input and validation
142
+ ✅ Piano-only transcription (MVP limitation)
143
+ ✅ Single staff notation (treble clef)
144
+ ✅ Basic editing: select, delete, add notes
145
+ ✅ Play/pause with tempo control
146
+ ✅ Export MusicXML
147
+
148
+ ### Coming in Phase 2
149
+
150
+ - Multi-instrument transcription
151
+ - Grand staff (treble + bass)
152
+ - Advanced editing (copy/paste, undo/redo)
153
+ - MIDI export
154
+ - PDF export
155
+
156
+ ## Project Structure
157
+
158
+ ```
159
+ rescored/
160
+ ├── backend/ # Python/FastAPI backend
161
+ │ ├── main.py # REST API + WebSocket server
162
+ │ ├── tasks.py # Celery background workers
163
+ │ ├── pipeline.py # Audio processing pipeline
164
+ │ ├── config.py # Configuration
165
+ │ └── requirements.txt # Python dependencies
166
+ ├── frontend/ # React frontend
167
+ │ ├── src/
168
+ │ │ ├── components/ # UI components
169
+ │ │ ├── store/ # Zustand state management
170
+ │ │ └── api/ # API client
171
+ │ └── package.json # Node dependencies
172
+ ├── docs/ # Comprehensive documentation
173
+ └── docker-compose.yml # Docker setup
174
+ ```
175
+
176
+ ## Documentation
177
+
178
+ Comprehensive documentation is available in the [`docs/`](docs/) directory:
179
+
180
+ - [Getting Started](docs/getting-started.md)
181
+ - [Architecture Overview](docs/architecture/overview.md)
182
+ - [Backend Pipeline](docs/backend/pipeline.md)
183
+ - [Frontend Rendering](docs/frontend/notation-rendering.md)
184
+ - [MVP Scope](docs/features/mvp.md)
185
+ - [Known Challenges](docs/research/challenges.md)
186
+
187
+ ## Performance
188
+
189
+ **With GPU (RTX 3080)**:
190
+ - Download: ~10 seconds
191
+ - Source separation: ~45 seconds
192
+ - Transcription: ~5 seconds
193
+ - **Total: ~1-2 minutes**
194
+
195
+ **With CPU**:
196
+ - Download: ~10 seconds
197
+ - Source separation: ~8-10 minutes
198
+ - Transcription: ~30 seconds
199
+ - **Total: ~10-15 minutes**
200
+
201
+ ## Accuracy Expectations
202
+
203
+ Transcription is **70-80% accurate** for simple piano music, **60-70%** for complex pieces. The interactive editor is designed to make fixing errors easy.
204
+
205
+ ## Development
206
+
207
+ ### Running Tests
208
+
209
+ ```bash
210
+ # Backend tests
211
+ cd backend
212
+ pytest
213
+
214
+ # Frontend tests
215
+ cd frontend
216
+ npm test
217
+ ```
218
+
219
+ ### API Documentation
220
+
221
+ Once the backend is running, visit:
222
+ - Swagger UI: [http://localhost:8000/docs](http://localhost:8000/docs)
223
+ - ReDoc: [http://localhost:8000/redoc](http://localhost:8000/redoc)
224
+
225
+ ## Troubleshooting
226
+
227
+ **Worker not processing jobs?**
228
+ - Check Redis is running: `redis-cli ping` (should return PONG)
229
+ - Check worker logs: `docker-compose logs worker`
230
+
231
+ **GPU not detected?**
232
+ - Install NVIDIA Docker runtime
233
+ - Uncomment GPU section in `docker-compose.yml`
234
+ - Set `GPU_ENABLED=true` in `.env`
235
+
236
+ **YouTube download fails?**
237
+ - Video may be age-restricted or private
238
+ - Check yt-dlp is up to date: `pip install -U yt-dlp`
239
+
240
+ ## Contributing
241
+
242
+ See [CLAUDE.md](CLAUDE.md) for development guidelines.
243
+
244
+ ## License
245
+
246
+ MIT License - see [LICENSE](LICENSE) for details.
247
+
248
+ ## Acknowledgments
249
+
250
+ - **Demucs** (Meta AI Research) - Source separation
251
+ - **basic-pitch** (Spotify) - Audio transcription
252
+ - **VexFlow** - Music notation rendering
253
+ - **Tone.js** - Web audio synthesis
254
+
255
+ ## Roadmap
256
+
257
+ - **Phase 1 (MVP)**: ✅ Piano transcription with basic editing
258
+ - **Phase 2**: Multi-instrument, advanced editing, PDF export
259
+ - **Phase 3**: User accounts, cloud storage, collaboration
260
+ - **Phase 4**: Mobile app, real-time collaboration
261
+
262
+ ---
263
+
264
+ **Note**: This is an educational project. Users are responsible for copyright compliance when transcribing YouTube content.
backend/.env.example ADDED
@@ -0,0 +1,16 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Redis Configuration
2
+ REDIS_URL=redis://localhost:6379/0
3
+
4
+ # Storage Configuration
5
+ STORAGE_PATH=/tmp/rescored
6
+
7
+ # API Configuration
8
+ API_HOST=0.0.0.0
9
+ API_PORT=8000
10
+
11
+ # Worker Configuration
12
+ GPU_ENABLED=true
13
+ MAX_VIDEO_DURATION=900 # 15 minutes in seconds
14
+
15
+ # CORS Origins (comma-separated)
16
+ CORS_ORIGINS=http://localhost:5173,http://localhost:3000
backend/Dockerfile ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ FROM python:3.11-slim
2
+
3
+ # Install system dependencies
4
+ RUN apt-get update && apt-get install -y \
5
+ ffmpeg \
6
+ git \
7
+ && rm -rf /var/lib/apt/lists/*
8
+
9
+ # Set working directory
10
+ WORKDIR /app
11
+
12
+ # Copy requirements
13
+ COPY requirements.txt .
14
+
15
+ # Install Python dependencies
16
+ RUN pip install --no-cache-dir -r requirements.txt
17
+
18
+ # Copy application code
19
+ COPY . .
20
+
21
+ # Expose API port
22
+ EXPOSE 8000
23
+
24
+ # Default command (can be overridden in docker-compose)
25
+ CMD ["uvicorn", "main:app", "--host", "0.0.0.0", "--port", "8000"]
backend/Dockerfile.worker ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Use NVIDIA CUDA base image for GPU support
2
+ # For CPU-only, use: FROM python:3.11-slim
3
+ FROM nvidia/cuda:11.8.0-runtime-ubuntu22.04
4
+
5
+ # Install Python and system dependencies
6
+ RUN apt-get update && apt-get install -y \
7
+ python3.11 \
8
+ python3-pip \
9
+ ffmpeg \
10
+ git \
11
+ && rm -rf /var/lib/apt/lists/*
12
+
13
+ # Set working directory
14
+ WORKDIR /app
15
+
16
+ # Copy requirements
17
+ COPY requirements.txt .
18
+
19
+ # Install Python dependencies
20
+ RUN pip3 install --no-cache-dir -r requirements.txt
21
+
22
+ # Copy application code
23
+ COPY . .
24
+
25
+ # Create a wrapper script to patch torchaudio to use soundfile backend
26
+ RUN echo '#!/bin/bash\n\
27
+ # Force torchaudio to use soundfile backend\n\
28
+ export TORCHAUDIO_USE_BACKEND_DISPATCHER=0\n\
29
+ exec celery -A tasks worker --loglevel=info --concurrency=1\n\
30
+ ' > /app/start-worker.sh && chmod +x /app/start-worker.sh
31
+
32
+ # Default command
33
+ CMD ["/app/start-worker.sh"]
backend/celery_app.py ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """Celery application configuration."""
2
+ from celery import Celery
3
+ from kombu import Exchange, Queue
4
+ from config import settings
5
+
6
+ # Initialize Celery
7
+ celery_app = Celery(
8
+ "rescored",
9
+ broker=settings.redis_url,
10
+ backend=settings.redis_url,
11
+ )
12
+
13
+ # Configuration
14
+ celery_app.conf.update(
15
+ task_serializer="json",
16
+ accept_content=["json"],
17
+ result_serializer="json",
18
+ timezone="UTC",
19
+ enable_utc=True,
20
+
21
+ # Task settings
22
+ task_track_started=True,
23
+ task_time_limit=600, # 10 minutes max per task
24
+ task_soft_time_limit=540, # Soft limit at 9 minutes
25
+ task_acks_late=True, # Acknowledge task after completion (safer)
26
+ worker_prefetch_multiplier=1, # Take 1 task at a time
27
+
28
+ # Retry settings
29
+ task_autoretry_for=(Exception,),
30
+ task_retry_kwargs={'max_retries': 3},
31
+ task_retry_backoff=True, # Exponential backoff
32
+ task_retry_backoff_max=600,
33
+
34
+ # Priority queues
35
+ task_queues=(
36
+ Queue('default', Exchange('default'), routing_key='default', priority=5),
37
+ Queue('high_priority', Exchange('high_priority'), routing_key='high_priority', priority=10),
38
+ ),
39
+ task_default_queue='default',
40
+ task_default_routing_key='default',
41
+ )
backend/config.py ADDED
@@ -0,0 +1,50 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """Configuration module for Rescored backend."""
2
+ from pydantic_settings import BaseSettings
3
+ from pathlib import Path
4
+
5
+
6
+ class Settings(BaseSettings):
7
+ """Application settings."""
8
+
9
+ # Redis Configuration
10
+ redis_url: str = "redis://localhost:6379/0"
11
+
12
+ # Storage Configuration
13
+ storage_path: Path = Path("/tmp/rescored")
14
+
15
+ # API Configuration
16
+ api_host: str = "0.0.0.0"
17
+ api_port: int = 8000
18
+
19
+ # Worker Configuration
20
+ gpu_enabled: bool = True
21
+ max_video_duration: int = 900 # 15 minutes
22
+
23
+ # CORS Configuration
24
+ cors_origins: str = "http://localhost:5173,http://localhost:3000"
25
+
26
+ class Config:
27
+ env_file = ".env"
28
+ env_file_encoding = "utf-8"
29
+
30
+ @property
31
+ def cors_origins_list(self) -> list[str]:
32
+ """Parse CORS origins as list."""
33
+ return [origin.strip() for origin in self.cors_origins.split(",")]
34
+
35
+ @property
36
+ def temp_audio_path(self) -> Path:
37
+ """Temporary audio storage path."""
38
+ path = self.storage_path / "temp_audio"
39
+ path.mkdir(parents=True, exist_ok=True)
40
+ return path
41
+
42
+ @property
43
+ def outputs_path(self) -> Path:
44
+ """Output files storage path."""
45
+ path = self.storage_path / "outputs"
46
+ path.mkdir(parents=True, exist_ok=True)
47
+ return path
48
+
49
+
50
+ settings = Settings()
backend/main.py ADDED
@@ -0,0 +1,407 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """FastAPI application for Rescored backend."""
2
+ from fastapi import FastAPI, HTTPException, WebSocket, WebSocketDisconnect, Request
3
+ from fastapi.middleware.cors import CORSMiddleware
4
+ from fastapi.responses import FileResponse
5
+ from pydantic import BaseModel, HttpUrl
6
+ from uuid import uuid4
7
+ from datetime import datetime
8
+ from pathlib import Path
9
+ from starlette.middleware.base import BaseHTTPMiddleware
10
+ from starlette.responses import JSONResponse
11
+ import redis
12
+ import json
13
+ import asyncio
14
+ from config import settings
15
+ from utils import validate_youtube_url, check_video_availability
16
+ from tasks import process_transcription_task
17
+
18
+ # Initialize FastAPI
19
+ app = FastAPI(
20
+ title="Rescored API",
21
+ description="AI-powered music transcription from YouTube videos",
22
+ version="1.0.0"
23
+ )
24
+
25
+ # Redis client (initialized before middleware)
26
+ redis_client = redis.Redis.from_url(settings.redis_url, decode_responses=True)
27
+
28
+
29
+ # === Rate Limiting Middleware ===
30
+
31
+ class RateLimitMiddleware(BaseHTTPMiddleware):
32
+ """
33
+ Rate limiting middleware to prevent abuse.
34
+
35
+ Limits: 10 transcription jobs per IP per hour (security requirement).
36
+ Uses Redis with sliding window counter.
37
+ """
38
+
39
+ async def dispatch(self, request: Request, call_next):
40
+ # Only rate limit the transcribe endpoint
41
+ if request.url.path == "/api/v1/transcribe" and request.method == "POST":
42
+ # Get client IP (handle proxies)
43
+ client_ip = request.client.host
44
+ if "x-forwarded-for" in request.headers:
45
+ client_ip = request.headers["x-forwarded-for"].split(",")[0].strip()
46
+
47
+ # Redis key for this IP
48
+ rate_limit_key = f"ratelimit:{client_ip}"
49
+
50
+ # Get current count
51
+ current_count = redis_client.get(rate_limit_key)
52
+
53
+ if current_count and int(current_count) >= 10:
54
+ return JSONResponse(
55
+ status_code=429,
56
+ content={
57
+ "detail": "Rate limit exceeded. Maximum 10 transcription jobs per hour per IP."
58
+ }
59
+ )
60
+
61
+ # Increment counter
62
+ pipe = redis_client.pipeline()
63
+ pipe.incr(rate_limit_key)
64
+ pipe.expire(rate_limit_key, 3600) # 1 hour TTL
65
+ pipe.execute()
66
+
67
+ response = await call_next(request)
68
+ return response
69
+
70
+
71
+ # CORS middleware
72
+ app.add_middleware(
73
+ CORSMiddleware,
74
+ allow_origins=settings.cors_origins_list,
75
+ allow_credentials=True,
76
+ allow_methods=["*"],
77
+ allow_headers=["*"],
78
+ )
79
+
80
+ # Rate limiting middleware
81
+ app.add_middleware(RateLimitMiddleware)
82
+
83
+
84
+ # === Request/Response Models ===
85
+
86
+ class TranscribeRequest(BaseModel):
87
+ """Request model for transcription."""
88
+ youtube_url: HttpUrl
89
+ options: dict = {"instruments": ["piano"]}
90
+
91
+
92
+ class TranscribeResponse(BaseModel):
93
+ """Response model for transcription submission."""
94
+ job_id: str
95
+ status: str
96
+ created_at: datetime
97
+ estimated_duration_seconds: int
98
+ websocket_url: str
99
+
100
+
101
+ class JobStatusResponse(BaseModel):
102
+ """Response model for job status."""
103
+ job_id: str
104
+ status: str
105
+ progress: int
106
+ current_stage: str | None
107
+ status_message: str | None
108
+ created_at: str
109
+ started_at: str | None
110
+ completed_at: str | None
111
+ failed_at: str | None
112
+ error: dict | None
113
+ result_url: str | None
114
+
115
+
116
+ # === WebSocket Connection Manager ===
117
+
118
+ class ConnectionManager:
119
+ """Manages WebSocket connections."""
120
+
121
+ def __init__(self):
122
+ self.active_connections: dict[str, list[WebSocket]] = {}
123
+
124
+ async def connect(self, websocket: WebSocket, job_id: str):
125
+ """Accept and register a WebSocket connection."""
126
+ await websocket.accept()
127
+ if job_id not in self.active_connections:
128
+ self.active_connections[job_id] = []
129
+ self.active_connections[job_id].append(websocket)
130
+
131
+ def disconnect(self, websocket: WebSocket, job_id: str):
132
+ """Remove a WebSocket connection."""
133
+ if job_id in self.active_connections:
134
+ self.active_connections[job_id].remove(websocket)
135
+ if not self.active_connections[job_id]:
136
+ del self.active_connections[job_id]
137
+
138
+ async def broadcast(self, job_id: str, message: dict):
139
+ """Broadcast message to all clients connected to a job."""
140
+ if job_id in self.active_connections:
141
+ dead_connections = []
142
+
143
+ for connection in self.active_connections[job_id]:
144
+ try:
145
+ await connection.send_json(message)
146
+ except:
147
+ dead_connections.append(connection)
148
+
149
+ # Clean up dead connections
150
+ for conn in dead_connections:
151
+ self.disconnect(conn, job_id)
152
+
153
+
154
+ manager = ConnectionManager()
155
+
156
+
157
+ # === REST Endpoints ===
158
+
159
+ @app.get("/")
160
+ async def root():
161
+ """Root endpoint."""
162
+ return {
163
+ "name": "Rescored API",
164
+ "version": "1.0.0",
165
+ "docs": "/docs"
166
+ }
167
+
168
+
169
+ @app.post("/api/v1/transcribe", response_model=TranscribeResponse, status_code=201)
170
+ async def submit_transcription(request: TranscribeRequest):
171
+ """
172
+ Submit a YouTube URL for transcription.
173
+
174
+ Args:
175
+ request: Transcription request with YouTube URL
176
+
177
+ Returns:
178
+ Job information including job ID and WebSocket URL
179
+ """
180
+ # Validate YouTube URL
181
+ is_valid, video_id_or_error = validate_youtube_url(str(request.youtube_url))
182
+ if not is_valid:
183
+ raise HTTPException(status_code=400, detail=video_id_or_error)
184
+
185
+ video_id = video_id_or_error
186
+
187
+ # Check video availability
188
+ availability = check_video_availability(video_id, settings.max_video_duration)
189
+ if not availability['available']:
190
+ raise HTTPException(status_code=422, detail=availability['reason'])
191
+
192
+ # Create job
193
+ job_id = str(uuid4())
194
+ job_data = {
195
+ "job_id": job_id,
196
+ "status": "queued",
197
+ "youtube_url": str(request.youtube_url),
198
+ "video_id": video_id,
199
+ "options": json.dumps(request.options),
200
+ "created_at": datetime.utcnow().isoformat(),
201
+ "progress": 0,
202
+ "current_stage": "queued",
203
+ "status_message": "Job queued for processing",
204
+ }
205
+
206
+ # Store in Redis
207
+ redis_client.hset(f"job:{job_id}", mapping=job_data)
208
+
209
+ # Queue Celery task
210
+ process_transcription_task.delay(job_id)
211
+
212
+ return TranscribeResponse(
213
+ job_id=job_id,
214
+ status="queued",
215
+ created_at=datetime.utcnow(),
216
+ estimated_duration_seconds=120,
217
+ websocket_url=f"ws://localhost:{settings.api_port}/api/v1/jobs/{job_id}/stream"
218
+ )
219
+
220
+
221
+ @app.get("/api/v1/jobs/{job_id}", response_model=JobStatusResponse)
222
+ async def get_job_status(job_id: str):
223
+ """
224
+ Get job status.
225
+
226
+ Args:
227
+ job_id: Job identifier
228
+
229
+ Returns:
230
+ Job status information
231
+ """
232
+ job_data = redis_client.hgetall(f"job:{job_id}")
233
+
234
+ if not job_data:
235
+ raise HTTPException(status_code=404, detail="Job not found")
236
+
237
+ # Parse error if present
238
+ error = None
239
+ if 'error' in job_data:
240
+ try:
241
+ error = json.loads(job_data['error'])
242
+ except:
243
+ error = {"message": job_data['error']}
244
+
245
+ # Construct result URL if completed
246
+ result_url = None
247
+ if job_data.get('status') == 'completed':
248
+ result_url = f"/api/v1/scores/{job_id}"
249
+
250
+ return JobStatusResponse(
251
+ job_id=job_id,
252
+ status=job_data.get('status', 'unknown'),
253
+ progress=int(job_data.get('progress', 0)),
254
+ current_stage=job_data.get('current_stage'),
255
+ status_message=job_data.get('status_message'),
256
+ created_at=job_data.get('created_at', ''),
257
+ started_at=job_data.get('started_at'),
258
+ completed_at=job_data.get('completed_at'),
259
+ failed_at=job_data.get('failed_at'),
260
+ error=error,
261
+ result_url=result_url
262
+ )
263
+
264
+
265
+ @app.get("/api/v1/scores/{job_id}")
266
+ async def download_score(job_id: str):
267
+ """
268
+ Download MusicXML score.
269
+
270
+ Args:
271
+ job_id: Job identifier
272
+
273
+ Returns:
274
+ MusicXML file
275
+ """
276
+ job_data = redis_client.hgetall(f"job:{job_id}")
277
+
278
+ if not job_data or job_data.get('status') != 'completed':
279
+ raise HTTPException(status_code=404, detail="Score not available")
280
+
281
+ output_path = job_data.get('output_path')
282
+ if not output_path:
283
+ raise HTTPException(status_code=404, detail="Score file path not found")
284
+
285
+ file_path = Path(output_path)
286
+ if not file_path.exists():
287
+ raise HTTPException(status_code=404, detail="Score file not found")
288
+
289
+ return FileResponse(
290
+ path=file_path,
291
+ media_type="application/vnd.recordare.musicxml+xml",
292
+ filename=f"score_{job_id}.musicxml"
293
+ )
294
+
295
+
296
+ @app.get("/api/v1/scores/{job_id}/midi")
297
+ async def download_midi(job_id: str):
298
+ """
299
+ Download MIDI version of score.
300
+
301
+ For MVP, this returns the cleaned MIDI from transcription (piano_clean.mid).
302
+
303
+ Args:
304
+ job_id: Job identifier
305
+
306
+ Returns:
307
+ MIDI file
308
+ """
309
+ job_data = redis_client.hgetall(f"job:{job_id}")
310
+
311
+ if not job_data or job_data.get('status') != 'completed':
312
+ raise HTTPException(status_code=404, detail="MIDI not available")
313
+
314
+ midi_path_str = job_data.get('midi_path')
315
+ if not midi_path_str:
316
+ raise HTTPException(status_code=404, detail="MIDI file path not found")
317
+
318
+ file_path = Path(midi_path_str)
319
+ if not file_path.exists():
320
+ raise HTTPException(status_code=404, detail="MIDI file not found")
321
+
322
+ return FileResponse(
323
+ path=file_path,
324
+ media_type="audio/midi",
325
+ filename=f"score_{job_id}.mid"
326
+ )
327
+
328
+
329
+ # === WebSocket Endpoint ===
330
+
331
+ @app.websocket("/api/v1/jobs/{job_id}/stream")
332
+ async def websocket_endpoint(websocket: WebSocket, job_id: str):
333
+ """
334
+ WebSocket endpoint for real-time progress updates.
335
+
336
+ Args:
337
+ websocket: WebSocket connection
338
+ job_id: Job identifier
339
+ """
340
+ await manager.connect(websocket, job_id)
341
+
342
+ try:
343
+ # Subscribe to Redis pub/sub for this job
344
+ pubsub = redis_client.pubsub()
345
+ pubsub.subscribe(f"job:{job_id}:updates")
346
+
347
+ # Listen for updates in a separate task
348
+ async def listen_for_updates():
349
+ for message in pubsub.listen():
350
+ if message['type'] == 'message':
351
+ update = json.loads(message['data'])
352
+ await websocket.send_json(update)
353
+
354
+ # Close connection if job completed or failed
355
+ if update.get('type') in ['completed', 'error']:
356
+ break
357
+
358
+ # Send initial status
359
+ job_data = redis_client.hgetall(f"job:{job_id}")
360
+ if job_data:
361
+ initial_update = {
362
+ "type": "progress",
363
+ "job_id": job_id,
364
+ "progress": int(job_data.get('progress', 0)),
365
+ "stage": job_data.get('current_stage', 'queued'),
366
+ "message": job_data.get('status_message', 'Starting...'),
367
+ "timestamp": datetime.utcnow().isoformat(),
368
+ }
369
+ await websocket.send_json(initial_update)
370
+
371
+ # Listen for updates (blocking)
372
+ await listen_for_updates()
373
+
374
+ except WebSocketDisconnect:
375
+ manager.disconnect(websocket, job_id)
376
+ finally:
377
+ pubsub.unsubscribe(f"job:{job_id}:updates")
378
+ pubsub.close()
379
+
380
+
381
+ # === Health Check ===
382
+
383
+ @app.get("/health")
384
+ async def health_check():
385
+ """Health check endpoint."""
386
+ # Check Redis connection
387
+ try:
388
+ redis_client.ping()
389
+ redis_status = "healthy"
390
+ except:
391
+ redis_status = "unhealthy"
392
+
393
+ return {
394
+ "status": "healthy" if redis_status == "healthy" else "degraded",
395
+ "redis": redis_status,
396
+ "storage": str(settings.storage_path)
397
+ }
398
+
399
+
400
+ if __name__ == "__main__":
401
+ import uvicorn
402
+ uvicorn.run(
403
+ "main:app",
404
+ host=settings.api_host,
405
+ port=settings.api_port,
406
+ reload=True
407
+ )
backend/pipeline.py ADDED
@@ -0,0 +1,881 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """
2
+ AI-powered music transcription pipeline.
3
+
4
+ Processes YouTube videos to extract audio, separate sources, transcribe to MIDI,
5
+ and generate MusicXML notation.
6
+ """
7
+ import subprocess
8
+ from pathlib import Path
9
+ import tempfile
10
+ from typing import Optional
11
+ import mido
12
+ import librosa
13
+ from piano_transcription_inference import PianoTranscription, sample_rate
14
+ from music21 import converter, key, meter, tempo, note, clef, stream, chord as m21_chord
15
+
16
+
17
+ class TranscriptionPipeline:
18
+ """Handles the complete transcription workflow."""
19
+
20
+ def __init__(self, job_id: str, youtube_url: str, storage_path: Path):
21
+ self.job_id = job_id
22
+ self.youtube_url = youtube_url
23
+ self.storage_path = storage_path
24
+ self.temp_dir = storage_path / "temp" / job_id
25
+ self.temp_dir.mkdir(parents=True, exist_ok=True)
26
+ self.progress_callback = None
27
+
28
+ # Initialize ByteDance piano transcription model (lazy loading)
29
+ self._transcriptor = None
30
+
31
+ def set_progress_callback(self, callback):
32
+ """Set callback for progress updates: callback(percent, stage, message)"""
33
+ self.progress_callback = callback
34
+
35
+ def progress(self, percent: int, stage: str, message: str):
36
+ """Report progress if callback is set."""
37
+ if self.progress_callback:
38
+ self.progress_callback(percent, stage, message)
39
+
40
+ def run(self) -> Path:
41
+ """
42
+ Execute full pipeline and return path to MusicXML file.
43
+
44
+ Raises:
45
+ Exception: If any stage fails
46
+ """
47
+ try:
48
+ self.progress(0, "download", "Starting audio download")
49
+ audio_path = self.download_audio()
50
+
51
+ self.progress(20, "separate", "Starting source separation")
52
+ stems = self.separate_sources(audio_path)
53
+
54
+ self.progress(50, "transcribe", "Starting MIDI transcription")
55
+ midi_path = self.transcribe_to_midi(stems['other'])
56
+
57
+ self.progress(90, "musicxml", "Generating MusicXML")
58
+ musicxml_path = self.generate_musicxml(midi_path)
59
+
60
+ self.progress(100, "complete", "Transcription complete")
61
+ return musicxml_path
62
+
63
+ except Exception as e:
64
+ self.progress(0, "error", str(e))
65
+ raise
66
+
67
+ def download_audio(self) -> Path:
68
+ """Download audio from YouTube URL using yt-dlp."""
69
+ output_path = self.temp_dir / "audio.wav"
70
+
71
+ cmd = [
72
+ "yt-dlp",
73
+ "-x", # Extract audio
74
+ "--audio-format", "wav",
75
+ "--audio-quality", "0", # Best quality
76
+ "--output", str(output_path.with_suffix('')), # yt-dlp adds .wav
77
+ # Workarounds for YouTube restrictions
78
+ "--extractor-args", "youtube:player_client=android,web",
79
+ "--no-check-certificates",
80
+ self.youtube_url
81
+ ]
82
+
83
+ result = subprocess.run(cmd, capture_output=True, text=True)
84
+
85
+ if result.returncode != 0:
86
+ raise RuntimeError(f"yt-dlp failed: {result.stderr}")
87
+
88
+ if not output_path.exists():
89
+ raise RuntimeError("Audio file not created")
90
+
91
+ return output_path
92
+
93
+ def separate_sources(self, audio_path: Path) -> dict:
94
+ """
95
+ Separate audio into 4 stems using Demucs.
96
+
97
+ Returns:
98
+ dict with keys: drums, bass, vocals, other
99
+ """
100
+ # Run Demucs
101
+ cmd = [
102
+ "demucs",
103
+ "--two-stems=other", # For piano, we only need "other" stem
104
+ "-o", str(self.temp_dir),
105
+ str(audio_path)
106
+ ]
107
+
108
+ result = subprocess.run(cmd, capture_output=True, text=True)
109
+
110
+ if result.returncode != 0:
111
+ raise RuntimeError(f"Demucs failed: {result.stderr}")
112
+
113
+ # Demucs creates: temp/htdemucs/audio/*.wav
114
+ demucs_output = self.temp_dir / "htdemucs" / audio_path.stem
115
+
116
+ stems = {
117
+ 'other': demucs_output / "other.wav",
118
+ 'no_other': demucs_output / "no_other.wav",
119
+ }
120
+
121
+ # Verify output
122
+ if not stems['other'].exists():
123
+ raise RuntimeError("Demucs did not create expected output files")
124
+
125
+ return stems
126
+
127
+ def _get_transcriptor(self):
128
+ """Lazy load ByteDance piano transcription model."""
129
+ if self._transcriptor is None:
130
+ import torch
131
+ device = 'cuda' if torch.cuda.is_available() else 'cpu'
132
+ print(f" Loading ByteDance piano transcription model on {device}...")
133
+ self._transcriptor = PianoTranscription(device=device, checkpoint_path=None)
134
+ return self._transcriptor
135
+
136
+ def transcribe_to_midi(self, audio_path: Path) -> Path:
137
+ """
138
+ Transcribe audio to MIDI using ByteDance piano_transcription.
139
+
140
+ Args:
141
+ audio_path: Path to audio file (should be 'other' stem for piano)
142
+
143
+ Returns:
144
+ Path to generated MIDI file
145
+ """
146
+ output_dir = self.temp_dir
147
+ midi_path = output_dir / "piano.mid"
148
+
149
+ # Load audio with librosa (ByteDance expects specific sample rate and mono)
150
+ print(f" Loading audio from {audio_path}...")
151
+ audio, _ = librosa.load(str(audio_path), sr=sample_rate, mono=True)
152
+
153
+ # Get transcriptor (lazy loaded)
154
+ transcriptor = self._get_transcriptor()
155
+
156
+ # Transcribe to MIDI
157
+ print(f" Transcribing with ByteDance model...")
158
+ transcriptor.transcribe(audio, str(midi_path))
159
+
160
+ if not midi_path.exists():
161
+ raise RuntimeError("ByteDance transcription did not create MIDI file")
162
+
163
+ # Post-process MIDI (quantize, clean up)
164
+ cleaned_midi = self.clean_midi(midi_path)
165
+
166
+ return cleaned_midi
167
+
168
+ def clean_midi(self, midi_path: Path) -> Path:
169
+ """
170
+ Clean up MIDI file: filter invalid notes, remove very short notes, light quantization.
171
+
172
+ Args:
173
+ midi_path: Path to raw MIDI file
174
+
175
+ Returns:
176
+ Path to cleaned MIDI file
177
+ """
178
+ mid = mido.MidiFile(midi_path)
179
+
180
+ # First pass: collect all notes with timing info to filter by duration
181
+ for track in mid.tracks:
182
+ absolute_time = 0
183
+ active_notes = {} # note_number -> (start_time, start_msg_index, velocity)
184
+ note_durations = {} # msg_index -> duration_ticks
185
+ messages_with_abs_time = []
186
+
187
+ # Build list of messages with absolute timing
188
+ for msg_idx, msg in enumerate(track):
189
+ absolute_time += msg.time
190
+ messages_with_abs_time.append((msg_idx, msg, absolute_time))
191
+
192
+ if msg.type == 'note_on' and msg.velocity > 0:
193
+ active_notes[msg.note] = (absolute_time, msg_idx, msg.velocity)
194
+ elif msg.type in ['note_off', 'note_on']: # note_on with vel=0 is note_off
195
+ if msg.note in active_notes:
196
+ start_time, start_idx, velocity = active_notes.pop(msg.note)
197
+ duration = absolute_time - start_time
198
+ note_durations[start_idx] = duration
199
+
200
+ # Second pass: filter messages based on criteria
201
+ messages_to_keep = []
202
+ min_duration_ticks = mid.ticks_per_beat // 8 # Minimum 32nd note duration
203
+ min_velocity = 20 # Filter very quiet notes (likely noise)
204
+ notes_to_skip = set() # Track note_on indices to skip
205
+
206
+ # Identify notes to skip based on duration
207
+ for msg_idx in note_durations:
208
+ if note_durations[msg_idx] < min_duration_ticks:
209
+ notes_to_skip.add(msg_idx)
210
+
211
+ for msg_idx, msg, abs_time in messages_with_abs_time:
212
+ # Filter out notes outside piano range (A0 = 21, C8 = 108)
213
+ if hasattr(msg, 'note') and (msg.note < 21 or msg.note > 108):
214
+ continue
215
+
216
+ # Filter very quiet notes (likely false positives)
217
+ if msg.type == 'note_on' and msg.velocity > 0 and msg.velocity < min_velocity:
218
+ notes_to_skip.add(msg_idx)
219
+ continue
220
+
221
+ # Skip notes marked for removal (very short)
222
+ if msg.type == 'note_on' and msg_idx in notes_to_skip:
223
+ continue
224
+
225
+ # Skip note_off for notes we filtered out
226
+ if msg.type in ['note_off', 'note_on'] and hasattr(msg, 'note'):
227
+ # Check if this note_off corresponds to a filtered note_on
228
+ should_skip = False
229
+ for skip_idx in notes_to_skip:
230
+ if skip_idx < msg_idx:
231
+ skip_msg = messages_with_abs_time[skip_idx][1]
232
+ if skip_msg.type == 'note_on' and skip_msg.note == msg.note:
233
+ should_skip = True
234
+ break
235
+ if should_skip and msg.type == 'note_off':
236
+ continue
237
+
238
+ messages_to_keep.append((msg, abs_time))
239
+
240
+ # Third pass: rebuild track with delta times and light quantization
241
+ track.clear()
242
+ previous_time = 0
243
+
244
+ # Use 16th note quantization grid (less aggressive than 8th)
245
+ ticks_per_16th = mid.ticks_per_beat // 4
246
+
247
+ for msg, abs_time in messages_to_keep:
248
+ if msg.type in ['note_on', 'note_off']:
249
+ # Light quantization - only snap if close to grid (within 10%)
250
+ nearest_grid = round(abs_time / ticks_per_16th) * ticks_per_16th
251
+ snap_threshold = ticks_per_16th * 0.1
252
+
253
+ if abs(abs_time - nearest_grid) < snap_threshold:
254
+ abs_time = nearest_grid
255
+
256
+ # Set delta time from previous message
257
+ msg.time = max(0, abs_time - previous_time)
258
+ previous_time = abs_time
259
+ track.append(msg)
260
+
261
+ # Save cleaned MIDI
262
+ cleaned_path = midi_path.with_stem(f"{midi_path.stem}_clean")
263
+ mid.save(cleaned_path)
264
+
265
+ return cleaned_path
266
+
267
    def generate_musicxml(self, midi_path: Path) -> Path:
        """
        Convert MIDI to MusicXML using music21 (single treble staff for MVP).

        Stages: parse MIDI -> detect key -> add 4/4 time signature and tempo ->
        deduplicate overlapping notes -> make measures -> strip impossible
        durations/tuplets -> validate -> write, retrying around music21's
        "2048th duration" export errors by emptying the offending measure.

        Args:
            midi_path: Path to input MIDI file.

        Returns:
            Path to the output MusicXML file (<job_id>.musicxml in temp_dir).

        Raises:
            RuntimeError: if 2048th-note export errors persist after 10 retries.
            Exception: any non-2048th export error is propagated unchanged.
        """
        self.progress(92, "musicxml", "Parsing MIDI")

        # Parse MIDI
        score = converter.parse(midi_path)

        self.progress(94, "musicxml", "Analyzing key signature")

        # Detect key signature via music21's analyzer.
        # NOTE(review): bare except also catches KeyboardInterrupt — consider
        # narrowing to Exception.
        try:
            analyzed_key = score.analyze('key')
            score.insert(0, analyzed_key)
        except:
            # Default to C major if analysis fails
            score.insert(0, key.Key('C'))

        # Set time signature (always 4/4 for MVP; no meter detection here).
        score.insert(0, meter.TimeSignature('4/4'))

        # Extract tempo from the parsed MIDI, or default to 120 BPM.
        midi_tempo = self._extract_tempo(score)
        score.insert(0, tempo.MetronomeMark(number=midi_tempo))

        self.progress(95, "musicxml", "Deduplicating overlapping notes")

        # Fix overlapping polyphonic notes BEFORE creating measures.
        # This prevents MusicXML corruption where measures have >4.0 beats.
        score = self._deduplicate_overlapping_notes(score)

        self.progress(96, "musicxml", "Creating measures")

        # For MVP: single staff with treble clef. Grand-staff splitting causes
        # issues with overlapping polyphonic notes from transcription.
        # TODO: Implement proper grand staff in Phase 2 with better note splitting algorithm

        # Add treble clef (bass notes will render with ledger lines).
        for part in score.parts:
            part.insert(0, clef.TrebleClef())
            part.partName = "Piano"

        # Create measures
        score = score.makeMeasures()

        # Remove impossible note durations that makeMeasures() might have created.
        score = self._remove_impossible_durations(score)

        # Fix tuplets containing impossible durations (must run AFTER makeMeasures)
        # to avoid "Cannot convert 2048th duration to MusicXML" export errors.
        score = self._fix_tuplet_durations(score)

        # Log warnings for overfull/underfull measures that slipped through.
        self._validate_measures(score)

        self.progress(97, "musicxml", "Finalizing score")

        self.progress(98, "musicxml", "Writing MusicXML file")

        # Write MusicXML with retry logic: music21 may still synthesize 2048th
        # durations during export; each retry empties the offending measure.
        output_path = self.temp_dir / f"{self.job_id}.musicxml"
        max_retries = 10  # prevent an infinite retry loop
        retry_count = 0

        while retry_count < max_retries:
            try:
                score.write('musicxml', fp=str(output_path))
                break  # Success!
            except Exception as e:
                error_msg = str(e)
                # Only handle the known impossible-duration export errors.
                if 'Cannot convert "2048th" duration to MusicXML' in error_msg or \
                   'Cannot convert "4096th" duration to MusicXML' in error_msg:
                    # music21 reports the 1-based measure number in the message.
                    import re
                    match = re.search(r'measure \((\d+)\)', error_msg)
                    if match:
                        measure_num = int(match.group(1))
                        print(f" Fixing 2048th note error in measure {measure_num}...")

                        # Remove ALL notes/rests from the measure as a last
                        # resort: the bad duration is created BY music21 during
                        # export, so we cannot fix it in place — only empty it.
                        for part in score.parts:
                            measures = list(part.getElementsByClass('Measure'))
                            if measure_num <= len(measures):
                                problem_measure = measures[measure_num - 1]

                                to_remove = list(problem_measure.recurse().notesAndRests)

                                for element in to_remove:
                                    # Remove from whichever container holds it.
                                    element.activeSite.remove(element)

                                # Invalidate music21's cached element lists.
                                problem_measure.coreElementsChanged()
                                part.coreElementsChanged()

                                print(f" Removed all {len(to_remove)} elements from measure {measure_num}")

                        retry_count += 1
                    else:
                        # Can't parse measure number, give up
                        raise
                else:
                    # Different error, give up
                    raise

        if retry_count >= max_retries:
            raise RuntimeError(f"Failed to fix 2048th note errors after {max_retries} attempts")

        return output_path
387
+
388
    def _deduplicate_overlapping_notes(self, score):
        """
        Deduplicate overlapping notes to prevent MusicXML measure corruption.

        Problem: transcription outputs multiple notes at the same timestamp for
        polyphonic detection; makeMeasures() then creates measures with >4.0
        beats. Solution: bucket near-simultaneous notes, merge duplicate
        pitches, and emit a single note or chord per bucket.

        NOTE on units: despite the *_ms names, buckets are in thousandths of a
        QUARTER NOTE (element.offset is in quarterLengths), so a "10ms" bucket
        is really 0.01 quarterLength — wall-clock only at 60 BPM.

        Args:
            score: music21 Score object, BEFORE makeMeasures().

        Returns:
            The score with each part rebuilt from deduplicated notes.
        """
        from music21 import stream, note, chord as m21_chord
        from collections import defaultdict

        # Process each part independently.
        for part in score.parts:
            # Bucket notes by (quarterLength * 1000) rounded down to 10 units.
            notes_by_time = defaultdict(list)  # bucket -> [Note]

            for element in part.flatten().notesAndRests:
                if isinstance(element, note.Rest):
                    continue  # rests are irrelevant to deduplication

                # Offset in quarter notes, scaled by 1000 for integer bucketing.
                offset_qn = element.offset
                offset_ms = round(offset_qn * 1000)

                # Merge notes within 0.01 quarterLength of each other.
                bucket = (offset_ms // 10) * 10

                if isinstance(element, note.Note):
                    notes_by_time[bucket].append(element)
                elif isinstance(element, m21_chord.Chord):
                    # Explode chords into individual notes so duplicate pitches
                    # across overlapping chords/notes can be merged.
                    for pitch in element.pitches:
                        n = note.Note(pitch)
                        n.quarterLength = element.quarterLength
                        n.offset = element.offset
                        notes_by_time[bucket].append(n)

            # Rebuild the part from the deduplicated buckets.
            new_part = stream.Part()

            # Carry over identity; key/tempo/time signature are inserted later
            # by generate_musicxml at the score level.
            new_part.id = part.id
            new_part.partName = part.partName

            for bucket_ms in sorted(notes_by_time.keys()):
                bucket_notes = notes_by_time[bucket_ms]

                if not bucket_notes:
                    continue

                # Group simultaneous notes by MIDI pitch to find duplicates.
                pitch_groups = defaultdict(list)
                for n in bucket_notes:
                    pitch_groups[n.pitch.midi].append(n)

                # For each unique pitch, keep the longest (then loudest) note.
                unique_notes = []
                for midi_pitch, pitch_notes in pitch_groups.items():
                    # Velocity as int for sorting; music21 may store None.
                    def get_velocity(note):
                        if hasattr(note, 'volume') and hasattr(note.volume, 'velocity'):
                            vel = note.volume.velocity
                            return vel if vel is not None else 64
                        return 64

                    pitch_notes.sort(key=lambda x: (x.quarterLength, get_velocity(x)), reverse=True)
                    best_note = pitch_notes[0]

                    # Drop notes shorter than a 64th (0.0625 quarterLength);
                    # MusicXML cannot represent notes shorter than 1024th.
                    if best_note.quarterLength >= 0.0625:
                        unique_notes.append(best_note)

                if not unique_notes:
                    continue  # every candidate in this bucket was too short

                # Back from milli-quarterLengths to quarterLength offset.
                offset_qn = bucket_ms / 1000.0

                if len(unique_notes) == 1:
                    # Single note — snap duration to a MusicXML-safe value.
                    n = note.Note(unique_notes[0].pitch)
                    n.quarterLength = self._snap_duration(unique_notes[0].quarterLength)
                    new_part.insert(offset_qn, n)
                elif len(unique_notes) > 1:
                    # Multiple pitches at one instant -> one chord; use the
                    # shortest member duration to avoid overlaps, then snap.
                    min_duration = min(n.quarterLength for n in unique_notes)

                    c = m21_chord.Chord([n.pitch for n in unique_notes])
                    c.quarterLength = self._snap_duration(min_duration)
                    new_part.insert(offset_qn, c)

            # Swap the rebuilt part in place of the original.
            score.replace(part, new_part)

        return score
493
+
494
+ def _snap_duration(self, duration):
495
+ """
496
+ Snap duration to nearest MusicXML-valid note value to avoid impossible tuplets.
497
+
498
+ Valid durations: whole (4.0), half (2.0), quarter (1.0), eighth (0.5),
499
+ sixteenth (0.25), thirty-second (0.125), sixty-fourth (0.0625)
500
+
501
+ Args:
502
+ duration: Quarter length as float or Fraction
503
+
504
+ Returns:
505
+ Snapped quarter length
506
+ """
507
+ valid_durations = [4.0, 2.0, 1.0, 0.5, 0.25, 0.125, 0.0625]
508
+
509
+ # Convert to float for comparison
510
+ dur_float = float(duration)
511
+
512
+ # Find nearest valid duration
513
+ nearest = min(valid_durations, key=lambda x: abs(x - dur_float))
514
+
515
+ return nearest
516
+
517
+ def _remove_impossible_durations(self, score):
518
+ """
519
+ Remove notes/rests with durations too short for MusicXML export (<128th note).
520
+
521
+ music21's makeMeasures() can create rests with impossible durations (2048th notes)
522
+ when filling gaps. This removes them to prevent MusicXML export errors.
523
+
524
+ Args:
525
+ score: music21 Score with measures
526
+
527
+ Returns:
528
+ Cleaned score
529
+ """
530
+ from music21 import note, stream
531
+
532
+ # Be VERY aggressive - remove anything shorter than 16th note
533
+ # ByteDance transcription creates many very short notes that cause music21
534
+ # to generate complex tuplets with impossible durations (2048th notes)
535
+ # By filtering aggressively, we prevent this MusicXML export error
536
+ MIN_DURATION = 0.25 # 16th note (1.0 / 4)
537
+
538
+ removed_count = 0
539
+ for part in score.parts:
540
+ for measure in part.getElementsByClass('Measure'):
541
+ # Collect elements to remove
542
+ to_remove = []
543
+
544
+ for element in measure.notesAndRests:
545
+ if element.quarterLength < MIN_DURATION:
546
+ to_remove.append(element)
547
+ removed_count += 1
548
+
549
+ # Remove impossible durations
550
+ for element in to_remove:
551
+ measure.remove(element)
552
+
553
+ if removed_count > 0:
554
+ print(f" Removed {removed_count} notes/rests shorter than 16th note to prevent tuplet errors")
555
+
556
+ return score
557
+
558
    def _fix_tuplet_durations(self, score):
        """
        Remove notes/rests whose tuplet or own duration cannot reach MusicXML.

        MusicXML export fails when music21 converts a tuplet whose
        durationNormal.type is 2048th or shorter. This pass removes such
        elements BEFORE export. Must run AFTER makeMeasures(), which is what
        creates the tuplets.

        Note: fixed_tuplets can be incremented twice for one element (once for
        a bad tuplet, once for a bad own duration), so it is an upper bound,
        while removed_count is the exact number of removed elements.

        Args:
            score: music21 Score with measures and tuplets.

        Returns:
            The same score, cleaned in place.
        """
        from music21 import note, stream, duration

        # Duration types MusicXML has no representation for.
        IMPOSSIBLE_TYPES = {'2048th', '4096th', '8192th', '16384th', '32768th'}

        removed_count = 0
        fixed_tuplets = 0

        for part in score.parts:
            for measure_idx, measure in enumerate(part.getElementsByClass('Measure')):
                # Collect first (can't modify while iterating).
                to_remove = []

                # Direct children only — not flattened.
                for element in measure.notesAndRests:
                    should_remove = False

                    # Any tuplet on this element with an impossible
                    # durationNormal type condemns the element.
                    if element.duration.tuplets:
                        for tuplet in element.duration.tuplets:
                            if hasattr(tuplet, 'durationNormal') and tuplet.durationNormal:
                                dur_type = tuplet.durationNormal.type
                                if dur_type in IMPOSSIBLE_TYPES:
                                    should_remove = True
                                    fixed_tuplets += 1
                                    break

                    # The element's own duration type may also be impossible.
                    if element.duration.type in IMPOSSIBLE_TYPES:
                        should_remove = True
                        fixed_tuplets += 1

                    if should_remove:
                        to_remove.append(element)

                # Remove problematic elements; a failed removal is logged and
                # skipped rather than aborting the whole cleanup.
                for element in to_remove:
                    try:
                        measure.remove(element)
                        removed_count += 1
                    except Exception as e:
                        print(f" Warning: Could not remove element from measure {measure_idx + 1}: {e}")
                        continue

        if removed_count > 0:
            print(f" Fixed {fixed_tuplets} tuplets by removing {removed_count} elements with impossible durations")

        return score
624
+
625
+ def _validate_measures(self, score):
626
+ """
627
+ Validate that all measures have correct durations matching their time signature.
628
+
629
+ Logs warnings for any measures that are overfull or underfull.
630
+
631
+ Args:
632
+ score: music21 Score with measures already created
633
+ """
634
+ for part_idx, part in enumerate(score.parts):
635
+ for measure_idx, measure in enumerate(part.getElementsByClass('Measure')):
636
+ # Get time signature for this measure
637
+ ts = measure.timeSignature or measure.getContextByClass('TimeSignature')
638
+ if not ts:
639
+ continue # Skip if no time signature
640
+
641
+ expected_duration = ts.barDuration.quarterLength
642
+ actual_duration = measure.duration.quarterLength
643
+
644
+ # Allow small floating-point tolerance (0.01 quarter notes = ~10ms at 120 BPM)
645
+ tolerance = 0.01
646
+
647
+ if abs(actual_duration - expected_duration) > tolerance:
648
+ print(f"WARNING: Measure {measure_idx + 1} in part {part_idx} has duration {float(actual_duration):.2f} "
649
+ f"(expected {float(expected_duration):.2f} for {ts.ratioString} time)")
650
+
651
    def _split_into_grand_staff(self, score):
        """
        Split a measured score into treble and bass parts (piano grand staff).

        Notes >= Middle C (C4 / MIDI 60) go to the treble clef (right hand);
        notes below go to the bass clef (left hand). Chords are split per
        pitch. Expects a score that ALREADY has measures from makeMeasures().

        NOTE: currently unused by generate_musicxml (MVP uses a single treble
        staff); kept for the planned Phase 2 grand-staff support.

        Args:
            score: music21 Score with measures.

        Returns:
            A new two-part Score, or the input score when it already has
            multiple parts (clefs are then just inserted) or has no parts.
        """
        from music21 import stream, note, chord as m21_chord

        # Already multi-part: just label the first part treble, the rest bass.
        if len(score.parts) > 1:
            for part_idx, part in enumerate(score.parts):
                if part_idx == 0:
                    part.insert(0, clef.TrebleClef())
                else:
                    part.insert(0, clef.BassClef())
            return score

        # Single-part score expected from here on.
        original_part = score.parts[0] if len(score.parts) > 0 else None
        if not original_part:
            return score

        # Fresh score that will hold the two hands.
        new_score = stream.Score()

        # Carry over key, meter and tempo marks from the flattened original.
        for element in score.flatten():
            if isinstance(element, (key.Key, meter.TimeSignature, tempo.MetronomeMark)):
                new_score.insert(0, element)

        # Right hand (treble) and left hand (bass) parts.
        treble_part = stream.Part()
        treble_part.insert(0, clef.TrebleClef())
        treble_part.partName = "Piano Right Hand"

        bass_part = stream.Part()
        bass_part.insert(0, clef.BassClef())
        bass_part.partName = "Piano Left Hand"

        # Middle C (C4) is MIDI note 60: the hand-split threshold.
        SPLIT_POINT = 60

        # Mirror the original measure structure into both hands.
        for measure in original_part.getElementsByClass('Measure'):
            treble_measure = stream.Measure(number=measure.number)
            bass_measure = stream.Measure(number=measure.number)

            # Copy any mid-piece time signature change into both staves.
            for ts in measure.getElementsByClass(meter.TimeSignature):
                treble_measure.insert(0, ts)
                bass_measure.insert(0, ts)

            # Distribute the measure's notes between the hands.
            for element in measure.notesAndRests:
                offset = element.getOffsetInHierarchy(measure)

                if isinstance(element, note.Rest):
                    # Skip rests — makeRests() below regenerates them per staff.
                    continue

                elif isinstance(element, note.Note):
                    # Single note: route by pitch relative to Middle C.
                    new_note = note.Note(element.pitch, quarterLength=element.quarterLength)

                    if element.pitch.midi >= SPLIT_POINT:
                        treble_measure.insert(offset, new_note)
                    else:
                        bass_measure.insert(offset, new_note)

                elif isinstance(element, m21_chord.Chord):
                    # Chord: its pitches may straddle the split point.
                    treble_pitches = []
                    bass_pitches = []

                    for pitch in element.pitches:
                        if pitch.midi >= SPLIT_POINT:
                            treble_pitches.append(pitch)
                        else:
                            bass_pitches.append(pitch)

                    # Emit a per-hand chord only when that hand has pitches.
                    if treble_pitches:
                        treble_chord = m21_chord.Chord(treble_pitches, quarterLength=element.quarterLength)
                        treble_measure.insert(offset, treble_chord)

                    if bass_pitches:
                        bass_chord = m21_chord.Chord(bass_pitches, quarterLength=element.quarterLength)
                        bass_measure.insert(offset, bass_chord)

            treble_part.append(treble_measure)
            bass_part.append(bass_measure)

        # Treble inserted first for conventional grand-staff ordering.
        new_score.insert(0, treble_part)
        new_score.insert(0, bass_part)

        # Let music21 fill gaps with rests; failure here is non-fatal.
        # NOTE(review): bare except also swallows KeyboardInterrupt.
        try:
            new_score.makeRests(inPlace=True, fillGaps=True)
        except:
            pass

        return new_score
763
+
764
+ def _extract_tempo(self, score) -> int:
765
+ """Extract tempo from MIDI or default to 120 BPM."""
766
+ for element in score.flatten():
767
+ if isinstance(element, tempo.MetronomeMark):
768
+ return int(element.number)
769
+ return 120
770
+
771
+ def cleanup(self):
772
+ """Delete temporary files (except output)."""
773
+ # Don't delete entire temp_dir yet - output file is still there
774
+ # Delete individual temp files instead
775
+ for file in self.temp_dir.glob("*.wav"):
776
+ file.unlink(missing_ok=True)
777
+ for file in self.temp_dir.glob("*_clean.mid"):
778
+ if file.name != "piano_clean.mid":
779
+ file.unlink(missing_ok=True)
780
+
781
+
782
+ # === Module-level convenience functions for backward compatibility ===
783
+
784
def download_audio(youtube_url: str, storage_path: Path) -> Path:
    """Download audio from YouTube URL (module-level wrapper)."""
    return TranscriptionPipeline("compat_job", youtube_url, storage_path).download_audio()


def separate_sources(audio_path: Path, storage_path: Path) -> dict:
    """Separate audio sources (module-level wrapper)."""
    job = TranscriptionPipeline("compat_job", "http://example.com", storage_path)
    return job.separate_sources(audio_path)


def transcribe_audio(
    audio_path: Path,
    storage_path: Path,
    onset_threshold: float = 0.4,
    frame_threshold: float = 0.35
) -> Path:
    """Transcribe audio to MIDI (module-level wrapper).

    Note: onset_threshold and frame_threshold are accepted for backward
    compatibility but are not consumed by the underlying class method, whose
    current signature takes no thresholds.
    """
    job = TranscriptionPipeline("compat_job", "http://example.com", storage_path)
    return job.transcribe_to_midi(audio_path)


def quantize_midi(midi_path: Path, resolution: int = 480) -> Path:
    """Quantize MIDI file (module-level wrapper around clean_midi)."""
    job = TranscriptionPipeline("compat_job", "http://example.com", midi_path.parent)
    return job.clean_midi(midi_path)


def remove_duplicate_notes(midi_path: Path) -> Path:
    """Remove duplicate notes from MIDI (clean_midi performs this as part of its pass)."""
    job = TranscriptionPipeline("compat_job", "http://example.com", midi_path.parent)
    return job.clean_midi(midi_path)


def remove_short_notes(midi_path: Path, min_duration: int = 60) -> Path:
    """Remove short notes from MIDI (clean_midi performs this as part of its pass)."""
    job = TranscriptionPipeline("compat_job", "http://example.com", midi_path.parent)
    return job.clean_midi(midi_path)


def generate_musicxml(midi_path: Path, storage_path: Path) -> Path:
    """Generate MusicXML from MIDI (module-level wrapper)."""
    job = TranscriptionPipeline("compat_job", "http://example.com", storage_path)
    return job.generate_musicxml(midi_path)
834
+
835
+
836
def detect_key_signature(midi_path: Path) -> dict:
    """Detect key signature from MIDI.

    Args:
        midi_path: Path to a MIDI file.

    Returns:
        dict with 'tonic' (e.g. 'C') and 'mode' ('major'/'minor');
        falls back to C major when analysis fails.
    """
    score = converter.parse(midi_path)
    # Catch only Exception: the previous bare `except:` would also swallow
    # KeyboardInterrupt/SystemExit.
    try:
        analyzed_key = score.analyze('key')
        return {
            'tonic': analyzed_key.tonic.name,
            'mode': analyzed_key.mode
        }
    except Exception:
        return {'tonic': 'C', 'mode': 'major'}
847
+
848
+
849
def detect_time_signature(midi_path: Path) -> dict:
    """Detect time signature from MIDI, defaulting to 4/4 when none is found."""
    score = converter.parse(midi_path)
    found = next(iter(score.flatten().getElementsByClass(meter.TimeSignature)), None)
    if found is None:
        return {'numerator': 4, 'denominator': 4}
    return {
        'numerator': found.numerator,
        'denominator': found.denominator
    }


def detect_tempo(midi_path: Path) -> int:
    """Detect tempo (BPM) from MIDI, defaulting to 120 when no mark is found."""
    score = converter.parse(midi_path)
    mark = next(iter(score.flatten().getElementsByClass(tempo.MetronomeMark)), None)
    return 120 if mark is None else int(mark.number)
866
+
867
+
868
def run_transcription_pipeline(youtube_url: str, storage_path: Path) -> dict:
    """Run the full transcription pipeline (module-level wrapper).

    Returns a status dict instead of raising: {'status': 'success',
    'musicxml_path': ...} on success, {'status': 'failed', 'error': ...}
    on any pipeline failure.
    """
    job = TranscriptionPipeline("compat_job", youtube_url, storage_path)
    try:
        musicxml = job.run()
    except Exception as exc:
        return {
            'status': 'failed',
            'error': str(exc)
        }
    return {
        'status': 'success',
        'musicxml_path': str(musicxml)
    }
backend/pytest.ini ADDED
@@ -0,0 +1,45 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [pytest]
2
+ testpaths = tests
3
+ python_files = test_*.py
4
+ python_classes = Test*
5
+ python_functions = test_*
6
+
7
+ # Show extra test summary info
8
+ addopts =
9
+ -v
10
+ --strict-markers
11
+ --tb=short
12
+ --disable-warnings
13
+ --cov=.
14
+ --cov-report=term-missing
15
+ --cov-report=html
16
+ --cov-branch
17
+
18
+ # Markers for categorizing tests
19
+ markers =
20
+ unit: Unit tests for individual functions
21
+ integration: Integration tests for multiple components
22
+ slow: Tests that take longer to run
23
+ gpu: Tests that require GPU
24
+ network: Tests that require network access
25
+
26
+ # Asyncio configuration
27
+ asyncio_mode = auto
28
+ asyncio_default_fixture_loop_scope = function
29
+
30
+ # Coverage options — NOTE: coverage.py does not read pytest.ini; these [coverage:*] sections are ignored here and should live in .coveragerc, setup.cfg, or tox.ini to take effect.
31
+ [coverage:run]
32
+ omit =
33
+ tests/*
34
+ __pycache__/*
35
+ */site-packages/*
36
+ venv/*
37
+
38
+ [coverage:report]
39
+ exclude_lines =
40
+ pragma: no cover
41
+ def __repr__
42
+ raise AssertionError
43
+ raise NotImplementedError
44
+ if __name__ == .__main__.:
45
+ if TYPE_CHECKING:
backend/requirements-test.txt ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Test dependencies for Rescored backend
2
+ -r requirements.txt
3
+
4
+ # Testing framework
5
+ pytest==8.2.0
6
+ pytest-asyncio==0.24.0
7
+ pytest-cov==4.1.0
8
+ pytest-mock==3.12.0
9
+
10
+ # HTTP testing
11
+ httpx==0.26.0
12
+
13
+ # Test utilities
14
+ faker==22.5.1
backend/requirements.txt ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Web Framework
2
+ # Note: This file now includes torch/torchaudio as they are required by demucs on macOS
3
+ fastapi==0.115.5
4
+ uvicorn[standard]==0.32.1
5
+ python-multipart==0.0.20
6
+
7
+ # Task Queue
8
+ celery==5.4.0
9
+ redis==5.2.1
10
+
11
+ # Audio Processing
12
+ yt-dlp>=2025.12.8
13
+ soundfile==0.12.1
14
+ scipy
15
+ torch>=2.0.0
16
+ torchaudio>=2.9.1
17
+ torchcodec>=0.9.1
18
+ demucs>=3.0.6
19
+
20
+ # Pitch detection (macOS default runtime is CoreML)
21
+ basic-pitch==0.4.0
22
+
23
+ # Music Processing
24
+ music21==9.3.0
25
+ mido==1.3.3
26
+
27
+ # Utilities
28
+ python-dotenv==1.0.1
29
+ tenacity==9.0.0
30
+ pydantic==2.10.4
31
+ pydantic-settings==2.7.0
32
+ numpy<2.0.0
33
+
34
+ # WebSocket
35
+ websockets==14.1
backend/scripts/README.md ADDED
@@ -0,0 +1,184 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Backend Scripts
2
+
3
+ Utility scripts for testing and analyzing the Rescored transcription pipeline.
4
+
5
+ ## Scripts
6
+
7
+ ### test_accuracy.py
8
+
9
+ **NEW** - Comprehensive accuracy testing suite that tests the pipeline with 10 diverse piano videos covering different styles and difficulty levels.
10
+
11
+ **Usage:**
12
+ ```bash
13
+ cd backend
14
+ python scripts/test_accuracy.py
15
+ ```
16
+
17
+ **Output:**
18
+ - Progress for each of 10 test videos
19
+ - Success/failure status per video
20
+ - Metrics: note count, measure count, separation quality
21
+ - Summary statistics (success rate, average metrics)
22
+ - Full results saved to JSON: `/tmp/rescored/accuracy_test_results.json`
23
+
24
+ **Test Videos** (varying difficulty):
25
+ - **Easy**: Simple scales, Twinkle Twinkle
26
+ - **Medium**: Für Elise, Canon in D, River Flows in You, Moonlight Sonata, Jazz Blues
27
+ - **Hard**: Chopin Nocturne, Clair de Lune
28
+ - **Very Hard**: La Campanella (Liszt)
29
+
30
+ **Expected Runtime**: 30-60 minutes for all 10 videos
31
+
32
+ **Purpose**: Establish baseline accuracy metrics for the MVP pipeline, identify common failure modes, and track improvements across phases.
33
+
34
+ ### test_e2e.py
35
+
36
+ End-to-end pipeline testing script. Downloads a YouTube video, runs the full transcription pipeline, and displays results.
37
+
38
+ **Usage:**
39
+ ```bash
40
+ cd backend
41
+ python scripts/test_e2e.py "<youtube_url>"
42
+ ```
43
+
44
+ **Example:**
45
+ ```bash
46
+ python scripts/test_e2e.py "https://www.youtube.com/watch?v=PAE88urB1xs"
47
+ ```
48
+
49
+ **Output:**
50
+ - Progress updates for each pipeline stage
51
+ - Total processing time
52
+ - MusicXML file path and size
53
+ - List of intermediate files
54
+ - Preview of generated MusicXML
55
+
56
+ **Test Videos:**
57
+ - Simple piano melody: https://www.youtube.com/watch?v=WyTb3DTu88c
58
+ - Classical piano: https://www.youtube.com/watch?v=fJ9rUzIMcZQ
59
+
60
+ ---
61
+
62
+ ### analyze_transcription.py
63
+
64
+ MIDI file analysis tool. Provides detailed statistics about transcribed notes to identify quality issues.
65
+
66
+ **Usage:**
67
+ ```bash
68
+ cd backend
69
+ python scripts/analyze_transcription.py <midi_path>
70
+ ```
71
+
72
+ **Example:**
73
+ ```bash
74
+ python scripts/analyze_transcription.py /tmp/rescored/temp/test_e2e/piano.mid
75
+ python scripts/analyze_transcription.py /tmp/rescored/temp/test_e2e/piano_clean.mid
76
+ ```
77
+
78
+ **Analysis Includes:**
79
+ - Total note count and density (notes/second)
80
+ - Pitch range and distribution
81
+ - Note duration statistics (average, median, min, max)
82
+ - Velocity (dynamics) analysis
83
+ - Polyphony (simultaneous notes)
84
+ - Detection of potential issues:
85
+ - Very short notes (< 100ms) - likely false positives
86
+ - Very quiet notes (velocity < 30) - likely noise
87
+ - High note density - over-transcription
88
+ - Extreme polyphony - detecting noise as notes
89
+ - Notes outside piano range
90
+
91
+ **Output Example:**
92
+ ```
93
+ ============================================================
94
+ MIDI Transcription Analysis
95
+ ============================================================
96
+ File: piano.mid
97
+ Duration: 248.1 seconds
98
+ Total notes: 1333
99
+ Notes per second: 5.37
100
+
101
+ Pitch Range:
102
+ Lowest: 35 (MIDI) = B1
103
+ Highest: 86 (MIDI) = D6
104
+ Range: 51 semitones
105
+
106
+ Note Durations:
107
+ Average: 0.433 seconds
108
+ Median: 0.325 seconds
109
+ Very short notes (< 100ms): 0 (0.0%)
110
+
111
+ Potential Issues:
112
+ ✓ No obvious issues detected
113
+ ============================================================
114
+ ```
115
+
116
+ ---
117
+
118
+ ## Workflow
119
+
120
+ 1. **Test the pipeline:**
121
+ ```bash
122
+ python scripts/test_e2e.py "https://www.youtube.com/watch?v=VIDEO_ID"
123
+ ```
124
+
125
+ 2. **Analyze the raw output:**
126
+ ```bash
127
+ python scripts/analyze_transcription.py /tmp/rescored/temp/test_e2e/piano.mid
128
+ ```
129
+
130
+ 3. **Analyze the cleaned output:**
131
+ ```bash
132
+ python scripts/analyze_transcription.py /tmp/rescored/temp/test_e2e/piano_clean.mid
133
+ ```
134
+
135
+ 4. **Listen to the result:**
136
+ ```bash
137
+ # Using MuseScore
138
+ musescore /tmp/rescored/temp/test_e2e/test_e2e.musicxml
139
+
140
+ # Or using timidity (MIDI playback)
141
+ timidity /tmp/rescored/temp/test_e2e/piano_clean.mid
142
+ ```
143
+
144
+ ---
145
+
146
+ ## Interpreting Results
147
+
148
+ ### Good Transcription Indicators
149
+ - Notes/second: 3-8 for piano (depends on complexity)
150
+ - Very short notes: < 10%
151
+ - Max polyphony: 3-10 simultaneous notes (piano is typically 2-6)
152
+ - Pitch range: Within MIDI 21-108 (A0 to C8)
153
+ - No significant issues detected
154
+
155
+ ### Warning Signs
156
+ - Notes/second > 10: Likely over-transcribing (too many false positives)
157
+ - Very short notes > 30%: Detecting noise as notes
158
+ - Max polyphony > 15: Probably including noise
159
+ - Many notes outside piano range: Need better filtering
160
+
161
+ ### Tuning Recommendations
162
+ If you see issues, adjust parameters in [pipeline.py](../pipeline.py):
163
+
164
+ **For too many false positives:**
165
+ - Increase `onset-threshold` (0.5 → 0.6)
166
+ - Increase `frame-threshold` (0.4 → 0.45)
167
+ - Increase `minimum-note-length` (127 → 150ms)
168
+
169
+ **For too many missing notes:**
170
+ - Decrease `onset-threshold` (0.5 → 0.45)
171
+ - Decrease `frame-threshold` (0.4 → 0.35)
172
+
173
+ **For timing issues:**
174
+ - Adjust quantization in `clean_midi()` method
175
+ - Change `ticks_per_16th` to `ticks_per_32nd` for lighter quantization
176
+
177
+ ---
178
+
179
+ ## Notes
180
+
181
+ - Scripts must be run from the `backend` directory (they use relative imports)
182
+ - Temporary files are stored in `/tmp/rescored/temp/<job_id>/`
183
+ - MusicXML output is saved in the temp directory with the job_id as filename
184
+ - Analysis works on both raw and cleaned MIDI files for comparison
backend/scripts/analyze_transcription.py ADDED
@@ -0,0 +1,175 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/usr/bin/env python3
2
+ """
3
+ Analyze transcription quality and identify common issues.
4
+
5
+ Usage (from backend directory):
6
+ python scripts/analyze_transcription.py <midi_path>
7
+
8
+ Example:
9
+ python scripts/analyze_transcription.py /tmp/rescored/temp/test_e2e/piano.mid
10
+ """
11
+ import sys
12
+ from pathlib import Path
13
+ import mido
14
+ from collections import Counter
15
+ import statistics
16
+
17
+
18
def analyze_midi(midi_path: Path):
    """Analyze a MIDI file for common transcription issues and print a report.

    Pairs note_on/note_off events per track to reconstruct notes, then prints
    statistics on pitch range, durations, velocities, polyphony, and the most
    frequent pitches, followed by heuristic issue detection and tuning
    recommendations for the basic-pitch transcriber.

    Args:
        midi_path: Path to the MIDI file to analyze.

    Side effects:
        Prints the full report to stdout; returns early (after a message)
        when the file contains no notes.

    NOTE(review): divisions by mid.length assume a non-empty, nonzero-length
    file — a zero-length file with notes would raise ZeroDivisionError; confirm
    this cannot occur upstream.
    """
    mid = mido.MidiFile(midi_path)

    # Collect all notes with timing
    notes = []  # (time, pitch, velocity, duration) — times/durations in ticks

    for track in mid.tracks:
        absolute_time = 0
        active_notes = {}  # pitch -> (start_time, velocity)

        for msg in track:
            # msg.time is a delta in ticks; accumulate to absolute tick time.
            absolute_time += msg.time

            if msg.type == 'note_on' and msg.velocity > 0:
                active_notes[msg.note] = (absolute_time, msg.velocity)

            elif msg.type in ['note_off', 'note_on']:  # note_on with velocity 0 is also note_off
                if msg.note in active_notes:
                    start_time, velocity = active_notes.pop(msg.note)
                    duration = absolute_time - start_time
                    notes.append((start_time, msg.note, velocity, duration))

    if not notes:
        print("No notes found in MIDI file!")
        return

    # Sort notes by time
    notes.sort(key=lambda n: n[0])

    # Analysis
    print("=" * 60)
    print("MIDI Transcription Analysis")
    print("=" * 60)
    print(f"File: {midi_path.name}")
    print(f"Duration: {mid.length:.1f} seconds")
    print(f"Total notes: {len(notes)}")
    print(f"Notes per second: {len(notes) / mid.length:.2f}")
    print()

    # Pitch analysis
    pitches = [n[1] for n in notes]
    pitch_counts = Counter(pitches)
    print("Pitch Range:")
    print(f" Lowest: {min(pitches)} (MIDI) = {_midi_to_note(min(pitches))}")
    print(f" Highest: {max(pitches)} (MIDI) = {_midi_to_note(max(pitches))}")
    print(f" Range: {max(pitches) - min(pitches)} semitones")
    print()

    # Duration analysis.
    # NOTE(review): tick2second is called with a fixed tempo of 500000 µs/beat
    # (120 BPM); set_tempo events in the file are ignored, so durations are
    # approximate for any piece not at 120 BPM — TODO confirm acceptable.
    durations_ticks = [n[3] for n in notes]
    durations_seconds = [mido.tick2second(d, mid.ticks_per_beat, 500000) for d in durations_ticks]
    print("Note Durations:")
    print(f" Average: {statistics.mean(durations_seconds):.3f} seconds")
    print(f" Median: {statistics.median(durations_seconds):.3f} seconds")
    print(f" Min: {min(durations_seconds):.3f} seconds")
    print(f" Max: {max(durations_seconds):.3f} seconds")

    # Identify very short notes (likely noise/false positives)
    very_short_notes = [d for d in durations_seconds if d < 0.1]  # < 100ms
    short_notes = [d for d in durations_seconds if d < 0.2]  # < 200ms
    print(f" Very short notes (< 100ms): {len(very_short_notes)} ({len(very_short_notes)/len(notes)*100:.1f}%)")
    print(f" Short notes (< 200ms): {len(short_notes)} ({len(short_notes)/len(notes)*100:.1f}%)")
    print()

    # Velocity analysis
    velocities = [n[2] for n in notes]
    print("Velocity (dynamics):")
    print(f" Average: {statistics.mean(velocities):.1f}")
    print(f" Min: {min(velocities)}")
    print(f" Max: {max(velocities)}")
    print(f" Range: {max(velocities) - min(velocities)}")

    # Identify very quiet notes (likely noise/false positives)
    quiet_notes = [v for v in velocities if v < 30]
    print(f" Very quiet notes (velocity < 30): {len(quiet_notes)} ({len(quiet_notes)/len(notes)*100:.1f}%)")
    print()

    # Polyphony analysis: counts note ONSETS per 50-tick bucket, so this is a
    # rough proxy for simultaneous notes, not true sustained polyphony.
    time_windows = {}  # time_window -> count
    window_size = 50  # 50 ticks
    for note_time, _, _, _ in notes:
        window = note_time // window_size
        time_windows[window] = time_windows.get(window, 0) + 1

    max_polyphony = max(time_windows.values())
    avg_polyphony = statistics.mean(time_windows.values())
    print("Polyphony (simultaneous notes):")
    print(f" Max simultaneous: ~{max_polyphony}")
    print(f" Average: ~{avg_polyphony:.1f}")
    print()

    # Most common pitches
    print("Most frequent pitches (top 10):")
    for pitch, count in pitch_counts.most_common(10):
        print(f" {_midi_to_note(pitch):>3s} (MIDI {pitch:>2d}): {count:>4d} times ({count/len(notes)*100:>5.1f}%)")
    print()

    # Identify potential issues — thresholds below are heuristics tuned for
    # piano transcription output.
    print("Potential Issues:")
    issues = []

    if len(very_short_notes) / len(notes) > 0.3:
        issues.append(f"⚠️ {len(very_short_notes)/len(notes)*100:.1f}% of notes are very short (< 100ms) - likely false positives")

    if len(quiet_notes) / len(notes) > 0.3:
        issues.append(f"⚠️ {len(quiet_notes)/len(notes)*100:.1f}% of notes are very quiet (velocity < 30) - likely noise")

    if len(notes) / mid.length > 15:
        issues.append(f"⚠️ Very high note density ({len(notes) / mid.length:.1f} notes/sec) - likely over-transcribing")

    if max_polyphony > 20:
        issues.append(f"⚠️ Very high polyphony (max {max_polyphony} notes) - likely detecting noise as notes")

    if min(pitches) < 21 or max(pitches) > 108:
        issues.append(f"⚠️ Notes outside piano range (MIDI 21-108) detected")

    if not issues:
        print(" ✓ No obvious issues detected")
    else:
        for issue in issues:
            print(f" {issue}")

    print()
    # Recommendations mirror the issue thresholds above, one tip per trigger.
    print("Recommendations:")
    if len(very_short_notes) / len(notes) > 0.3:
        print(" • Increase minimum-note-length threshold in basic-pitch")
    if len(quiet_notes) / len(notes) > 0.3:
        print(" • Increase frame-threshold in basic-pitch to ignore quieter notes")
    if len(notes) / mid.length > 15:
        print(" • Increase onset-threshold in basic-pitch to be less sensitive")
    if max_polyphony > 20:
        print(" • Use median filtering or harmonic analysis to remove noise")

    print("=" * 60)
154
+
155
+ def _midi_to_note(midi_num):
156
+ """Convert MIDI number to note name."""
157
+ notes = ['C', 'C#', 'D', 'D#', 'E', 'F', 'F#', 'G', 'G#', 'A', 'A#', 'B']
158
+ octave = (midi_num // 12) - 1
159
+ note = notes[midi_num % 12]
160
+ return f"{note}{octave}"
161
+
162
+
163
if __name__ == "__main__":
    # CLI entry point: expects exactly one argument, the MIDI file path.
    cli_args = sys.argv[1:]
    if not cli_args:
        print("Usage: python analyze_transcription.py <midi_path>")
        print("\nExample:")
        print(" python analyze_transcription.py /tmp/rescored/temp/test_e2e/piano.mid")
        sys.exit(1)

    target = Path(cli_args[0])
    if not target.exists():
        print(f"Error: File not found: {target}")
        sys.exit(1)

    analyze_midi(target)
backend/scripts/diagnose_pipeline.py ADDED
@@ -0,0 +1,307 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/usr/bin/env python3
2
+ """
3
+ Diagnose pipeline accuracy issues by analyzing each stage.
4
+
5
+ Usage (from backend directory):
6
+ python scripts/diagnose_pipeline.py <job_id>
7
+
8
+ Example:
9
+ python scripts/diagnose_pipeline.py test_e2e
10
+ """
11
+ import sys
12
+ from pathlib import Path
13
+ import soundfile as sf
14
+ import numpy as np
15
+ import mido
16
+
17
+ # Add parent directory to path for imports
18
+ sys.path.insert(0, str(Path(__file__).parent.parent))
19
+
20
+ from config import settings
21
+
22
+
23
def analyze_audio_file(audio_path: Path, label: str):
    """Print diagnostic statistics for an audio file.

    Reports duration, sample rate, channel count, peak, RMS, and crest-factor
    dynamic range, then flags clipping, large silent portions, and low overall
    level (a sign of poor source separation).

    Args:
        audio_path: Path to the audio file (read via soundfile).
        label: Heading printed above the stats (e.g. "Downloaded Audio").

    Side effects:
        Prints to stdout; prints a not-found message and returns early when
        the file does not exist.
    """
    print(f"\n{label}:")
    print(f" Path: {audio_path}")

    if not audio_path.exists():
        print(f" ❌ File not found!")
        return

    # Read audio
    data, samplerate = sf.read(audio_path)

    # Calculate statistics
    duration = len(data) / samplerate
    # Mono files come back as a 1-D array; multi-channel as (frames, channels).
    channels = 1 if len(data.shape) == 1 else data.shape[1]

    # RMS energy (loudness); per-channel for multi-channel input.
    if channels == 1:
        rms = np.sqrt(np.mean(data**2))
    else:
        rms = np.sqrt(np.mean(data**2, axis=0))

    # Peak amplitude
    peak = np.max(np.abs(data))

    # Dynamic range as crest factor in dB: 20*log10(peak/rms).
    # The 1e-10 avoids division by zero on digital silence.
    if channels == 1:
        dynamic_range = 20 * np.log10(peak / (rms + 1e-10))
    else:
        dynamic_range = 20 * np.log10(peak / (np.mean(rms) + 1e-10))

    print(f" Duration: {duration:.1f}s")
    print(f" Sample rate: {samplerate} Hz")
    print(f" Channels: {channels}")
    print(f" Peak amplitude: {peak:.3f}")

    if channels == 1:
        print(f" RMS energy: {rms:.3f}")
    else:
        print(f" RMS energy (L/R): {rms[0]:.3f} / {rms[1]:.3f}")

    print(f" Dynamic range: {dynamic_range:.1f} dB")

    # Check for clipping.
    # NOTE(review): for multi-channel audio this counts clipped SAMPLES across
    # all channels but divides by the FRAME count, so the percentage can exceed
    # 100% — confirm whether a per-frame count was intended.
    clipped_samples = np.sum(np.abs(data) >= 0.99)
    if clipped_samples > 0:
        print(f" ⚠️ Clipped samples: {clipped_samples} ({clipped_samples/len(data)*100:.2f}%)")

    # Check for silence (a frame is silent when every channel is below threshold)
    silence_threshold = 0.01
    if channels == 1:
        silent_samples = np.sum(np.abs(data) < silence_threshold)
    else:
        silent_samples = np.sum(np.max(np.abs(data), axis=1) < silence_threshold)

    if silent_samples > len(data) * 0.1:
        print(f" ⚠️ Silence: {silent_samples/len(data)*100:.1f}% of audio")

    # Check if mostly quiet (could indicate poor separation);
    # collapse per-channel RMS to a single average first.
    if isinstance(rms, np.ndarray):
        avg_rms = np.mean(rms)
    else:
        avg_rms = rms

    if avg_rms < 0.01:
        print(f" ⚠️ Very quiet audio (RMS: {avg_rms:.4f}) - may indicate poor source separation")
    elif avg_rms < 0.05:
        print(f" ⚠️ Quiet audio (RMS: {avg_rms:.4f}) - basic-pitch may struggle")
91
+
92
+
93
def analyze_midi_file(midi_path: Path, label: str):
    """Print summary statistics for a MIDI file.

    Reports duration, note count, note density, pitch range, and velocity
    stats. Only note_on events with velocity > 0 are counted as notes.

    Args:
        midi_path: Path to the MIDI file.
        label: Heading printed above the stats (e.g. "Raw MIDI Output").

    Side effects:
        Prints to stdout; prints a not-found message and returns early when
        the file does not exist.
    """
    print(f"\n{label}:")
    print(f" Path: {midi_path}")

    if not midi_path.exists():
        print(f" ❌ File not found!")
        return

    mid = mido.MidiFile(midi_path)

    # Count notes
    note_count = 0
    note_pitches = []
    note_velocities = []

    for track in mid.tracks:
        for msg in track:
            if msg.type == 'note_on' and msg.velocity > 0:
                note_count += 1
                note_pitches.append(msg.note)
                note_velocities.append(msg.velocity)

    print(f" Duration: {mid.length:.1f}s")
    print(f" Total notes: {note_count}")
    # Bug fix: a zero-length file previously raised ZeroDivisionError here.
    if mid.length > 0:
        print(f" Notes per second: {note_count / mid.length:.2f}")
    else:
        print(" Notes per second: n/a (zero-length file)")

    if note_pitches:
        print(f" Pitch range: {min(note_pitches)} - {max(note_pitches)}")
        print(f" Avg velocity: {np.mean(note_velocities):.1f}")
        print(f" Velocity range: {min(note_velocities)} - {max(note_velocities)}")
124
+
125
+
126
def diagnose_job(job_id: str):
    """Print a stage-by-stage diagnostic report for a transcription job.

    Walks the job's temp directory and analyzes the artifacts of each
    pipeline stage (download, Demucs separation, basic-pitch transcription,
    MIDI cleaning), then prints a summary of likely issues and manual
    next-step recommendations.

    Args:
        job_id: Job identifier; artifacts are read from
            <settings.storage_path>/temp/<job_id>.

    Side effects:
        Prints to stdout; exits the process (sys.exit(1)) when the job
        directory does not exist.

    NOTE(review): several ratios below (energy ratios, notes-removed %,
    note density) divide by values that could be zero for degenerate
    inputs (silent audio, empty MIDI) — confirm upstream guarantees.
    """
    storage_path = Path(settings.storage_path)
    job_dir = storage_path / "temp" / job_id

    print("=" * 60)
    print("PIPELINE DIAGNOSTIC REPORT")
    print("=" * 60)
    print(f"Job ID: {job_id}")
    print(f"Job Directory: {job_dir}")

    if not job_dir.exists():
        print(f"\n❌ Job directory not found: {job_dir}")
        print("\nRun test_e2e.py first to create a job:")
        print(f' python scripts/test_e2e.py "https://www.youtube.com/watch?v=VIDEO_ID"')
        sys.exit(1)

    print("\n" + "=" * 60)
    print("STAGE 1: AUDIO DOWNLOAD")
    print("=" * 60)

    audio_path = job_dir / "audio.wav"
    analyze_audio_file(audio_path, "Downloaded Audio")

    print("\n" + "=" * 60)
    print("STAGE 2: SOURCE SEPARATION (Demucs)")
    print("=" * 60)

    # Demucs writes stems under <job_dir>/htdemucs/<input-name>/.
    demucs_dir = job_dir / "htdemucs" / "audio"
    other_stem = demucs_dir / "other.wav"
    no_other_stem = demucs_dir / "no_other.wav"

    analyze_audio_file(other_stem, "Other Stem (Piano/Melodic)")
    analyze_audio_file(no_other_stem, "No-Other Stem (Drums/Bass/Vocals)")

    # Compare separation quality via energy split between the two stems.
    if audio_path.exists() and other_stem.exists() and no_other_stem.exists():
        print("\n Separation Quality Check:")

        # Read all audio
        original, sr = sf.read(audio_path)
        other, _ = sf.read(other_stem)
        no_other, _ = sf.read(no_other_stem)

        # Calculate energy distribution
        original_energy = np.sum(original**2)
        other_energy = np.sum(other**2)
        no_other_energy = np.sum(no_other**2)
        total_separated_energy = other_energy + no_other_energy

        print(f" Original energy: {original_energy:.2e}")
        print(f" Other energy: {other_energy:.2e} ({other_energy/original_energy*100:.1f}%)")
        print(f" No-other energy: {no_other_energy:.2e} ({no_other_energy/original_energy*100:.1f}%)")
        print(f" Energy preservation: {total_separated_energy/original_energy*100:.1f}%")

        # Check if 'other' stem is too quiet (bad separation)
        if other_energy / original_energy < 0.1:
            print(f" ⚠️ 'Other' stem has very low energy - poor separation for melodic content")
        elif other_energy / original_energy < 0.2:
            print(f" ⚠️ 'Other' stem has low energy - separation may not be ideal")

    print("\n" + "=" * 60)
    print("STAGE 3: TRANSCRIPTION (basic-pitch)")
    print("=" * 60)

    piano_midi = job_dir / "piano.mid"
    analyze_midi_file(piano_midi, "Raw MIDI Output")

    print("\n" + "=" * 60)
    print("STAGE 4: MIDI CLEANING")
    print("=" * 60)

    clean_midi = job_dir / "piano_clean.mid"
    analyze_midi_file(clean_midi, "Cleaned MIDI Output")

    # Compare raw vs cleaned note counts to gauge how aggressive cleaning was.
    if piano_midi.exists() and clean_midi.exists():
        raw_mid = mido.MidiFile(piano_midi)
        clean_mid = mido.MidiFile(clean_midi)

        raw_notes = sum(1 for track in raw_mid.tracks for msg in track if msg.type == 'note_on' and msg.velocity > 0)
        clean_notes = sum(1 for track in clean_mid.tracks for msg in track if msg.type == 'note_on' and msg.velocity > 0)

        removed_notes = raw_notes - clean_notes
        print(f"\n Cleaning Impact:")
        print(f" Notes removed: {removed_notes} ({removed_notes/raw_notes*100:.1f}%)")

        if removed_notes / raw_notes > 0.5:
            print(f" ⚠️ Removed >50% of notes - cleaning may be too aggressive")

    print("\n" + "=" * 60)
    print("DIAGNOSIS SUMMARY")
    print("=" * 60)

    # Provide recommendations based on analysis
    print("\nPotential Issues:")

    issues_found = False

    # Check 1: Source separation quality (low RMS in the 'other' stem
    # means the piano content likely leaked into other stems).
    if other_stem.exists():
        other_data, _ = sf.read(other_stem)
        other_rms = np.sqrt(np.mean(other_data**2))

        if other_rms < 0.05:
            print(" ⚠️ 'Other' stem is very quiet - Demucs may not be separating piano well")
            print(" → This is the most likely cause of poor transcription accuracy")
            print(" → The piano might be mixed with other instruments in different stems")
            issues_found = True

    # Check 2: Note density (too low = under-detection, too high = noise).
    if piano_midi.exists():
        mid = mido.MidiFile(piano_midi)
        note_count = sum(1 for track in mid.tracks for msg in track if msg.type == 'note_on' and msg.velocity > 0)
        density = note_count / mid.length

        if density < 2:
            print(" ⚠️ Very low note density - basic-pitch may be too conservative")
            print(" → Try decreasing onset-threshold and frame-threshold")
            issues_found = True
        elif density > 10:
            print(" ⚠️ Very high note density - basic-pitch may be too aggressive")
            print(" → Current thresholds might already be good; check if it's detecting noise")
            issues_found = True

    if not issues_found:
        print(" No obvious technical issues detected")
        print(" The problem may be:")
        print(" • Music is too complex for current models")
        print(" • Need better source separation (try different Demucs model)")
        print(" • basic-pitch limitations with this type of music")

    print("\n" + "=" * 60)
    print("RECOMMENDATIONS")
    print("=" * 60)

    print("""
 Next steps to improve accuracy:

 1. LISTEN to the separated stems:
 - Play 'other.wav' to verify piano is properly separated
 - If piano is barely audible, source separation failed

 2. Try different Demucs models:
 - Current: htdemucs with --two-stems=other
 - Try: htdemucs_6s (6-stem with dedicated piano separation)
 - Command: demucs --model htdemucs_6s audio.wav

 3. Test with simpler music:
 - Solo piano (no other instruments)
 - Clear, slow melodies
 - This helps isolate if issue is separation or transcription

 4. Compare with ground truth:
 - Find sheet music for the test song
 - Compare transcribed notes with actual notes
 - Identify patterns (missing high notes? wrong octaves?)

 5. Try alternative transcription models:
 - MT3 (Music Transformer) - slower but more accurate
 - Omnizart piano model - specialized for piano
 """)

    print("=" * 60)
    print("\nTo listen to the separated 'other' stem:")
    print(f" play {other_stem}")
    print(f" # or")
    print(f" ffplay {other_stem}")
    print("=" * 60)
295
+
296
+
297
if __name__ == "__main__":
    # CLI entry point: requires a job id; otherwise show usage and exit.
    cli_args = sys.argv[1:]
    if not cli_args:
        print("Usage: python scripts/diagnose_pipeline.py <job_id>")
        print("\nExample:")
        print(" python scripts/diagnose_pipeline.py test_e2e")
        print("\nFirst run test_e2e.py to create a job:")
        print(' python scripts/test_e2e.py "https://www.youtube.com/watch?v=VIDEO_ID"')
        sys.exit(1)

    diagnose_job(cli_args[0])
backend/scripts/test_accuracy.py ADDED
@@ -0,0 +1,277 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/usr/bin/env python3
2
+ """
3
+ Accuracy Testing Suite for Rescored Pipeline
4
+
5
+ Tests transcription accuracy on 10 diverse piano videos covering different styles and complexities.
6
+ """
7
+ import sys
8
+ from pathlib import Path
9
+ sys.path.insert(0, str(Path(__file__).parent.parent))
10
+
11
+ from pipeline import TranscriptionPipeline
12
+ from config import settings
13
+ import json
14
+ from datetime import datetime
15
+
16
+
17
+ # Test videos with varying complexity
18
# Each entry: 'id' (job-id suffix), 'url' (YouTube source), 'description',
# 'difficulty' (easy/medium/hard/very_hard), 'expected_accuracy' (informal
# target used in the printed report), and free-form 'notes'.
TEST_VIDEOS = [
    {
        "id": "simple_melody",
        "url": "https://www.youtube.com/watch?v=TK1Ij_-mank",
        "description": "Simple piano melody - C major scale practice",
        "difficulty": "easy",
        "expected_accuracy": ">80%",
        "notes": "Slow tempo, single notes, clear recording"
    },
    {
        "id": "twinkle_twinkle",
        "url": "https://www.youtube.com/watch?v=YCZ_d_4ZEqk",
        "description": "Twinkle Twinkle Little Star - Beginner piano",
        "difficulty": "easy",
        "expected_accuracy": ">75%",
        "notes": "Very simple melody, slow tempo"
    },
    {
        "id": "fur_elise",
        "url": "https://www.youtube.com/watch?v=_mVW8tgGY_w",
        "description": "Beethoven - Für Elise (simplified)",
        "difficulty": "medium",
        "expected_accuracy": "60-70%",
        "notes": "Classic piece, moderate tempo, some ornaments"
    },
    {
        "id": "chopin_nocturne",
        "url": "https://www.youtube.com/watch?v=9E6b3swbnWg",
        "description": "Chopin - Nocturne Op. 9 No. 2",
        "difficulty": "hard",
        "expected_accuracy": "50-60%",
        "notes": "Complex harmonies, expressive dynamics, rubato"
    },
    {
        "id": "canon_in_d",
        "url": "https://www.youtube.com/watch?v=NlprozGcs80",
        "description": "Pachelbel - Canon in D (piano arrangement)",
        "difficulty": "medium",
        "expected_accuracy": "60-70%",
        "notes": "Repetitive patterns, moderate polyphony"
    },
    {
        "id": "river_flows",
        "url": "https://www.youtube.com/watch?v=7maJOI3QMu0",
        "description": "Yiruma - River Flows in You",
        "difficulty": "medium",
        "expected_accuracy": "60-70%",
        "notes": "Modern piano, flowing arpeggios"
    },
    {
        "id": "moonlight_sonata",
        "url": "https://www.youtube.com/watch?v=4Tr0otuiQuU",
        "description": "Beethoven - Moonlight Sonata (1st movement)",
        "difficulty": "medium",
        "expected_accuracy": "60-70%",
        "notes": "Slow tempo, triplet arpeggios, bass notes"
    },
    {
        "id": "jazz_blues",
        "url": "https://www.youtube.com/watch?v=F3W_alUuFkA",
        "description": "Simple jazz blues piano",
        "difficulty": "medium",
        "expected_accuracy": "55-65%",
        "notes": "Swing rhythm, blue notes, syncopation"
    },
    {
        "id": "claire_de_lune",
        "url": "https://www.youtube.com/watch?v=WNcsUNKlAKw",
        "description": "Debussy - Clair de Lune",
        "difficulty": "hard",
        "expected_accuracy": "50-60%",
        "notes": "Impressionist harmony, complex textures"
    },
    {
        "id": "la_campanella",
        "url": "https://www.youtube.com/watch?v=MD6xMyuZls0",
        "description": "Liszt - La Campanella",
        "difficulty": "very_hard",
        "expected_accuracy": "40-50%",
        "notes": "Virtuosic, extremely fast, wide range, many notes"
    }
]
100
+
101
+
102
+ def run_accuracy_test(video, verbose=True):
103
+ """
104
+ Run transcription pipeline on a test video and collect metrics.
105
+
106
+ Args:
107
+ video: Dictionary with video metadata
108
+ verbose: Print progress messages
109
+
110
+ Returns:
111
+ Dictionary with test results and metrics
112
+ """
113
+ if verbose:
114
+ print(f"\n{'='*70}")
115
+ print(f"Testing: {video['description']}")
116
+ print(f"Difficulty: {video['difficulty']} | Expected: {video['expected_accuracy']}")
117
+ print(f"{'='*70}")
118
+
119
+ job_id = f"accuracy_test_{video['id']}"
120
+ storage_path = Path(settings.storage_path)
121
+
122
+ # Progress callback
123
+ def progress_callback(percent, stage, message):
124
+ if verbose:
125
+ print(f"[{percent:3d}%] {stage:12s} | {message}")
126
+
127
+ result = {
128
+ "video_id": video["id"],
129
+ "description": video["description"],
130
+ "difficulty": video["difficulty"],
131
+ "url": video["url"],
132
+ "timestamp": datetime.utcnow().isoformat(),
133
+ "success": False,
134
+ "error": None,
135
+ "metrics": {}
136
+ }
137
+
138
+ try:
139
+ # Run pipeline
140
+ pipeline = TranscriptionPipeline(job_id, video["url"], storage_path)
141
+ pipeline.set_progress_callback(progress_callback)
142
+
143
+ musicxml_path = pipeline.run()
144
+
145
+ # Get intermediate file paths for analysis
146
+ temp_dir = pipeline.temp_dir
147
+ original_audio = temp_dir / "audio.wav"
148
+ other_stem = temp_dir / "htdemucs" / job_id / "other.wav"
149
+ midi_path = temp_dir / "other_basic_pitch.mid"
150
+ clean_midi = temp_dir / "piano_clean.mid"
151
+
152
+ # Collect metrics
153
+ import soundfile as sf
154
+ import mido
155
+
156
+ # Audio metrics
157
+ if original_audio.exists():
158
+ audio_data, sr = sf.read(original_audio)
159
+ result["metrics"]["audio_duration_seconds"] = len(audio_data) / sr
160
+
161
+ # Separation quality (simple energy ratio)
162
+ if original_audio.exists() and other_stem.exists():
163
+ import numpy as np
164
+ original_data, _ = sf.read(original_audio)
165
+ other_data, _ = sf.read(other_stem)
166
+
167
+ original_energy = np.sum(original_data ** 2)
168
+ other_energy = np.sum(other_data ** 2)
169
+
170
+ result["metrics"]["separation"] = {
171
+ "other_energy_ratio": other_energy / original_energy if original_energy > 0 else 0
172
+ }
173
+
174
+ # MIDI analysis (simple note count)
175
+ if clean_midi.exists():
176
+ mid = mido.MidiFile(clean_midi)
177
+ note_count = sum(1 for track in mid.tracks for msg in track if msg.type == 'note_on')
178
+
179
+ result["metrics"]["midi"] = {
180
+ "total_notes": note_count,
181
+ "duration_seconds": mid.length
182
+ }
183
+
184
+ # MusicXML analysis (measure count, etc)
185
+ if musicxml_path.exists():
186
+ from music21 import converter
187
+ score = converter.parse(musicxml_path)
188
+ measures = score.parts[0].getElementsByClass('Measure') if score.parts else []
189
+
190
+ result["metrics"]["musicxml"] = {
191
+ "total_measures": len(measures),
192
+ "file_size_kb": musicxml_path.stat().st_size / 1024
193
+ }
194
+
195
+ result["success"] = True
196
+ result["output_files"] = {
197
+ "musicxml": str(musicxml_path),
198
+ "midi": str(clean_midi),
199
+ "temp_dir": str(temp_dir)
200
+ }
201
+
202
+ if verbose:
203
+ print(f"\n✅ SUCCESS - Output: {musicxml_path}")
204
+ print(f" MIDI notes: {result['metrics']['midi']['total_notes']}")
205
+ print(f" Measures: {result['metrics']['musicxml']['total_measures']}")
206
+ if 'separation' in result['metrics']:
207
+ sep = result['metrics']['separation']
208
+ print(f" Separation: {sep['other_energy_ratio']:.1%} energy in 'other' stem")
209
+
210
+ except Exception as e:
211
+ result["error"] = str(e)
212
+ if verbose:
213
+ print(f"\n❌ FAILED - Error: {e}")
214
+
215
+ return result
216
+
217
+
218
+ def main():
219
+ """Run accuracy tests on all test videos."""
220
+ print("="*70)
221
+ print("Rescored Accuracy Testing Suite")
222
+ print("="*70)
223
+ print(f"Testing {len(TEST_VIDEOS)} videos with varying difficulty")
224
+ print(f"Storage: {settings.storage_path}")
225
+ print()
226
+
227
+ # Run tests
228
+ results = []
229
+ for i, video in enumerate(TEST_VIDEOS, 1):
230
+ print(f"\n[{i}/{len(TEST_VIDEOS)}] Starting test: {video['id']}")
231
+ result = run_accuracy_test(video, verbose=True)
232
+ results.append(result)
233
+
234
+ # Summary
235
+ print("\n" + "="*70)
236
+ print("ACCURACY TEST SUMMARY")
237
+ print("="*70)
238
+
239
+ successful = [r for r in results if r["success"]]
240
+ failed = [r for r in results if not r["success"]]
241
+
242
+ print(f"\nTotal: {len(results)} | Success: {len(successful)} | Failed: {len(failed)}")
243
+ print(f"Success Rate: {len(successful)/len(results)*100:.1f}%")
244
+
245
+ if successful:
246
+ print("\n✅ Successful Transcriptions:")
247
+ for r in successful:
248
+ midi_notes = r["metrics"]["midi"]["total_notes"]
249
+ measures = r["metrics"]["musicxml"]["total_measures"]
250
+ print(f" - {r['video_id']:20s} | {midi_notes:4d} notes | {measures:3d} measures | {r['difficulty']}")
251
+
252
+ if failed:
253
+ print("\n❌ Failed Transcriptions:")
254
+ for r in failed:
255
+ print(f" - {r['video_id']:20s} | Error: {r['error'][:60]}")
256
+
257
+ # Save results to JSON
258
+ output_path = Path(settings.storage_path) / "accuracy_test_results.json"
259
+ output_path.parent.mkdir(parents=True, exist_ok=True)
260
+
261
+ with open(output_path, 'w') as f:
262
+ json.dump({
263
+ "test_date": datetime.utcnow().isoformat(),
264
+ "total_tests": len(results),
265
+ "successful": len(successful),
266
+ "failed": len(failed),
267
+ "success_rate": len(successful) / len(results),
268
+ "results": results
269
+ }, f, indent=2)
270
+
271
+ print(f"\n📊 Full results saved to: {output_path}")
272
+
273
+ return 0 if not failed else 1
274
+
275
+
276
+ if __name__ == "__main__":
277
+ sys.exit(main())
backend/scripts/test_demucs_models.py ADDED
@@ -0,0 +1,199 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/usr/bin/env python3
2
+ """
3
+ Test different Demucs models to find the best source separation.
4
+
5
+ Usage (from backend directory):
6
+ python scripts/test_demucs_models.py <audio_path>
7
+
8
+ Example:
9
+ python scripts/test_demucs_models.py /tmp/rescored/temp/test_e2e/audio.wav
10
+ """
11
+ import sys
12
+ from pathlib import Path
13
+ import subprocess
14
+ import soundfile as sf
15
+ import numpy as np
16
+ import tempfile
17
+ import shutil
18
+
19
+
20
+ def test_demucs_model(audio_path: Path, model_name: str, stems: str = None):
21
+ """Test a specific Demucs model."""
22
+ print(f"\n{'='*60}")
23
+ print(f"Testing: {model_name}")
24
+ print(f"{'='*60}")
25
+
26
+ # Create temp directory for this test
27
+ with tempfile.TemporaryDirectory() as temp_dir:
28
+ temp_path = Path(temp_dir)
29
+
30
+ # Build command
31
+ cmd = ["demucs", "--model", model_name, "-o", str(temp_path), str(audio_path)]
32
+
33
+ if stems:
34
+ cmd.extend(["--two-stems", stems])
35
+
36
+ print(f"Command: {' '.join(cmd)}")
37
+ print("Running... (this may take a minute)")
38
+
39
+ # Run Demucs
40
+ try:
41
+ result = subprocess.run(cmd, capture_output=True, text=True, timeout=300)
42
+
43
+ if result.returncode != 0:
44
+ print(f"❌ Failed: {result.stderr[:500]}")
45
+ return None
46
+
47
+ # Find output directory
48
+ model_output_dir = temp_path / model_name / audio_path.stem
49
+
50
+ if not model_output_dir.exists():
51
+ print(f"❌ Output directory not found: {model_output_dir}")
52
+ return None
53
+
54
+ # Analyze stems
55
+ print("\nStem Analysis:")
56
+ original_data, sr = sf.read(audio_path)
57
+ original_energy = np.sum(original_data**2)
58
+
59
+ stem_energies = {}
60
+
61
+ for stem_file in sorted(model_output_dir.glob("*.wav")):
62
+ stem_name = stem_file.stem
63
+ stem_data, _ = sf.read(stem_file)
64
+ stem_energy = np.sum(stem_data**2)
65
+ stem_rms = np.sqrt(np.mean(stem_data**2))
66
+
67
+ percentage = (stem_energy / original_energy) * 100
68
+ stem_energies[stem_name] = (stem_energy, stem_rms, percentage)
69
+
70
+ print(f" {stem_name:15s}: {percentage:5.1f}% energy, RMS: {stem_rms:.3f}")
71
+
72
+ # Find best stem for piano/melodic content
73
+ # Usually 'other', 'piano', or 'other' in 2-stem
74
+ print("\nBest stem for piano:")
75
+
76
+ if 'piano' in stem_energies:
77
+ best_stem = 'piano'
78
+ print(f" ✓ Dedicated 'piano' stem found")
79
+ elif 'other' in stem_energies:
80
+ best_stem = 'other'
81
+ print(f" ✓ Using 'other' stem")
82
+ else:
83
+ # Find stem with most energy
84
+ best_stem = max(stem_energies.items(), key=lambda x: x[1][0])[0]
85
+ print(f" → Using '{best_stem}' (highest energy)")
86
+
87
+ energy, rms, percentage = stem_energies[best_stem]
88
+ print(f" Energy: {percentage:.1f}%, RMS: {rms:.3f}")
89
+
90
+ if percentage < 15:
91
+ print(f" ⚠️ Very low energy - may not work well")
92
+ elif percentage < 25:
93
+ print(f" ⚠️ Low energy - borderline")
94
+ else:
95
+ print(f" ✓ Good energy level")
96
+
97
+ return {
98
+ 'model': model_name,
99
+ 'best_stem': best_stem,
100
+ 'energy_percentage': percentage,
101
+ 'rms': rms,
102
+ 'all_stems': stem_energies
103
+ }
104
+
105
+ except subprocess.TimeoutExpired:
106
+ print(f"❌ Timeout after 5 minutes")
107
+ return None
108
+ except Exception as e:
109
+ print(f"❌ Error: {e}")
110
+ return None
111
+
112
+
113
+ def main():
114
+ if len(sys.argv) < 2:
115
+ print("Usage: python scripts/test_demucs_models.py <audio_path>")
116
+ print("\nExample:")
117
+ print(" python scripts/test_demucs_models.py /tmp/rescored/temp/test_e2e/audio.wav")
118
+ sys.exit(1)
119
+
120
+ audio_path = Path(sys.argv[1])
121
+
122
+ if not audio_path.exists():
123
+ print(f"Error: Audio file not found: {audio_path}")
124
+ sys.exit(1)
125
+
126
+ print("=" * 60)
127
+ print("DEMUCS MODEL COMPARISON")
128
+ print("=" * 60)
129
+ print(f"Audio file: {audio_path}")
130
+ print(f"Duration: ~{sf.info(audio_path).duration:.1f}s")
131
+
132
+ # Test different models
133
+ results = []
134
+
135
+ # Test 1: Current model (htdemucs 2-stem)
136
+ print("\n\n" + "="*60)
137
+ print("TEST 1: htdemucs (2-stem: other)")
138
+ print("="*60)
139
+ result = test_demucs_model(audio_path, "htdemucs", stems="other")
140
+ if result:
141
+ results.append(result)
142
+
143
+ # Test 2: htdemucs_6s (6-stem with dedicated piano)
144
+ print("\n\n" + "="*60)
145
+ print("TEST 2: htdemucs_6s (6-stem with piano)")
146
+ print("="*60)
147
+ result = test_demucs_model(audio_path, "htdemucs_6s")
148
+ if result:
149
+ results.append(result)
150
+
151
+ # Test 3: htdemucs full 4-stem
152
+ print("\n\n" + "="*60)
153
+ print("TEST 3: htdemucs (4-stem)")
154
+ print("="*60)
155
+ result = test_demucs_model(audio_path, "htdemucs")
156
+ if result:
157
+ results.append(result)
158
+
159
+ # Summary
160
+ print("\n\n" + "="*60)
161
+ print("SUMMARY & RECOMMENDATIONS")
162
+ print("="*60)
163
+
164
+ if not results:
165
+ print("No successful tests!")
166
+ sys.exit(1)
167
+
168
+ # Sort by energy percentage
169
+ results.sort(key=lambda x: x['energy_percentage'], reverse=True)
170
+
171
+ print("\nRanking (by piano/melodic energy):")
172
+ for i, result in enumerate(results, 1):
173
+ print(f"{i}. {result['model']:20s} - {result['best_stem']:10s} - "
174
+ f"{result['energy_percentage']:5.1f}% energy, RMS: {result['rms']:.3f}")
175
+
176
+ best_result = results[0]
177
+ print(f"\n✓ RECOMMENDED: Use {best_result['model']} with '{best_result['best_stem']}' stem")
178
+
179
+ if best_result['energy_percentage'] < 20:
180
+ print("\n⚠️ WARNING: Even the best model has low energy (<20%)")
181
+ print(" This suggests:")
182
+ print(" - The audio may not have much piano/melodic content")
183
+ print(" - The piano may be heavily mixed with other instruments")
184
+ print(" - You may need to try a different test video")
185
+
186
+ print("\nTo update pipeline.py:")
187
+ if best_result['model'] == 'htdemucs_6s':
188
+ print(f" 1. Change line ~98: --two-stems=other → remove this flag")
189
+ print(f" 2. Change line ~96: demucs_output / 'htdemucs_6s' / audio_path.stem")
190
+ print(f" 3. Use stem: {best_result['best_stem']}.wav")
191
+ elif best_result['model'] == 'htdemucs' and '--two-stems' not in str(best_result):
192
+ print(f" 1. Change line ~98: --two-stems=other → remove this flag")
193
+ print(f" 2. Use stem: {best_result['best_stem']}.wav")
194
+
195
+ print("\n" + "="*60)
196
+
197
+
198
+ if __name__ == "__main__":
199
+ main()
backend/scripts/test_e2e.py ADDED
@@ -0,0 +1,106 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/usr/bin/env python3
2
+ """
3
+ End-to-end test script for the transcription pipeline.
4
+
5
+ Usage (from backend directory):
6
+ python scripts/test_e2e.py <youtube_url>
7
+
8
+ Example:
9
+ python scripts/test_e2e.py "https://www.youtube.com/watch?v=dQw4w9WgXcQ"
10
+ """
11
+ import sys
12
+ from pathlib import Path
13
+
14
+ # Add parent directory to path for imports
15
+ sys.path.insert(0, str(Path(__file__).parent.parent))
16
+
17
+ from pipeline import TranscriptionPipeline
18
+ from config import settings
19
+ import time
20
+
21
+
22
+ def progress_callback(percent: int, stage: str, message: str):
23
+ """Print progress updates."""
24
+ print(f"[{percent:3d}%] {stage:12s} | {message}")
25
+
26
+
27
+ def main():
28
+ if len(sys.argv) < 2:
29
+ print("Usage: python test_e2e.py <youtube_url>")
30
+ print("\nExample simple piano videos to test:")
31
+ print("1. Twinkle Twinkle: https://www.youtube.com/watch?v=WyTb3DTu88c")
32
+ print("2. Simple melody: https://www.youtube.com/watch?v=fJ9rUzIMcZQ")
33
+ sys.exit(1)
34
+
35
+ youtube_url = sys.argv[1]
36
+ job_id = "test_e2e"
37
+ storage_path = Path(settings.storage_path)
38
+
39
+ print("=" * 60)
40
+ print("Rescored End-to-End Pipeline Test")
41
+ print("=" * 60)
42
+ print(f"YouTube URL: {youtube_url}")
43
+ print(f"Job ID: {job_id}")
44
+ print(f"Storage: {storage_path}")
45
+ print("=" * 60)
46
+ print()
47
+
48
+ # Create pipeline
49
+ pipeline = TranscriptionPipeline(job_id, youtube_url, storage_path)
50
+ pipeline.set_progress_callback(progress_callback)
51
+
52
+ # Run pipeline
53
+ try:
54
+ start_time = time.time()
55
+ musicxml_path = pipeline.run()
56
+ elapsed_time = time.time() - start_time
57
+
58
+ print()
59
+ print("=" * 60)
60
+ print("SUCCESS!")
61
+ print("=" * 60)
62
+ print(f"Total time: {elapsed_time:.1f} seconds")
63
+ print(f"MusicXML file: {musicxml_path}")
64
+ print(f"File size: {musicxml_path.stat().st_size / 1024:.1f} KB")
65
+ print()
66
+
67
+ # Show temp directory contents
68
+ print("Intermediate files:")
69
+ temp_dir = storage_path / "temp" / job_id
70
+ for file in sorted(temp_dir.rglob("*")):
71
+ if file.is_file():
72
+ size_kb = file.stat().st_size / 1024
73
+ rel_path = file.relative_to(temp_dir)
74
+ print(f" {rel_path} ({size_kb:.1f} KB)")
75
+ print()
76
+
77
+ # Preview MusicXML
78
+ print("MusicXML preview (first 50 lines):")
79
+ print("-" * 60)
80
+ with open(musicxml_path, 'r') as f:
81
+ for i, line in enumerate(f):
82
+ if i >= 50:
83
+ print("... (truncated)")
84
+ break
85
+ print(line.rstrip())
86
+ print("-" * 60)
87
+ print()
88
+
89
+ print("Next steps:")
90
+ print(f"1. Open in MuseScore: musescore {musicxml_path}")
91
+ print(f"2. Inspect MIDI: timidity {temp_dir}/piano_clean.mid")
92
+ print(f"3. Review temp files: ls -lh {temp_dir}")
93
+
94
+ except Exception as e:
95
+ print()
96
+ print("=" * 60)
97
+ print("FAILED!")
98
+ print("=" * 60)
99
+ print(f"Error: {e}")
100
+ import traceback
101
+ traceback.print_exc()
102
+ sys.exit(1)
103
+
104
+
105
+ if __name__ == "__main__":
106
+ main()
backend/scripts/test_quick_verify.py ADDED
@@ -0,0 +1,142 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/usr/bin/env python3
2
+ """
3
+ Quick verification test - only runs the 6 videos that had code bugs (now fixed).
4
+
5
+ This is faster than the full suite and verifies our bug fixes work.
6
+ """
7
+ import sys
8
+ from pathlib import Path
9
+ sys.path.insert(0, str(Path(__file__).parent.parent))
10
+
11
+ from test_accuracy import run_accuracy_test
12
+ import json
13
+ from datetime import datetime
14
+
15
+ # Only test the 6 videos that had code bugs (should all pass now)
16
+ QUICK_TEST_VIDEOS = [
17
+ {
18
+ "id": "chopin_nocturne",
19
+ "url": "https://www.youtube.com/watch?v=9E6b3swbnWg",
20
+ "description": "Chopin - Nocturne Op. 9 No. 2",
21
+ "difficulty": "hard",
22
+ "expected_accuracy": "50-60%",
23
+ "notes": "2048th note duration (Bug #2b)",
24
+ "bug": "2048th note duration (Bug #2b)"
25
+ },
26
+ {
27
+ "id": "canon_in_d",
28
+ "url": "https://www.youtube.com/watch?v=NlprozGcs80",
29
+ "description": "Pachelbel - Canon in D",
30
+ "difficulty": "medium",
31
+ "expected_accuracy": "60-70%",
32
+ "notes": "NoneType velocity (Bug #2a)",
33
+ "bug": "NoneType velocity (Bug #2a)"
34
+ },
35
+ {
36
+ "id": "river_flows",
37
+ "url": "https://www.youtube.com/watch?v=7maJOI3QMu0",
38
+ "description": "Yiruma - River Flows in You",
39
+ "difficulty": "medium",
40
+ "expected_accuracy": "60-70%",
41
+ "notes": "NoneType velocity (Bug #2a)",
42
+ "bug": "NoneType velocity (Bug #2a)"
43
+ },
44
+ {
45
+ "id": "moonlight_sonata",
46
+ "url": "https://www.youtube.com/watch?v=4Tr0otuiQuU",
47
+ "description": "Beethoven - Moonlight Sonata",
48
+ "difficulty": "medium",
49
+ "expected_accuracy": "60-70%",
50
+ "notes": "NoneType velocity (Bug #2a)",
51
+ "bug": "NoneType velocity (Bug #2a)"
52
+ },
53
+ {
54
+ "id": "claire_de_lune",
55
+ "url": "https://www.youtube.com/watch?v=WNcsUNKlAKw",
56
+ "description": "Debussy - Clair de Lune",
57
+ "difficulty": "hard",
58
+ "expected_accuracy": "50-60%",
59
+ "notes": "2048th note duration (Bug #2b)",
60
+ "bug": "2048th note duration (Bug #2b)"
61
+ },
62
+ {
63
+ "id": "la_campanella",
64
+ "url": "https://www.youtube.com/watch?v=MD6xMyuZls0",
65
+ "description": "Liszt - La Campanella",
66
+ "difficulty": "very_hard",
67
+ "expected_accuracy": "40-50%",
68
+ "notes": "NoneType velocity (Bug #2a)",
69
+ "bug": "NoneType velocity (Bug #2a)"
70
+ }
71
+ ]
72
+
73
+ def main():
74
+ """Run quick verification tests."""
75
+ print("="*70)
76
+ print("Quick Verification Test - Bug Fixes")
77
+ print("="*70)
78
+ print(f"Testing {len(QUICK_TEST_VIDEOS)} videos that previously failed")
79
+ print("All should now succeed (verifies bug fixes)")
80
+ print()
81
+
82
+ results = []
83
+ for i, video in enumerate(QUICK_TEST_VIDEOS, 1):
84
+ print(f"\n[{i}/{len(QUICK_TEST_VIDEOS)}] Testing: {video['id']}")
85
+ print(f"Previous error: {video['bug']}")
86
+
87
+ result = run_accuracy_test(video, verbose=True)
88
+ results.append(result)
89
+
90
+ # Summary
91
+ print("\n" + "="*70)
92
+ print("QUICK VERIFICATION SUMMARY")
93
+ print("="*70)
94
+
95
+ successful = [r for r in results if r["success"]]
96
+ failed = [r for r in results if not r["success"]]
97
+
98
+ print(f"\nTotal: {len(results)} | Success: {len(successful)} | Failed: {len(failed)}")
99
+ print(f"Success Rate: {len(successful)/len(results)*100:.1f}%")
100
+
101
+ if successful:
102
+ print("\n✅ Bug Fixes Verified - Successful Transcriptions:")
103
+ for r in successful:
104
+ if "midi" in r["metrics"] and "musicxml" in r["metrics"]:
105
+ notes = r["metrics"]["midi"]["total_notes"]
106
+ measures = r["metrics"]["musicxml"]["total_measures"]
107
+ print(f" - {r['video_id']:20s} | {notes:4d} notes | {measures:3d} measures")
108
+
109
+ if failed:
110
+ print("\n❌ Still Failing:")
111
+ for r in failed:
112
+ error_preview = r["error"][:80] if r["error"] else "Unknown"
113
+ print(f" - {r['video_id']:20s} | {error_preview}")
114
+
115
+ # Save results
116
+ from config import settings
117
+ output_path = Path(settings.storage_path) / "quick_verify_results.json"
118
+ output_path.parent.mkdir(parents=True, exist_ok=True)
119
+
120
+ with open(output_path, 'w') as f:
121
+ json.dump({
122
+ "test_date": datetime.utcnow().isoformat(),
123
+ "test_type": "bug_fix_verification",
124
+ "total_tests": len(results),
125
+ "successful": len(successful),
126
+ "failed": len(failed),
127
+ "success_rate": len(successful) / len(results),
128
+ "results": results
129
+ }, f, indent=2)
130
+
131
+ print(f"\n📊 Results saved to: {output_path}")
132
+
133
+ if len(successful) == len(results):
134
+ print("\n🎉 ALL BUG FIXES VERIFIED! Ready for full test suite.")
135
+ return 0
136
+ else:
137
+ print(f"\n⚠️ {len(failed)} test(s) still failing - investigate before full suite")
138
+ return 1
139
+
140
+
141
+ if __name__ == "__main__":
142
+ sys.exit(main())
backend/tasks.py ADDED
@@ -0,0 +1,205 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """Celery tasks for background job processing."""
2
+ from celery import Task
3
+ from celery_app import celery_app
4
+ from pipeline import TranscriptionPipeline, run_transcription_pipeline
5
+ import redis
6
+ import json
7
+ from datetime import datetime
8
+ from pathlib import Path
9
+ from config import settings
10
+ import shutil
11
+
12
+ # Redis client
13
+ redis_client = redis.Redis.from_url(settings.redis_url, decode_responses=True)
14
+
15
+
16
+ class TranscriptionTask(Task):
17
+ """Base task with progress tracking."""
18
+
19
+ def update_progress(self, job_id: str, progress: int, stage: str, message: str):
20
+ """
21
+ Update job progress in Redis and publish to WebSocket subscribers.
22
+
23
+ Args:
24
+ job_id: Job identifier
25
+ progress: Progress percentage (0-100)
26
+ stage: Current stage name
27
+ message: Status message
28
+ """
29
+ job_key = f"job:{job_id}"
30
+
31
+ # Update Redis hash
32
+ redis_client.hset(job_key, mapping={
33
+ "progress": progress,
34
+ "current_stage": stage,
35
+ "status_message": message,
36
+ "updated_at": datetime.utcnow().isoformat(),
37
+ })
38
+
39
+ # Publish to pub/sub for WebSocket clients
40
+ update = {
41
+ "type": "progress",
42
+ "job_id": job_id,
43
+ "progress": progress,
44
+ "stage": stage,
45
+ "message": message,
46
+ "timestamp": datetime.utcnow().isoformat(),
47
+ }
48
+ redis_client.publish(f"job:{job_id}:updates", json.dumps(update))
49
+
50
+
51
+ @celery_app.task(base=TranscriptionTask, bind=True)
52
+ def process_transcription_task(self, job_id: str):
53
+ """
54
+ Main transcription task.
55
+
56
+ Args:
57
+ job_id: Unique job identifier
58
+
59
+ Returns:
60
+ Path to generated MusicXML file
61
+ """
62
+ try:
63
+ # Mark job as started
64
+ redis_client.hset(f"job:{job_id}", mapping={
65
+ "status": "processing",
66
+ "started_at": datetime.utcnow().isoformat(),
67
+ })
68
+
69
+ # Get job data
70
+ job_data = redis_client.hgetall(f"job:{job_id}")
71
+
72
+ if not job_data:
73
+ raise ValueError(f"Job not found: {job_id}")
74
+
75
+ youtube_url = job_data.get('youtube_url')
76
+ if not youtube_url:
77
+ raise ValueError(f"Job missing youtube_url: {job_id}")
78
+
79
+ # Initialize pipeline
80
+ pipeline = TranscriptionPipeline(
81
+ job_id=job_id,
82
+ youtube_url=youtube_url,
83
+ storage_path=settings.storage_path
84
+ )
85
+ pipeline.set_progress_callback(lambda p, s, m: self.update_progress(job_id, p, s, m))
86
+
87
+ # Run pipeline
88
+ temp_output_path = pipeline.run()
89
+
90
+ # Output is already in the temp directory, move to persistent storage
91
+ output_path = settings.outputs_path / f"{job_id}.musicxml"
92
+ midi_path = settings.outputs_path / f"{job_id}.mid"
93
+
94
+ # Ensure outputs directory exists
95
+ settings.outputs_path.mkdir(parents=True, exist_ok=True)
96
+
97
+ # Copy the MusicXML file to outputs
98
+ shutil.copy(str(temp_output_path), str(output_path))
99
+
100
+ # Copy the cleaned MIDI file to outputs
101
+ temp_midi_path = pipeline.temp_dir / "piano_clean.mid"
102
+ if temp_midi_path.exists():
103
+ shutil.copy(str(temp_midi_path), str(midi_path))
104
+
105
+ # Cleanup temp files (pipeline has its own cleanup method)
106
+ pipeline.cleanup()
107
+
108
+ # Mark job as completed
109
+ redis_client.hset(f"job:{job_id}", mapping={
110
+ "status": "completed",
111
+ "progress": 100,
112
+ "output_path": str(output_path),
113
+ "midi_path": str(midi_path) if temp_midi_path.exists() else "",
114
+ "completed_at": datetime.utcnow().isoformat(),
115
+ })
116
+
117
+ # Publish completion message
118
+ completion_msg = {
119
+ "type": "completed",
120
+ "job_id": job_id,
121
+ "result_url": f"/api/v1/scores/{job_id}",
122
+ "timestamp": datetime.utcnow().isoformat(),
123
+ }
124
+ redis_client.publish(f"job:{job_id}:updates", json.dumps(completion_msg))
125
+
126
+ return str(output_path)
127
+
128
+ except Exception as e:
129
+ # Mark job as failed
130
+ redis_client.hset(f"job:{job_id}", mapping={
131
+ "status": "failed",
132
+ "error": json.dumps({
133
+ "message": str(e),
134
+ "retryable": self.request.retries < self.max_retries,
135
+ }),
136
+ "failed_at": datetime.utcnow().isoformat(),
137
+ })
138
+
139
+ # Publish error message
140
+ error_msg = {
141
+ "type": "error",
142
+ "job_id": job_id,
143
+ "error": {
144
+ "message": str(e),
145
+ "retryable": self.request.retries < self.max_retries,
146
+ },
147
+ "timestamp": datetime.utcnow().isoformat(),
148
+ }
149
+ redis_client.publish(f"job:{job_id}:updates", json.dumps(error_msg))
150
+
151
+ # Retry if retryable
152
+ if self.request.retries < self.max_retries:
153
+ raise self.retry(exc=e, countdown=2 ** self.request.retries)
154
+ else:
155
+ raise
156
+
157
+
158
+ # === Module-level helper functions for backward compatibility ===
159
+
160
+ def update_progress(job_id: str, progress: int, stage: str, message: str):
161
+ """
162
+ Update job progress in Redis and publish to WebSocket subscribers.
163
+
164
+ Args:
165
+ job_id: Job identifier
166
+ progress: Progress percentage (0-100)
167
+ stage: Current stage name
168
+ message: Status message
169
+ """
170
+ job_key = f"job:{job_id}"
171
+
172
+ # Update Redis hash
173
+ redis_client.hset(job_key, mapping={
174
+ "progress": progress,
175
+ "current_stage": stage,
176
+ "status_message": message,
177
+ "updated_at": datetime.utcnow().isoformat(),
178
+ })
179
+
180
+ # Publish to pub/sub for WebSocket clients
181
+ update = {
182
+ "type": "progress",
183
+ "job_id": job_id,
184
+ "progress": progress,
185
+ "stage": stage,
186
+ "message": message,
187
+ "timestamp": datetime.utcnow().isoformat(),
188
+ }
189
+ redis_client.publish(f"job:{job_id}:updates", json.dumps(update))
190
+
191
+
192
+ def cleanup_temp_files(job_id: str, storage_path: Path = None):
193
+ """
194
+ Clean up temporary files for a job.
195
+
196
+ Args:
197
+ job_id: Job identifier
198
+ storage_path: Path to storage directory (uses settings if not provided)
199
+ """
200
+ if storage_path is None:
201
+ storage_path = settings.storage_path
202
+
203
+ temp_dir = storage_path / "temp" / job_id
204
+ if temp_dir.exists():
205
+ shutil.rmtree(temp_dir, ignore_errors=True)
backend/tests/__init__.py ADDED
@@ -0,0 +1 @@
 
 
1
+ """Test suite for Rescored backend."""
backend/tests/conftest.py ADDED
@@ -0,0 +1,169 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """Pytest configuration and fixtures for backend tests."""
2
+ import pytest
3
+ from pathlib import Path
4
+ import tempfile
5
+ import shutil
6
+ from fastapi.testclient import TestClient
7
+ from redis import Redis
8
+ from unittest.mock import MagicMock, patch
9
+ import uuid
10
+
11
+
12
+ @pytest.fixture
13
+ def temp_storage_dir():
14
+ """Create temporary storage directory for tests."""
15
+ temp_dir = tempfile.mkdtemp()
16
+ yield Path(temp_dir)
17
+ shutil.rmtree(temp_dir, ignore_errors=True)
18
+
19
+
20
+ @pytest.fixture
21
+ def mock_redis():
22
+ """Mock Redis client for testing."""
23
+ redis_mock = MagicMock(spec=Redis)
24
+ redis_mock.ping.return_value = True
25
+ redis_mock.hgetall.return_value = {}
26
+ redis_mock.hset.return_value = True
27
+ redis_mock.pubsub.return_value.subscribe.return_value = None
28
+ return redis_mock
29
+
30
+
31
+ @pytest.fixture
32
+ def test_client(mock_redis, temp_storage_dir):
33
+ """Create FastAPI test client with mocked dependencies."""
34
+ with patch('main.redis_client', mock_redis):
35
+ with patch('config.settings.storage_path', temp_storage_dir):
36
+ from main import app
37
+ client = TestClient(app)
38
+ yield client
39
+
40
+
41
+ @pytest.fixture
42
+ def sample_job_id():
43
+ """Generate a sample job ID for testing."""
44
+ return str(uuid.uuid4())
45
+
46
+
47
+ @pytest.fixture
48
+ def sample_job_data(sample_job_id):
49
+ """Sample job data for testing."""
50
+ return {
51
+ "job_id": sample_job_id,
52
+ "status": "queued",
53
+ "youtube_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
54
+ "video_id": "dQw4w9WgXcQ",
55
+ "options": '{"instruments": ["piano"]}',
56
+ "created_at": "2025-01-01T00:00:00",
57
+ "progress": 0,
58
+ "current_stage": "queued",
59
+ "status_message": "Job queued for processing",
60
+ }
61
+
62
+
63
+ @pytest.fixture
64
+ def sample_youtube_urls():
65
+ """Collection of sample YouTube URLs for testing."""
66
+ return {
67
+ "valid": [
68
+ "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
69
+ "https://youtu.be/dQw4w9WgXcQ",
70
+ "https://m.youtube.com/watch?v=dQw4w9WgXcQ",
71
+ "https://www.youtube.com/embed/dQw4w9WgXcQ",
72
+ ],
73
+ "invalid": [
74
+ "https://example.com/video",
75
+ "not-a-url",
76
+ "https://vimeo.com/12345",
77
+ "https://youtube.com/invalid",
78
+ ]
79
+ }
80
+
81
+
82
+ @pytest.fixture
83
+ def mock_yt_dlp_info():
84
+ """Mock yt-dlp video info."""
85
+ return {
86
+ 'id': 'dQw4w9WgXcQ',
87
+ 'title': 'Test Video',
88
+ 'duration': 180, # 3 minutes
89
+ 'age_limit': 0,
90
+ 'formats': [
91
+ {'format_id': '140', 'ext': 'wav', 'abr': 128}
92
+ ]
93
+ }
94
+
95
+
96
+ @pytest.fixture
97
+ def sample_audio_file(temp_storage_dir):
98
+ """Create a sample WAV file for testing."""
99
+ import numpy as np
100
+ import soundfile as sf
101
+
102
+ # Generate 1 second of silence at 44.1kHz
103
+ sample_rate = 44100
104
+ duration = 1.0
105
+ samples = np.zeros(int(sample_rate * duration), dtype=np.float32)
106
+
107
+ audio_path = temp_storage_dir / "test_audio.wav"
108
+ sf.write(str(audio_path), samples, sample_rate)
109
+
110
+ return audio_path
111
+
112
+
113
+ @pytest.fixture
114
+ def sample_midi_file(temp_storage_dir):
115
+ """Create a sample MIDI file for testing."""
116
+ import mido
117
+
118
+ mid = mido.MidiFile()
119
+ track = mido.MidiTrack()
120
+ mid.tracks.append(track)
121
+
122
+ # Add some notes (middle C for 1 beat)
123
+ track.append(mido.Message('note_on', note=60, velocity=64, time=0))
124
+ track.append(mido.Message('note_off', note=60, velocity=64, time=480))
125
+
126
+ midi_path = temp_storage_dir / "test_midi.mid"
127
+ mid.save(str(midi_path))
128
+
129
+ return midi_path
130
+
131
+
132
+ @pytest.fixture
133
+ def sample_musicxml_content():
134
+ """Sample MusicXML content for testing."""
135
+ return '''<?xml version="1.0" encoding="UTF-8"?>
136
+ <!DOCTYPE score-partwise PUBLIC "-//Recordare//DTD MusicXML 3.1 Partwise//EN" "http://www.musicxml.org/dtds/partwise.dtd">
137
+ <score-partwise version="3.1">
138
+ <part-list>
139
+ <score-part id="P1">
140
+ <part-name>Piano</part-name>
141
+ </score-part>
142
+ </part-list>
143
+ <part id="P1">
144
+ <measure number="1">
145
+ <attributes>
146
+ <divisions>1</divisions>
147
+ <key>
148
+ <fifths>0</fifths>
149
+ </key>
150
+ <time>
151
+ <beats>4</beats>
152
+ <beat-type>4</beat-type>
153
+ </time>
154
+ <clef>
155
+ <sign>G</sign>
156
+ <line>2</line>
157
+ </clef>
158
+ </attributes>
159
+ <note>
160
+ <pitch>
161
+ <step>C</step>
162
+ <octave>4</octave>
163
+ </pitch>
164
+ <duration>4</duration>
165
+ <type>whole</type>
166
+ </note>
167
+ </measure>
168
+ </part>
169
+ </score-partwise>'''
backend/tests/test_api.py ADDED
@@ -0,0 +1,369 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """Integration tests for FastAPI endpoints."""
2
+ import pytest
3
+ from unittest.mock import patch, MagicMock
4
+ import json
5
+
6
+
7
class TestRootEndpoint:
    """Tests for the API root ("/") endpoint."""

    def test_root(self, test_client):
        """GET / advertises the API name, version and docs location."""
        response = test_client.get("/")
        assert response.status_code == 200

        body = response.json()
        expected = {"name": "Rescored API", "version": "1.0.0", "docs": "/docs"}
        for key, value in expected.items():
            assert body[key] == value
18
+
19
+
20
class TestHealthCheck:
    """Tests for the /health endpoint."""

    def test_health_check_healthy(self, test_client, mock_redis):
        """When Redis pings successfully, both statuses report healthy."""
        mock_redis.ping.return_value = True

        response = test_client.get("/health")

        assert response.status_code == 200
        body = response.json()
        assert (body["status"], body["redis"]) == ("healthy", "healthy")

    def test_health_check_redis_down(self, test_client, mock_redis):
        """A Redis failure degrades the service instead of erroring out."""
        mock_redis.ping.side_effect = Exception("Connection failed")

        response = test_client.get("/health")

        assert response.status_code == 200
        body = response.json()
        assert (body["status"], body["redis"]) == ("degraded", "unhealthy")
42
+
43
+
44
class TestTranscribeEndpoint:
    """Test transcription submission endpoint (POST /api/v1/transcribe)."""

    # NOTE(review): this class mixes patch targets — most tests patch
    # 'utils.<name>' while test_submit_unavailable_video patches
    # 'main.<name>'. Which target actually intercepts the call depends on
    # how main.py imports these helpers (module attribute vs from-import);
    # confirm against main.py and unify.
    # Stacked @patch decorators apply bottom-up: the bottom decorator
    # supplies the first injected mock argument.
    @patch('main.process_transcription_task')
    @patch('utils.check_video_availability')
    @patch('utils.validate_youtube_url')
    def test_submit_valid_transcription(
        self,
        mock_validate,
        mock_check_availability,
        mock_task,
        test_client,
        mock_redis
    ):
        """Test submitting valid transcription request."""
        mock_validate.return_value = (True, "dQw4w9WgXcQ")
        mock_check_availability.return_value = {
            'available': True,
            'info': {'duration': 180}
        }
        mock_task.delay.return_value = MagicMock(id="task-id")

        response = test_client.post(
            "/api/v1/transcribe",
            json={"youtube_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ"}
        )

        assert response.status_code == 201
        data = response.json()
        assert "job_id" in data
        assert data["status"] == "queued"
        assert "websocket_url" in data
        # Fixed estimate asserted here regardless of the mocked duration.
        assert data["estimated_duration_seconds"] == 120

        # Verify Redis was called to store job
        assert mock_redis.hset.called

        # Verify Celery task was queued
        assert mock_task.delay.called

    @patch('utils.validate_youtube_url')
    def test_submit_invalid_url(self, mock_validate, test_client):
        """Test submitting invalid YouTube URL."""
        mock_validate.return_value = (False, "Invalid YouTube URL format")

        response = test_client.post(
            "/api/v1/transcribe",
            json={"youtube_url": "https://invalid.com/video"}
        )

        assert response.status_code == 400
        assert "Invalid YouTube URL format" in response.json()["detail"]

    # NOTE(review): patches 'main.*' here, unlike the sibling tests above.
    @patch('main.validate_youtube_url')
    @patch('main.check_video_availability')
    def test_submit_unavailable_video(
        self,
        mock_check_availability,
        mock_validate,
        test_client
    ):
        """Test submitting unavailable video."""
        mock_validate.return_value = (True, "dQw4w9WgXcQ")
        mock_check_availability.return_value = {
            'available': False,
            'reason': 'Video too long (max 15 minutes)'
        }

        response = test_client.post(
            "/api/v1/transcribe",
            json={"youtube_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ"}
        )

        assert response.status_code == 422
        assert "too long" in response.json()["detail"]

    @patch('utils.validate_youtube_url')
    @patch('utils.check_video_availability')
    def test_submit_with_options(
        self,
        mock_check_availability,
        mock_validate,
        test_client,
        mock_redis
    ):
        """Test submitting transcription with custom options."""
        mock_validate.return_value = (True, "dQw4w9WgXcQ")
        mock_check_availability.return_value = {'available': True, 'info': {}}

        # Context-manager patch keeps the task mock scoped to this request.
        with patch('main.process_transcription_task') as mock_task:
            response = test_client.post(
                "/api/v1/transcribe",
                json={
                    "youtube_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
                    "options": {"instruments": ["piano", "guitar"]}
                }
            )

            assert response.status_code == 201
143
+
144
+
145
class TestRateLimiting:
    """Test rate limiting middleware."""

    # NOTE(review): these tests patch 'utils.*' for validation but 'main.*'
    # for the task — whether 'utils.*' patches take effect depends on how
    # main.py imports the helpers; confirm and unify with
    # TestTranscribeEndpoint. Decorators apply bottom-up, so
    # 'main.process_transcription_task' (bottom) maps to mock_task (first).
    @patch('utils.validate_youtube_url')
    @patch('utils.check_video_availability')
    @patch('main.process_transcription_task')
    def test_rate_limit_enforced(
        self,
        mock_task,
        mock_check_availability,
        mock_validate,
        test_client,
        mock_redis
    ):
        """Test that rate limit is enforced after 10 requests."""
        mock_validate.return_value = (True, "dQw4w9WgXcQ")
        mock_check_availability.return_value = {'available': True, 'info': {}}
        mock_task.delay.return_value = MagicMock(id="task-id")

        # Mock Redis counter for rate limiting.
        # The counter is returned as a string, matching Redis GET semantics.
        mock_redis.get.return_value = "10"  # Already at limit

        response = test_client.post(
            "/api/v1/transcribe",
            json={"youtube_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ"}
        )

        assert response.status_code == 429
        assert "Rate limit exceeded" in response.json()["detail"]

    @patch('utils.validate_youtube_url')
    @patch('utils.check_video_availability')
    @patch('main.process_transcription_task')
    def test_rate_limit_under_limit(
        self,
        mock_task,
        mock_check_availability,
        mock_validate,
        test_client,
        mock_redis
    ):
        """Test that requests under limit succeed."""
        mock_validate.return_value = (True, "dQw4w9WgXcQ")
        mock_check_availability.return_value = {'available': True, 'info': {}}
        mock_task.delay.return_value = MagicMock(id="task-id")

        # Mock Redis counter for rate limiting (under limit)
        mock_redis.get.return_value = "5"  # 5 out of 10

        response = test_client.post(
            "/api/v1/transcribe",
            json={"youtube_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ"}
        )

        assert response.status_code == 201  # Request succeeds
        # The endpoint increments the counter via a Redis pipeline.
        assert mock_redis.pipeline.called  # Counter incremented
201
+
202
+
203
class TestJobStatusEndpoint:
    """Tests for GET /api/v1/jobs/{job_id}."""

    def test_get_existing_job_status(self, test_client, mock_redis, sample_job_data):
        """A known job id reports queued status with zero progress."""
        mock_redis.hgetall.return_value = sample_job_data

        res = test_client.get(f"/api/v1/jobs/{sample_job_data['job_id']}")

        assert res.status_code == 200
        payload = res.json()
        assert payload["job_id"] == sample_job_data["job_id"]
        assert payload["status"] == "queued"
        assert payload["progress"] == 0
        assert payload["current_stage"] == "queued"

    def test_get_nonexistent_job(self, test_client, mock_redis):
        """An unknown job id yields 404."""
        mock_redis.hgetall.return_value = {}

        res = test_client.get("/api/v1/jobs/nonexistent-id")

        assert res.status_code == 404
        assert "not found" in res.json()["detail"]

    def test_get_completed_job_status(self, test_client, mock_redis, sample_job_data):
        """A completed job reports 100% progress and a result URL."""
        mock_redis.hgetall.return_value = dict(
            sample_job_data, status="completed", progress=100
        )

        res = test_client.get(f"/api/v1/jobs/{sample_job_data['job_id']}")

        assert res.status_code == 200
        payload = res.json()
        assert payload["status"] == "completed"
        assert payload["progress"] == 100
        assert payload["result_url"] is not None

    def test_get_failed_job_status(self, test_client, mock_redis, sample_job_data):
        """A failed job surfaces its JSON-encoded error details."""
        error_data = {"message": "Transcription failed", "stage": "audio_download"}
        mock_redis.hgetall.return_value = dict(
            sample_job_data, status="failed", error=json.dumps(error_data)
        )

        res = test_client.get(f"/api/v1/jobs/{sample_job_data['job_id']}")

        assert res.status_code == 200
        payload = res.json()
        assert payload["status"] == "failed"
        assert payload["error"] is not None
        assert payload["error"]["message"] == "Transcription failed"
258
+
259
+
260
class TestScoreDownloadEndpoint:
    """Tests for the MusicXML score download endpoint."""

    def test_download_completed_score(
        self,
        test_client,
        mock_redis,
        sample_job_data,
        temp_storage_dir,
        sample_musicxml_content
    ):
        """A completed job streams its MusicXML file with the MusicXML MIME type."""
        # Write a real score file so the endpoint has something to serve.
        score_path = temp_storage_dir / "score.musicxml"
        score_path.write_text(sample_musicxml_content)

        mock_redis.hgetall.return_value = dict(
            sample_job_data, status="completed", output_path=str(score_path)
        )

        res = test_client.get(f"/api/v1/scores/{sample_job_data['job_id']}")

        assert res.status_code == 200
        assert res.headers["content-type"] == "application/vnd.recordare.musicxml+xml"
        assert "score-partwise" in res.text

    def test_download_nonexistent_job(self, test_client, mock_redis):
        """An unknown job id yields 404."""
        mock_redis.hgetall.return_value = {}

        res = test_client.get("/api/v1/scores/nonexistent-id")

        assert res.status_code == 404

    def test_download_incomplete_job(self, test_client, mock_redis, sample_job_data):
        """A job that is still queued has no downloadable score yet."""
        mock_redis.hgetall.return_value = sample_job_data  # still queued

        res = test_client.get(f"/api/v1/scores/{sample_job_data['job_id']}")

        assert res.status_code == 404
        assert "not available" in res.json()["detail"]

    def test_download_missing_file(self, test_client, mock_redis, sample_job_data):
        """A completed job whose file vanished from disk yields 404."""
        mock_redis.hgetall.return_value = dict(
            sample_job_data,
            status="completed",
            output_path="/nonexistent/path/score.musicxml",
        )

        res = test_client.get(f"/api/v1/scores/{sample_job_data['job_id']}")

        assert res.status_code == 404
        assert "not found" in res.json()["detail"]
319
+
320
+
321
class TestMIDIDownloadEndpoint:
    """Tests for the MIDI download endpoint."""

    def test_download_completed_midi(self, test_client, sample_job_id, tmp_path, mock_redis):
        """A completed job streams its MIDI bytes with the audio/midi MIME type."""
        midi_file = tmp_path / "test.mid"
        midi_file.write_bytes(b"MIDI_DATA")

        mock_redis.hgetall.return_value = {
            "status": "completed",
            "midi_path": str(midi_file),
        }

        res = test_client.get(f"/api/v1/scores/{sample_job_id}/midi")

        assert res.status_code == 200
        assert res.headers["content-type"] == "audio/midi"
        assert res.content == b"MIDI_DATA"

    def test_download_nonexistent_job_midi(self, test_client, mock_redis):
        """Unknown job id yields 404 with a 'not available' message."""
        mock_redis.hgetall.return_value = {}

        res = test_client.get("/api/v1/scores/nonexistent/midi")

        assert res.status_code == 404
        assert "not available" in res.json()["detail"]

    def test_download_incomplete_job_midi(self, test_client, sample_job_id, mock_redis):
        """A still-processing job has no MIDI to download."""
        mock_redis.hgetall.return_value = {"status": "processing"}

        res = test_client.get(f"/api/v1/scores/{sample_job_id}/midi")

        assert res.status_code == 404

    def test_download_missing_midi_file(self, test_client, sample_job_id, mock_redis):
        """A completed job whose MIDI file is gone from disk yields 404."""
        mock_redis.hgetall.return_value = {
            "status": "completed",
            "midi_path": "/nonexistent/path.mid",
        }

        res = test_client.get(f"/api/v1/scores/{sample_job_id}/midi")

        assert res.status_code == 404
        assert "file not found" in res.json()["detail"].lower()
backend/tests/test_pipeline.py ADDED
@@ -0,0 +1,102 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """Unit tests for audio processing pipeline - simplified version."""
2
+ import pytest
3
+ from pathlib import Path
4
+
5
+
6
class TestPipelineImports:
    """Test that pipeline functions can be imported and are callable.

    The eleven near-identical per-function tests are collapsed into one
    parametrized test; pytest still reports a separate case per name.
    """

    # Public pipeline entry points that must be importable and callable.
    PIPELINE_FUNCTIONS = [
        "download_audio",
        "separate_sources",
        "transcribe_audio",
        "quantize_midi",
        "remove_duplicate_notes",
        "remove_short_notes",
        "generate_musicxml",
        "detect_key_signature",
        "detect_time_signature",
        "detect_tempo",
        "run_transcription_pipeline",
    ]

    @pytest.mark.parametrize("func_name", PIPELINE_FUNCTIONS)
    def test_function_callable(self, func_name):
        """Each public pipeline function exists and is callable."""
        import pipeline

        # getattr (vs from-import) keeps a single code path for all names;
        # a missing name fails the test with AttributeError.
        assert callable(getattr(pipeline, func_name))
63
+
64
+
65
class TestTranscriptionPipelineClass:
    """Tests for the TranscriptionPipeline class surface."""

    @staticmethod
    def _make_pipeline():
        """Build a pipeline with fixed test arguments."""
        from pipeline import TranscriptionPipeline
        return TranscriptionPipeline("test_job", "http://example.com", Path("/tmp"))

    def test_pipeline_class_exists(self):
        """Constructor stores job id, URL and storage path."""
        p = self._make_pipeline()
        assert p.job_id == "test_job"
        assert p.youtube_url == "http://example.com"
        assert isinstance(p.storage_path, Path)

    def test_pipeline_has_progress_callback(self):
        """set_progress_callback is exposed and callable."""
        p = self._make_pipeline()
        assert hasattr(p, 'set_progress_callback') and callable(p.set_progress_callback)

    def test_pipeline_has_required_methods(self):
        """All pipeline stage methods exist and are callable."""
        p = self._make_pipeline()
        for name in (
            'download_audio',
            'separate_sources',
            'transcribe_to_midi',
            'clean_midi',
            'generate_musicxml',
            'cleanup',
        ):
            assert hasattr(p, name) and callable(getattr(p, name))
backend/tests/test_tasks.py ADDED
@@ -0,0 +1,243 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """Tests for Celery tasks."""
2
+ import pytest
3
+ from unittest.mock import patch, MagicMock, call
4
+ import json
5
+
6
+
7
class TestProcessTranscriptionTask:
    """Test the main Celery transcription task.

    All collaborators (Redis client, TranscriptionPipeline, shutil.copy)
    are patched at the 'tasks' module level. Stacked @patch decorators
    apply bottom-up, so 'tasks.redis_client' (bottom) maps to the first
    injected mock argument.
    """

    @patch('tasks.shutil.copy')
    @patch('tasks.TranscriptionPipeline')
    @patch('tasks.redis_client')
    def test_task_success(self, mock_redis, mock_pipeline_class, mock_copy, sample_job_id, temp_storage_dir):
        """Test successful task execution."""
        from tasks import process_transcription_task

        # Mock job data in Redis
        job_data = {
            'job_id': sample_job_id,
            'youtube_url': 'https://www.youtube.com/watch?v=dQw4w9WgXcQ',
            'video_id': 'dQw4w9WgXcQ',
            'options': '{"instruments": ["piano"]}'
        }
        mock_redis.hgetall.return_value = job_data

        # Mock successful pipeline instance
        mock_pipeline = MagicMock()
        mock_pipeline.run.return_value = str(temp_storage_dir / "output.musicxml")
        mock_pipeline_class.return_value = mock_pipeline

        # Execute task
        process_transcription_task(sample_job_id)

        # Verify pipeline ran
        mock_pipeline.run.assert_called_once()

        # Verify progress updates were published
        assert mock_redis.publish.call_count > 0

        # Verify final status was set to completed
        # NOTE(review): the loop variable `call` shadows the `call` helper
        # imported from unittest.mock at module top; harmless here but
        # worth renaming for clarity.
        completed_calls = [
            call for call in mock_redis.hset.call_args_list
            if 'completed' in str(call)
        ]
        assert len(completed_calls) > 0

    @patch('tasks.shutil.copy')
    @patch('tasks.TranscriptionPipeline')
    @patch('tasks.redis_client')
    def test_task_failure(self, mock_redis, mock_pipeline_class, mock_copy, sample_job_id):
        """Test task execution with pipeline failure."""
        from tasks import process_transcription_task
        from celery.exceptions import Retry

        job_data = {
            'job_id': sample_job_id,
            'youtube_url': 'https://www.youtube.com/watch?v=invalid',
            'video_id': 'invalid',
            'options': '{}'
        }
        mock_redis.hgetall.return_value = job_data

        # Mock failed pipeline
        mock_pipeline = MagicMock()
        mock_pipeline.run.side_effect = RuntimeError("Download failed")
        mock_pipeline_class.return_value = mock_pipeline

        # Execute task - should raise Retry due to Celery's retry mechanism.
        # Either exception type is accepted since retry behavior depends on
        # the task's max_retries configuration.
        with pytest.raises((Retry, RuntimeError)):
            process_transcription_task(sample_job_id)

        # Verify error was stored in Redis before retry
        error_calls = [
            call for call in mock_redis.hset.call_args_list
            if 'error' in str(call)
        ]
        assert len(error_calls) > 0

    @patch('tasks.shutil.copy')
    @patch('tasks.TranscriptionPipeline')
    @patch('tasks.redis_client')
    def test_task_progress_updates(self, mock_redis, mock_pipeline_class, mock_copy, sample_job_id, temp_storage_dir):
        """Test that task publishes progress updates."""
        from tasks import process_transcription_task

        job_data = {
            'job_id': sample_job_id,
            'youtube_url': 'https://www.youtube.com/watch?v=dQw4w9WgXcQ',
            'video_id': 'dQw4w9WgXcQ',
            'options': '{}'
        }
        mock_redis.hgetall.return_value = job_data

        mock_pipeline = MagicMock()
        mock_pipeline.run.return_value = str(temp_storage_dir / "output.musicxml")
        mock_pipeline_class.return_value = mock_pipeline

        process_transcription_task(sample_job_id)

        # Verify completion message was published
        publish_calls = mock_redis.publish.call_args_list
        assert len(publish_calls) >= 1  # At least completion message

        # Verify final publish call contains completion info
        # (publishes go to the per-job pub/sub channel consumed by the
        # WebSocket layer).
        final_call = publish_calls[-1]
        channel, message = final_call[0]
        assert channel == f"job:{sample_job_id}:updates"
        update_data = json.loads(message)
        assert 'type' in update_data
        assert update_data['type'] == 'completed'

    @patch('tasks.redis_client')
    def test_task_job_not_found(self, mock_redis, sample_job_id):
        """Test task execution when job doesn't exist."""
        from tasks import process_transcription_task

        # Empty hash from Redis means the job was never registered.
        mock_redis.hgetall.return_value = {}

        with pytest.raises(ValueError) as exc_info:
            process_transcription_task(sample_job_id)

        assert "Job not found" in str(exc_info.value)

    @patch('tasks.shutil.copy')
    @patch('tasks.TranscriptionPipeline')
    @patch('tasks.redis_client')
    def test_task_retry_on_network_error(self, mock_redis, mock_pipeline_class, mock_copy, sample_job_id):
        """Test task retry logic for transient errors."""
        from tasks import process_transcription_task
        from celery.exceptions import Retry

        job_data = {
            'job_id': sample_job_id,
            'youtube_url': 'https://www.youtube.com/watch?v=dQw4w9WgXcQ',
            'video_id': 'dQw4w9WgXcQ',
            'options': '{}'
        }
        mock_redis.hgetall.return_value = job_data

        # Mock transient network error
        mock_pipeline = MagicMock()
        mock_pipeline.run.side_effect = ConnectionError("Network timeout")
        mock_pipeline_class.return_value = mock_pipeline

        with pytest.raises((Retry, ConnectionError)):
            process_transcription_task(sample_job_id)
147
+
148
+
149
class TestProgressCallback:
    """Tests for the update_progress helper."""

    @patch('tasks.redis_client')
    def test_update_progress(self, mock_redis, sample_job_id):
        """One update writes the job hash and publishes a pub/sub message."""
        from tasks import update_progress

        update_progress(sample_job_id, 50, "transcription", "Transcribing audio...")

        # Job hash was written under the per-job key.
        mock_redis.hset.assert_called()
        assert mock_redis.hset.call_args[0][0] == f"job:{sample_job_id}"

        # Update was published to the per-job channel.
        mock_redis.publish.assert_called()
        channel, raw_message = mock_redis.publish.call_args[0]
        assert channel == f"job:{sample_job_id}:updates"

        payload = json.loads(raw_message)
        assert (payload['progress'], payload['stage'], payload['message']) == (
            50, "transcription", "Transcribing audio...")

    @patch('tasks.redis_client')
    def test_multiple_progress_updates(self, mock_redis, sample_job_id):
        """Each update in a sequence produces one hset and one publish."""
        from tasks import update_progress

        updates = [
            (5, "download", "Downloading audio"),
            (25, "separation", "Separating audio sources"),
            (60, "transcription", "Transcribing to MIDI"),
            (90, "musicxml", "Generating MusicXML"),
            (100, "completed", "Processing complete"),
        ]
        for args in updates:
            update_progress(sample_job_id, *args)

        # Should have 5 updates
        assert mock_redis.hset.call_count == 5
        assert mock_redis.publish.call_count == 5
193
+
194
+
195
class TestCleanup:
    """Tests for temporary-file cleanup."""

    @patch('tasks.shutil.rmtree')
    def test_cleanup_temp_files(self, mock_rmtree, sample_job_id, temp_storage_dir):
        """cleanup_temp_files removes the job's temp dir via shutil.rmtree."""
        from tasks import cleanup_temp_files

        # The directory must exist for cleanup to attempt removal.
        (temp_storage_dir / "temp" / sample_job_id).mkdir(parents=True, exist_ok=True)

        cleanup_temp_files(sample_job_id, storage_path=temp_storage_dir)

        mock_rmtree.assert_called()

    def test_cleanup_preserves_output(self, sample_job_id, temp_storage_dir):
        """Cleanup deletes temp artifacts but leaves final outputs untouched."""
        from tasks import cleanup_temp_files

        # Populate the job's temp directory with throwaway files.
        temp_dir = temp_storage_dir / "temp" / sample_job_id
        temp_dir.mkdir(parents=True, exist_ok=True)
        for name in ("temp_audio.wav", "temp_midi.mid"):
            (temp_dir / name).touch()

        # Create final output files that cleanup must not remove.
        outputs_dir = temp_storage_dir / "outputs"
        outputs_dir.mkdir(parents=True, exist_ok=True)
        output_files = [outputs_dir / "output.musicxml", outputs_dir / "output.mid"]
        for output in output_files:
            output.touch()

        cleanup_temp_files(sample_job_id, storage_path=temp_storage_dir)

        # Temp directory gone, outputs intact.
        assert not temp_dir.exists()
        assert all(output.exists() for output in output_files)
backend/tests/test_utils.py ADDED
@@ -0,0 +1,147 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """Unit tests for utility functions."""
2
+ import pytest
3
+ from utils import validate_youtube_url, check_video_availability
4
+ from unittest.mock import patch, MagicMock
5
+ import yt_dlp
6
+
7
+
8
class TestValidateYouTubeURL:
    """Tests for YouTube URL validation."""

    def test_valid_watch_url(self):
        """Standard youtube.com/watch URL."""
        ok, vid = validate_youtube_url("https://www.youtube.com/watch?v=dQw4w9WgXcQ")
        assert ok is True and vid == "dQw4w9WgXcQ"

    def test_valid_short_url(self):
        """youtu.be short URL."""
        ok, vid = validate_youtube_url("https://youtu.be/dQw4w9WgXcQ")
        assert ok is True and vid == "dQw4w9WgXcQ"

    def test_valid_mobile_url(self):
        """Mobile (m.youtube.com) URL."""
        ok, vid = validate_youtube_url("https://m.youtube.com/watch?v=dQw4w9WgXcQ")
        assert ok is True and vid == "dQw4w9WgXcQ"

    def test_valid_embed_url(self):
        """Embedded-player URL."""
        ok, vid = validate_youtube_url("https://www.youtube.com/embed/dQw4w9WgXcQ")
        assert ok is True and vid == "dQw4w9WgXcQ"

    def test_valid_with_extra_params(self):
        """Watch URL carrying additional query parameters."""
        ok, vid = validate_youtube_url("https://www.youtube.com/watch?v=dQw4w9WgXcQ&t=30s")
        assert ok is True and vid == "dQw4w9WgXcQ"

    def test_invalid_domain(self):
        """Non-YouTube host is rejected."""
        ok, err = validate_youtube_url("https://vimeo.com/12345")
        assert ok is False and err == "Invalid YouTube URL format"

    def test_invalid_format(self):
        """Arbitrary non-URL text is rejected."""
        ok, err = validate_youtube_url("not-a-url")
        assert ok is False and err == "Invalid YouTube URL format"

    def test_invalid_video_id_length(self):
        """Video IDs must be exactly 11 characters."""
        ok, err = validate_youtube_url("https://www.youtube.com/watch?v=short")
        assert ok is False and err == "Invalid YouTube URL format"

    def test_empty_url(self):
        """Empty string is rejected."""
        ok, err = validate_youtube_url("")
        assert ok is False and err == "Invalid YouTube URL format"
64
+
65
+
66
class TestCheckVideoAvailability:
    """Tests for video availability checking."""

    @staticmethod
    def _wire_ydl(mock_ydl_class, *, info=None, error=None):
        """Wire a MagicMock into yt_dlp.YoutubeDL's context-manager protocol.

        Returns the inner mock whose extract_info either yields `info` or
        raises `error`.
        """
        ydl = MagicMock()
        if error is not None:
            ydl.extract_info.side_effect = error
        else:
            ydl.extract_info.return_value = info
        mock_ydl_class.return_value.__enter__.return_value = ydl
        return ydl

    @patch('yt_dlp.YoutubeDL')
    def test_available_video(self, mock_ydl_class, mock_yt_dlp_info):
        """A normal video is reported available with its metadata."""
        self._wire_ydl(mock_ydl_class, info=mock_yt_dlp_info)

        result = check_video_availability("dQw4w9WgXcQ")

        assert result['available'] is True
        assert 'info' in result

    @patch('yt_dlp.YoutubeDL')
    def test_video_too_long(self, mock_ydl_class):
        """A video over the duration limit is rejected."""
        self._wire_ydl(mock_ydl_class, info={'duration': 1200, 'age_limit': 0})

        result = check_video_availability("dQw4w9WgXcQ", max_duration=900)

        assert result['available'] is False
        assert 'max 15 minutes' in result['reason']

    @patch('yt_dlp.YoutubeDL')
    def test_age_restricted_video(self, mock_ydl_class):
        """An age-restricted video is rejected."""
        self._wire_ydl(mock_ydl_class, info={'duration': 180, 'age_limit': 18})

        result = check_video_availability("dQw4w9WgXcQ")

        assert result['available'] is False
        assert 'Age-restricted' in result['reason']

    @patch('yt_dlp.YoutubeDL')
    def test_download_error(self, mock_ydl_class):
        """A yt-dlp DownloadError is surfaced as unavailable."""
        self._wire_ydl(
            mock_ydl_class,
            error=yt_dlp.utils.DownloadError("Video unavailable"),
        )

        result = check_video_availability("invalid_id")

        assert result['available'] is False
        assert 'Video unavailable' in result['reason']

    @patch('yt_dlp.YoutubeDL')
    def test_generic_error(self, mock_ydl_class):
        """Unexpected exceptions produce a generic failure reason."""
        self._wire_ydl(mock_ydl_class, error=Exception("Unknown error"))

        result = check_video_availability("dQw4w9WgXcQ")

        assert result['available'] is False
        assert 'Error checking video' in result['reason']

    @patch('yt_dlp.YoutubeDL')
    def test_video_at_max_duration(self, mock_ydl_class):
        """A video exactly at the duration limit is still accepted."""
        self._wire_ydl(mock_ydl_class, info={'duration': 900, 'age_limit': 0})

        result = check_video_availability("dQw4w9WgXcQ", max_duration=900)

        assert result['available'] is True
backend/utils.py ADDED
@@ -0,0 +1,79 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ """Utility functions for Rescored backend."""
2
+ import re
3
+ from urllib.parse import urlparse, parse_qs
4
+ import yt_dlp
5
+
6
+
7
+ def validate_youtube_url(url: str) -> tuple[bool, str | None]:
8
+ """
9
+ Validate YouTube URL and extract video ID.
10
+
11
+ Args:
12
+ url: YouTube URL to validate
13
+
14
+ Returns:
15
+ (is_valid, video_id or error_message)
16
+ """
17
+ # Supported formats:
18
+ # - https://www.youtube.com/watch?v=VIDEO_ID
19
+ # - https://youtu.be/VIDEO_ID
20
+ # - https://m.youtube.com/watch?v=VIDEO_ID
21
+
22
+ patterns = [
23
+ r'(?:youtube\.com/watch\?v=|youtu\.be/)([a-zA-Z0-9_-]{11})',
24
+ r'youtube\.com/embed/([a-zA-Z0-9_-]{11})',
25
+ ]
26
+
27
+ for pattern in patterns:
28
+ match = re.search(pattern, url)
29
+ if match:
30
+ return True, match.group(1)
31
+
32
+ return False, "Invalid YouTube URL format"
33
+
34
+
35
def check_video_availability(video_id: str, max_duration: int = 900) -> dict:
    """
    Check if video is available for download.

    Args:
        video_id: YouTube video ID
        max_duration: Maximum allowed duration in seconds

    Returns:
        Dictionary with 'available' (bool) and 'reason' or 'info'
    """
    ydl_opts = {
        'quiet': True,
        'no_warnings': True,
        'extract_flat': True,
    }

    try:
        with yt_dlp.YoutubeDL(ydl_opts) as ydl:
            info = ydl.extract_info(
                f"https://youtube.com/watch?v={video_id}",
                download=False
            )

        # extract_info can return None for some unavailable videos.
        if info is None:
            return {'available': False, 'reason': 'Video metadata could not be retrieved'}

        # Check duration. The 'duration' key can be present with a None
        # value (e.g. live streams / premieres), so use `or 0` rather than
        # a dict default — otherwise `None > max_duration` raises TypeError.
        duration = info.get('duration') or 0
        if duration > max_duration:
            return {
                'available': False,
                'reason': f'Video too long (max {max_duration // 60} minutes)'
            }

        # Check if age-restricted
        if info.get('age_limit', 0) > 0:
            return {
                'available': False,
                'reason': 'Age-restricted content not supported'
            }

        return {'available': True, 'info': info}

    except yt_dlp.utils.DownloadError as e:
        # yt-dlp's own message (e.g. "Video unavailable") is passed through.
        return {'available': False, 'reason': str(e)}
    except Exception as e:
        # Catch-all so callers always get a dict, never an exception.
        return {'available': False, 'reason': f'Error checking video: {str(e)}'}
docker-compose.yml ADDED
@@ -0,0 +1,79 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
# Compose file for the Rescored stack: Redis broker, FastAPI backend,
# Celery ML worker, and the Vite frontend.
# NOTE: the top-level `version:` key was removed — it is obsolete in the
# Compose Specification and only produces a warning in Compose v2.

services:
  # Redis - Message broker and cache
  redis:
    image: redis:7-alpine
    ports:
      - "6379:6379"
    volumes:
      - redis_data:/data
    healthcheck:
      test: ["CMD", "redis-cli", "ping"]
      interval: 5s
      timeout: 3s
      retries: 5

  # Backend API
  api:
    build:
      context: ./backend
      dockerfile: Dockerfile
    ports:
      - "8000:8000"
    environment:
      - REDIS_URL=redis://redis:6379/0
      - STORAGE_PATH=/app/storage
      - API_HOST=0.0.0.0
      - API_PORT=8000
      - CORS_ORIGINS=http://localhost:5173,http://localhost:3000
    volumes:
      - ./backend:/app
      - ./storage:/app/storage
    depends_on:
      redis:
        condition: service_healthy
    command: uvicorn main:app --host 0.0.0.0 --port 8000 --reload

  # Celery Worker (GPU-enabled for ML processing)
  worker:
    build:
      context: ./backend
      dockerfile: Dockerfile.worker
    environment:
      - REDIS_URL=redis://redis:6379/0
      - STORAGE_PATH=/app/storage
      - GPU_ENABLED=true
    volumes:
      - ./backend:/app
      - ./storage:/app/storage
    depends_on:
      redis:
        condition: service_healthy
    command: celery -A tasks worker --loglevel=info --concurrency=1
    # Uncomment for GPU support (requires NVIDIA Docker runtime)
    # deploy:
    #   resources:
    #     reservations:
    #       devices:
    #         - driver: nvidia
    #           count: 1
    #           capabilities: [gpu]

  # Frontend (React + Vite)
  frontend:
    build:
      context: ./frontend
      dockerfile: Dockerfile
    ports:
      - "5173:5173"
    environment:
      - VITE_API_URL=http://localhost:8000
    volumes:
      - ./frontend:/app
      - /app/node_modules
    command: npm run dev -- --host 0.0.0.0

volumes:
  redis_data:
  # NOTE(review): `storage` is declared but no service references it as a
  # named volume (services bind-mount ./storage instead) — confirm intent
  # before removing.
  storage:
docs/testing/backend-testing.md ADDED
@@ -0,0 +1,520 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Backend Testing Guide
2
+
3
+ Comprehensive guide for testing the Rescored backend.
4
+
5
+ ## Table of Contents
6
+
7
+ - [Setup](#setup)
8
+ - [Running Tests](#running-tests)
9
+ - [Test Structure](#test-structure)
10
+ - [Writing Tests](#writing-tests)
11
+ - [Testing Patterns](#testing-patterns)
12
+ - [Troubleshooting](#troubleshooting)
13
+
14
+ ## Setup
15
+
16
+ ### Install Test Dependencies
17
+
18
+ ```bash
19
+ cd backend
20
+ pip install -r requirements-test.txt
21
+ ```
22
+
23
+ This installs:
24
+ - `pytest`: Test framework
25
+ - `pytest-asyncio`: Async test support
26
+ - `pytest-cov`: Coverage reporting
27
+ - `pytest-mock`: Enhanced mocking
28
+ - `httpx`: HTTP testing client
29
+
30
+ ### Configuration
31
+
32
+ Test configuration is in `pytest.ini`:
33
+
34
+ ```ini
35
+ [pytest]
36
+ testpaths = tests
37
+ markers =
38
+ unit: Unit tests
39
+ integration: Integration tests
40
+ slow: Slow-running tests
41
+ gpu: Tests requiring GPU
42
+ network: Tests requiring network
43
+ ```
44
+
45
+ ## Running Tests
46
+
47
+ ### Basic Commands
48
+
49
+ ```bash
50
+ # Run all tests
51
+ pytest
52
+
53
+ # Run with coverage
54
+ pytest --cov
55
+
56
+ # Run specific file
57
+ pytest tests/test_utils.py
58
+
59
+ # Run specific test
60
+ pytest tests/test_utils.py::TestValidateYouTubeURL::test_valid_watch_url
61
+
62
+ # Run by marker
63
+ pytest -m unit
64
+ pytest -m "unit and not slow"
65
+ ```
66
+
67
+ ### Watch Mode
68
+
69
+ Use `pytest-watch` for continuous testing:
70
+
71
+ ```bash
72
+ pip install pytest-watch
73
+ ptw # Runs tests on file changes
74
+ ```
75
+
76
+ ### Coverage Reports
77
+
78
+ ```bash
79
+ # Terminal report
80
+ pytest --cov --cov-report=term-missing
81
+
82
+ # HTML report
83
+ pytest --cov --cov-report=html
84
+ open htmlcov/index.html
85
+
86
+ # Both
87
+ pytest --cov --cov-report=term-missing --cov-report=html
88
+ ```
89
+
90
+ ## Test Structure
91
+
92
+ ### Test Files
93
+
94
+ Each module has a corresponding test file:
95
+
96
+ - `utils.py` → `tests/test_utils.py`
97
+ - `pipeline.py` → `tests/test_pipeline.py`
98
+ - `main.py` → `tests/test_api.py`
99
+ - `tasks.py` → `tests/test_tasks.py`
100
+
101
+ ### Test Organization
102
+
103
+ Group related tests in classes:
104
+
105
+ ```python
106
+ class TestValidateYouTubeURL:
107
+ """Test YouTube URL validation."""
108
+
109
+ def test_valid_watch_url(self):
110
+ """Test standard youtube.com/watch URL."""
111
+ is_valid, video_id = validate_youtube_url("https://www.youtube.com/watch?v=...")
112
+ assert is_valid is True
113
+ assert video_id == "..."
114
+
115
+ def test_invalid_domain(self):
116
+ """Test URL from wrong domain."""
117
+ is_valid, error = validate_youtube_url("https://vimeo.com/12345")
118
+ assert is_valid is False
119
+ ```
120
+
121
+ ## Writing Tests
122
+
123
+ ### Basic Test Template
124
+
125
+ ```python
126
+ import pytest
127
+ from module_name import function_to_test
128
+
129
+ class TestFunctionName:
130
+ """Test suite for function_name."""
131
+
132
+ def test_happy_path(self):
133
+ """Test normal successful execution."""
134
+ result = function_to_test(valid_input)
135
+ assert result == expected_output
136
+
137
+ def test_edge_case(self):
138
+ """Test boundary condition."""
139
+ result = function_to_test(edge_case_input)
140
+ assert result == expected_edge_output
141
+
142
+ def test_error_handling(self):
143
+ """Test error is raised for invalid input."""
144
+ with pytest.raises(ValueError) as exc_info:
145
+ function_to_test(invalid_input)
146
+ assert "expected error message" in str(exc_info.value)
147
+ ```
148
+
149
+ ### Using Fixtures
150
+
151
+ Fixtures provide reusable test data:
152
+
153
+ ```python
154
+ @pytest.fixture
155
+ def sample_audio_file(temp_storage_dir):
156
+ """Create a sample WAV file for testing."""
157
+ import numpy as np
158
+ import soundfile as sf
159
+
160
+ sample_rate = 44100
161
+ duration = 1.0
162
+ samples = np.zeros(int(sample_rate * duration), dtype=np.float32)
163
+
164
+ audio_path = temp_storage_dir / "test_audio.wav"
165
+ sf.write(str(audio_path), samples, sample_rate)
166
+
167
+ return audio_path
168
+
169
+ def test_using_fixture(sample_audio_file):
170
+ """Test that uses the fixture."""
171
+ assert sample_audio_file.exists()
172
+ assert sample_audio_file.suffix == ".wav"
173
+ ```
174
+
175
+ ### Mocking External Dependencies
176
+
177
+ #### Mock yt-dlp
178
+
179
+ ```python
180
+ from unittest.mock import patch, MagicMock
181
+
182
+ @patch('pipeline.yt_dlp.YoutubeDL')
183
+ def test_download_audio(mock_ydl_class, temp_storage_dir):
184
+ """Test audio download with mocked yt-dlp."""
185
+ mock_ydl = MagicMock()
186
+ mock_ydl_class.return_value.__enter__.return_value = mock_ydl
187
+
188
+ result = download_audio("https://youtube.com/watch?v=...", temp_storage_dir)
189
+
190
+ assert result.exists()
191
+ mock_ydl.download.assert_called_once()
192
+ ```
193
+
194
+ #### Mock Redis
195
+
196
+ ```python
197
+ @pytest.fixture
198
+ def mock_redis():
199
+ """Mock Redis client."""
200
+ redis_mock = MagicMock(spec=Redis)
201
+ redis_mock.ping.return_value = True
202
+ redis_mock.hgetall.return_value = {}
203
+ return redis_mock
204
+
205
+ def test_with_redis(mock_redis):
206
+ """Test function that uses Redis."""
207
+ # Redis is mocked, no real connection needed
208
+ mock_redis.hset("key", "field", "value")
209
+ assert mock_redis.hset.called
210
+ ```
211
+
212
+ #### Mock ML Models
213
+
214
+ ```python
215
+ @patch('pipeline.basic_pitch.inference.predict')
216
+ def test_transcribe_audio(mock_predict, sample_audio_file, temp_storage_dir):
217
+ """Test transcription with mocked ML model."""
218
+ # Mock model output
219
+ mock_predict.return_value = (
220
+ np.zeros((100, 88)), # note activations
221
+ np.zeros((100, 88)), # onsets
222
+ np.zeros((100, 1)) # contours
223
+ )
224
+
225
+ result = transcribe_audio(sample_audio_file, temp_storage_dir)
226
+
227
+ assert result.exists()
228
+ assert result.suffix == ".mid"
229
+ ```
230
+
231
+ ## Testing Patterns
232
+
233
+ ### Testing API Endpoints
234
+
235
+ ```python
236
+ from fastapi.testclient import TestClient
237
+
238
+ def test_submit_transcription(test_client, mock_redis):
239
+ """Test transcription submission endpoint."""
240
+ response = test_client.post(
241
+ "/api/v1/transcribe",
242
+ json={"youtube_url": "https://www.youtube.com/watch?v=..."}
243
+ )
244
+
245
+ assert response.status_code == 201
246
+ data = response.json()
247
+ assert "job_id" in data
248
+ assert data["status"] == "queued"
249
+ ```
250
+
251
+ ### Testing Async Functions
252
+
253
+ ```python
254
+ import pytest
255
+
256
+ @pytest.mark.asyncio
257
+ async def test_async_function():
258
+ """Test async function."""
259
+ result = await async_operation()
260
+ assert result == expected_value
261
+ ```
262
+
263
+ ### Testing WebSocket Connections
264
+
265
+ ```python
266
+ def test_websocket(test_client, sample_job_id):
267
+ """Test WebSocket connection."""
268
+ with test_client.websocket_connect(f"/api/v1/jobs/{sample_job_id}/stream") as websocket:
269
+ data = websocket.receive_json()
270
+ assert data["type"] == "progress"
271
+ assert "job_id" in data
272
+ ```
273
+
274
+ ### Testing Error Scenarios
275
+
276
+ ```python
277
+ def test_video_too_long(test_client):
278
+ """Test error handling for videos exceeding duration limit."""
279
+ with patch('utils.check_video_availability') as mock_check:
280
+ mock_check.return_value = {
281
+ 'available': False,
282
+ 'reason': 'Video too long (max 15 minutes)'
283
+ }
284
+
285
+ response = test_client.post(
286
+ "/api/v1/transcribe",
287
+ json={"youtube_url": "https://www.youtube.com/watch?v=long"}
288
+ )
289
+
290
+ assert response.status_code == 422
291
+ assert "too long" in response.json()["detail"]
292
+ ```
293
+
294
+ ### Testing Retries
295
+
296
+ ```python
297
+ def test_retry_on_network_error():
298
+ """Test that function retries on network error."""
299
+ mock_func = MagicMock()
300
+ mock_func.side_effect = [
301
+ ConnectionError("Network timeout"), # First call fails
302
+ ConnectionError("Network timeout"), # Second call fails
303
+ {"success": True} # Third call succeeds
304
+ ]
305
+
306
+ # Function should retry and eventually succeed
307
+ result = function_with_retry(mock_func)
308
+ assert result == {"success": True}
309
+ assert mock_func.call_count == 3
310
+ ```
311
+
312
+ ### Parametrized Tests
313
+
314
+ Test multiple inputs efficiently:
315
+
316
+ ```python
317
+ @pytest.mark.parametrize("url,expected_valid,expected_id", [
318
+ ("https://www.youtube.com/watch?v=dQw4w9WgXcQ", True, "dQw4w9WgXcQ"),
319
+ ("https://youtu.be/dQw4w9WgXcQ", True, "dQw4w9WgXcQ"),
320
+ ("https://vimeo.com/12345", False, None),
321
+ ("not-a-url", False, None),
322
+ ])
323
+ def test_url_validation(url, expected_valid, expected_id):
324
+ """Test URL validation with multiple inputs."""
325
+ is_valid, result = validate_youtube_url(url)
326
+ assert is_valid == expected_valid
327
+ if expected_valid:
328
+ assert result == expected_id
329
+ ```
330
+
331
+ ## Testing Pipeline Stages
332
+
333
+ ### Audio Download
334
+
335
+ ```python
336
+ @patch('pipeline.yt_dlp.YoutubeDL')
337
+ def test_download_audio_success(mock_ydl_class, temp_storage_dir):
338
+ """Test successful audio download."""
339
+ mock_ydl = MagicMock()
340
+ mock_ydl_class.return_value.__enter__.return_value = mock_ydl
341
+
342
+ result = download_audio("https://youtube.com/watch?v=...", temp_storage_dir)
343
+
344
+ assert result.exists()
345
+ assert result.suffix == ".wav"
346
+ ```
347
+
348
+ ### Source Separation
349
+
350
+ ```python
351
+ @patch('pipeline.demucs.separate.main')
352
+ def test_separate_sources(mock_demucs, sample_audio_file, temp_storage_dir):
353
+ """Test source separation."""
354
+ # Create mock output files
355
+ stems_dir = temp_storage_dir / "htdemucs" / "test_audio"
356
+ stems_dir.mkdir(parents=True)
357
+ for stem in ["drums", "bass", "vocals", "other"]:
358
+ (stems_dir / f"{stem}.wav").touch()
359
+
360
+ result = separate_sources(sample_audio_file, temp_storage_dir)
361
+
362
+ assert all(stem in result for stem in ["drums", "bass", "vocals", "other"])
363
+ assert all(path.exists() for path in result.values())
364
+ ```
365
+
366
+ ### Transcription
367
+
368
+ ```python
369
+ @patch('pipeline.basic_pitch.inference.predict')
370
+ def test_transcribe_audio(mock_predict, sample_audio_file, temp_storage_dir):
371
+ """Test audio transcription."""
372
+ mock_predict.return_value = (
373
+ np.random.rand(100, 88),
374
+ np.random.rand(100, 88),
375
+ np.random.rand(100, 1)
376
+ )
377
+
378
+ result = transcribe_audio(sample_audio_file, temp_storage_dir)
379
+
380
+ assert result.exists()
381
+ assert result.suffix == ".mid"
382
+ ```
383
+
384
+ ### MusicXML Generation
385
+
386
+ ```python
387
+ @patch('pipeline.music21.converter.parse')
388
+ def test_generate_musicxml(mock_parse, sample_midi_file, temp_storage_dir):
389
+ """Test MusicXML generation."""
390
+ mock_score = MagicMock()
391
+ mock_parse.return_value = mock_score
392
+
393
+ result = generate_musicxml(sample_midi_file, temp_storage_dir)
394
+
395
+ assert result.exists()
396
+ assert result.suffix == ".musicxml"
397
+ mock_score.write.assert_called_once()
398
+ ```
399
+
400
+ ## Troubleshooting
401
+
402
+ ### Common Issues
403
+
404
+ **Import Errors**
405
+
406
+ ```bash
407
+ # Ensure backend directory is in PYTHONPATH
408
+ export PYTHONPATH="${PYTHONPATH}:$(pwd)"
409
+ pytest
410
+ ```
411
+
412
+ **Redis Connection Errors**
413
+
414
+ ```python
415
+ # Always mock Redis in tests unless testing Redis specifically
416
+ @pytest.fixture(autouse=True)
417
+ def mock_redis():
418
+ with patch('main.redis_client') as mock:
419
+ yield mock
420
+ ```
421
+
422
+ **File Permission Errors**
423
+
424
+ ```python
425
+ # Always use temp directories
426
+ @pytest.fixture
427
+ def temp_storage_dir():
428
+ temp_dir = tempfile.mkdtemp()
429
+ yield Path(temp_dir)
430
+ shutil.rmtree(temp_dir, ignore_errors=True)
431
+ ```
432
+
433
+ **GPU Not Available**
434
+
435
+ ```python
436
+ # Mark GPU tests and skip if unavailable
437
+ @pytest.mark.gpu
438
+ @pytest.mark.skipif(not torch.cuda.is_available(), reason="GPU not available")
439
+ def test_gpu_processing():
440
+ ...
441
+ ```
442
+
443
+ ### Debugging Failed Tests
444
+
445
+ ```bash
446
+ # Show print statements
447
+ pytest -s
448
+
449
+ # Verbose output
450
+ pytest -vv
451
+
452
+ # Drop into debugger on failure
453
+ pytest --pdb
454
+
455
+ # Run only failed tests
456
+ pytest --lf
457
+ ```
458
+
459
+ ### Performance Issues
460
+
461
+ ```bash
462
+ # Identify slow tests
463
+ pytest --durations=10
464
+
465
+ # Run tests in parallel
466
+ pytest -n auto # Requires pytest-xdist
467
+ ```
468
+
469
+ ## Best Practices
470
+
471
+ 1. **Mock external dependencies**: Don't make real API calls, network requests, or ML inferences
472
+ 2. **Use fixtures**: Share common setup code across tests
473
+ 3. **Test edge cases**: Empty inputs, None values, boundary conditions
474
+ 4. **Clean up resources**: Always clean up temp files, connections
475
+ 5. **Keep tests independent**: Tests should not depend on each other
476
+ 6. **Write descriptive names**: Test names should explain what they verify
477
+ 7. **Test one thing**: Each test should verify one specific behavior
478
+ 8. **Use markers**: Tag tests by type (unit, integration, slow, gpu)
479
+
480
+ ## Example Test File
481
+
482
+ Complete example showing best practices:
483
+
484
+ ```python
485
+ """Tests for audio processing pipeline."""
486
+ import pytest
487
+ from pathlib import Path
488
+ from unittest.mock import patch, MagicMock
489
+ import numpy as np
490
+ from pipeline import download_audio, separate_sources, transcribe_audio
491
+
492
+
493
+ class TestAudioDownload:
494
+ """Test audio download stage."""
495
+
496
+ @patch('pipeline.yt_dlp.YoutubeDL')
497
+ def test_success(self, mock_ydl_class, temp_storage_dir):
498
+ """Test successful audio download."""
499
+ mock_ydl = MagicMock()
500
+ mock_ydl_class.return_value.__enter__.return_value = mock_ydl
501
+
502
+ result = download_audio("https://youtube.com/watch?v=test", temp_storage_dir)
503
+
504
+ assert result.exists()
505
+ assert result.suffix == ".wav"
506
+ mock_ydl.download.assert_called_once()
507
+
508
+ @patch('pipeline.yt_dlp.YoutubeDL')
509
+ def test_network_error(self, mock_ydl_class, temp_storage_dir):
510
+ """Test handling of network error."""
511
+ import yt_dlp
512
+ mock_ydl = MagicMock()
513
+ mock_ydl.download.side_effect = yt_dlp.utils.DownloadError("Network error")
514
+ mock_ydl_class.return_value.__enter__.return_value = mock_ydl
515
+
516
+ with pytest.raises(Exception) as exc_info:
517
+ download_audio("https://youtube.com/watch?v=test", temp_storage_dir)
518
+
519
+ assert "Network error" in str(exc_info.value)
520
+ ```
docs/testing/baseline-accuracy.md ADDED
@@ -0,0 +1,178 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Baseline Accuracy Report
2
+
3
+ **Date**: 2024-12-24
4
+ **Pipeline Version**: Phase 1 Complete (MusicXML corruption fixes, MIDI export, rate limiting)
5
+ **Test Suite**: 10 diverse piano videos
6
+
7
+ ## Executive Summary
8
+
9
+ This report establishes the baseline transcription accuracy for the Rescored MVP pipeline after Phase 1 improvements.
10
+
11
+ **Initial Test Results** (Before Bug Fixes):
12
+ - Overall Success Rate: **10%** (1/10 videos)
13
+ - Videos Blocked: 3 (YouTube copyright/availability)
14
+ - Code Bugs Found: 6 (all fixed ✅)
15
+ - Successful Test: simple_melody (2,588 notes, 122 measures)
16
+
17
+ **Expected After Fixes**:
18
+ - Success Rate: **70-80%** (7-8/10 videos, excluding blocked ones)
19
+ - All code bugs resolved
20
+ - Need to replace 3 blocked videos with alternatives
21
+
22
+ **Key Finding**: Measure timing accuracy is imperfect (78% of measures show duration warnings), but this is expected for ML-based transcription. MusicXML files load successfully in notation software.
23
+
24
+ ## Test Videos
25
+
26
+ | ID | Description | Difficulty | Expected Accuracy | URL |
27
+ |----|-------------|------------|-------------------|-----|
28
+ | simple_melody | C major scale practice | Easy | >80% | [Link](https://www.youtube.com/watch?v=TK1Ij_-mank) |
29
+ | twinkle_twinkle | Twinkle Twinkle Little Star | Easy | >75% | [Link](https://www.youtube.com/watch?v=YCZ_d_4ZEqk) |
30
+ | fur_elise | Beethoven - Für Elise (simplified) | Medium | 60-70% | [Link](https://www.youtube.com/watch?v=_mVW8tgGY_w) |
31
+ | chopin_nocturne | Chopin - Nocturne Op. 9 No. 2 | Hard | 50-60% | [Link](https://www.youtube.com/watch?v=9E6b3swbnWg) |
32
+ | canon_in_d | Pachelbel - Canon in D | Medium | 60-70% | [Link](https://www.youtube.com/watch?v=NlprozGcs80) |
33
+ | river_flows | Yiruma - River Flows in You | Medium | 60-70% | [Link](https://www.youtube.com/watch?v=7maJOI3QMu0) |
34
+ | moonlight_sonata | Beethoven - Moonlight Sonata | Medium | 60-70% | [Link](https://www.youtube.com/watch?v=4Tr0otuiQuU) |
35
+ | jazz_blues | Simple jazz blues piano | Medium | 55-65% | [Link](https://www.youtube.com/watch?v=F3W_alUuFkA) |
36
+ | claire_de_lune | Debussy - Clair de Lune | Hard | 50-60% | [Link](https://www.youtube.com/watch?v=WNcsUNKlAKw) |
37
+ | la_campanella | Liszt - La Campanella | Very Hard | 40-50% | [Link](https://www.youtube.com/watch?v=MD6xMyuZls0) |
38
+
39
+ ## Results
40
+
41
+ ### Overall Statistics
42
+
43
+ (To be filled after test completion)
44
+
45
+ - **Total Tests**: 10
46
+ - **Successful**: TBD
47
+ - **Failed**: TBD
48
+ - **Success Rate**: TBD%
49
+
50
+ ### Per-Video Results
51
+
52
+ #### Easy Difficulty (2 videos)
53
+
54
+ **simple_melody** ✅:
55
+ - Status: **SUCCESS**
56
+ - MIDI Notes: 2,588
57
+ - Measures: 122
58
+ - Duration: 245.2 seconds
59
+ - Separation Quality: 99.3% energy in 'other' stem (excellent)
60
+ - Measure Warnings: 95/122 (78%) - typical for ML transcription
61
+ - Issues: None - clean transcription
62
+
63
+ **twinkle_twinkle** ❌:
64
+ - Status: **BLOCKED**
65
+ - Error: "Video unavailable"
66
+ - Action: Replace with alternative video
67
+
68
+ #### Medium Difficulty (5 videos)
69
+
70
+ **fur_elise** ❌:
71
+ - Status: **BLOCKED**
72
+ - Error: "Video unavailable"
73
+ - Action: Replace with alternative video
74
+
75
+ **canon_in_d** ❌ → ✅:
76
+ - Status: **FIXED**
77
+ - Error: NoneType velocity comparison (Bug #2a)
78
+ - Fix Applied: Safe velocity handling in deduplication
79
+ - Expected: Success on re-run
80
+
81
+ **river_flows** ❌ → ✅:
82
+ - Status: **FIXED**
83
+ - Error: NoneType velocity comparison (Bug #2a)
84
+ - Fix Applied: Safe velocity handling
85
+ - Expected: Success on re-run
86
+
87
+ **moonlight_sonata** ❌ → ✅:
88
+ - Status: **FIXED**
89
+ - Error: NoneType velocity comparison (Bug #2a)
90
+ - Fix Applied: Safe velocity handling
91
+ - Expected: Success on re-run
92
+
93
+ **jazz_blues** ❌:
94
+ - Status: **BLOCKED**
95
+ - Error: "Blocked on copyright grounds"
96
+ - Action: Replace with public domain jazz piano
97
+
98
+ #### Hard Difficulty (2 videos)
99
+
100
+ **chopin_nocturne** ❌ → ✅:
101
+ - Status: **FIXED**
102
+ - Error: 2048th note duration in measure 129 (Bug #2b)
103
+ - Fix Applied: Increased minimum duration threshold to 128th note
104
+ - Expected: Success on re-run
105
+
106
+ **claire_de_lune** ❌ → ✅:
107
+ - Status: **FIXED**
108
+ - Error: 2048th note duration in measure 30 (Bug #2b)
109
+ - Fix Applied: Increased minimum duration threshold
110
+ - Expected: Success on re-run
111
+
112
+ #### Very Hard Difficulty (1 video)
113
+
114
+ **la_campanella** ❌ → ✅:
115
+ - Status: **FIXED**
116
+ - Error: NoneType velocity comparison (Bug #2a)
117
+ - Fix Applied: Safe velocity handling
118
+ - Expected: Success on re-run (may have low accuracy due to extreme difficulty)
119
+
120
+ ## Common Failure Modes
121
+
122
+ Detailed analysis in [failure-modes.md](failure-modes.md)
123
+
124
+ ### 1. Video Availability (30% of failures)
125
+ - YouTube blocking, copyright claims, unavailable videos
126
+ - **Solution**: Replace with Creative Commons alternatives
127
+
128
+ ### 2. Code Bugs - All Fixed ✅ (60% of failures)
129
+ - **Bug 2a**: NoneType velocity comparison (4 videos)
130
+ - Fixed in [pipeline.py:403-409](../../backend/pipeline.py#L403-L409)
131
+ - **Bug 2b**: 2048th note duration errors (2 videos)
132
+ - Fixed in [pipeline.py:465-502](../../backend/pipeline.py#L465-L502)
133
+
134
+ ### 3. Measure Timing Accuracy (78% imperfect)
135
+ - Most measures deviate from exact 4.0 beats
136
+ - Range: 0.0 to 7.83 beats (should be 4.0)
137
+ - **Root causes**: basic-pitch timing, duration snapping, polyphony
138
+ - **Impact**: MusicXML loads but rhythms need manual correction
139
+ - **Status**: Expected limitation for ML transcription - Phase 3 will improve
140
+
141
+ ## Accuracy by Difficulty
142
+
143
+ | Difficulty | Avg Success Rate | Avg Notes | Avg Measures | Notes |
144
+ |------------|------------------|-----------|--------------|-------|
145
+ | Easy | TBD | TBD | TBD | TBD |
146
+ | Medium | TBD | TBD | TBD | TBD |
147
+ | Hard | TBD | TBD | TBD | TBD |
148
+ | Very Hard | TBD | TBD | TBD | TBD |
149
+
150
+ ## Known Limitations
151
+
152
+ Based on Phase 1 implementation:
153
+
154
+ 1. **Measure Timing**: Many measures show duration warnings (e.g. 0.0-7.83 beats instead of exactly 4.0). This is expected due to:
155
+ - basic-pitch not perfectly aligned to beats
156
+ - Duration snapping to nearest valid note values
157
+ - Imperfect tempo detection
158
+
159
+ 2. **MusicXML Warnings**: music21 reports some "overfull measures" when parsing. These are handled gracefully but indicate timing imperfections.
160
+
161
+ 3. **Single Staff Only**: Grand staff (treble + bass) disabled in Phase 1 due to polyphony issues.
162
+
163
+ 4. **Piano Only**: Currently only transcribes "other" stem from Demucs, assuming piano/keyboard content.
164
+
165
+ ## Recommendations for Phase 3
166
+
167
+ (To be filled based on failure analysis)
168
+
169
+ 1. **Parameter Tuning**: TBD
170
+ 2. **Model Improvements**: TBD
171
+ 3. **Post-Processing**: TBD
172
+ 4. **Source Separation**: TBD
173
+
174
+ ## Appendix: Raw Test Data
175
+
176
+ Full test results JSON: `/tmp/rescored/accuracy_test_results.json`
177
+
178
+ Individual test outputs in: `/tmp/rescored/temp/accuracy_test_*/`
docs/testing/failure-modes.md ADDED
@@ -0,0 +1,216 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Failure Mode Analysis
2
+
3
+ **Date**: 2024-12-24
4
+ **Test Suite**: Phase 2 Accuracy Baseline (10 videos)
5
+ **Pipeline Version**: Phase 1 Complete + Bug Fixes
6
+
7
+ ## Executive Summary
8
+
9
+ Initial accuracy testing revealed **3 major failure categories** affecting 9 out of 10 test videos:
10
+
11
+ 1. **Video Availability** (30% of failures) - YouTube blocking/copyright
12
+ 2. **Code Bugs** (60% of failures) - NoneType errors and 2048th note duration issues
13
+ 3. **MusicXML Export** (20% of failures) - Impossible duration errors
14
+
15
+ **All code bugs have been fixed.** Success rate expected to improve significantly with re-run.
16
+
17
+ ## Failure Categories
18
+
19
+ ### 1. Video Availability Issues (3 videos - 30%)
20
+
21
+ **Videos Affected:**
22
+ - `twinkle_twinkle` - "Video unavailable"
23
+ - `fur_elise` - "Video unavailable"
24
+ - `jazz_blues` - "Blocked in your country on copyright grounds"
25
+
26
+ **Root Cause:** YouTube access restrictions, not pipeline issues
27
+
28
+ **Mitigation:**
29
+ - Replace with alternative videos for same difficulty level
30
+ - Use Creative Commons licensed videos
31
+ - Host test videos on alternative platforms
32
+
33
+ **Impact:** Not a pipeline issue - will replace test videos
34
+
35
+ ---
36
+
37
+ ### 2. Code Bugs - Fixed ✅ (6 videos - 60%)
38
+
39
+ #### Bug 2a: NoneType Velocity Comparison (4 videos)
40
+
41
+ **Error:** `'<' not supported between instances of 'int' and 'NoneType'`
42
+
43
+ **Videos Affected:**
44
+ - `canon_in_d`
45
+ - `river_flows`
46
+ - `moonlight_sonata`
47
+ - `la_campanella`
48
+
49
+ **Root Cause:** In `_deduplicate_overlapping_notes()` at [pipeline.py:403-407](../backend/pipeline.py#L403-L407), the code tried to sort notes by velocity, but `note.volume.velocity` can return `None`.
50
+
51
+ **Fix Applied:**
52
+ ```python
53
+ def get_velocity(note):
54
+ if hasattr(note, 'volume') and hasattr(note.volume, 'velocity'):
55
+ vel = note.volume.velocity
56
+ return vel if vel is not None else 64
57
+ return 64
58
+
59
+ pitch_notes.sort(key=lambda x: (x.quarterLength, get_velocity(x)), reverse=True)
60
+ ```
61
+
62
+ **Status:** ✅ Fixed in [pipeline.py:403-409](../backend/pipeline.py#L403-L409)
63
+
64
+ ---
65
+
66
+ #### Bug 2b: 2048th Note Duration (2 videos)
67
+
68
+ **Error:** `In part (Piano), measure (X): Cannot convert "2048th" duration to MusicXML (too short).`
69
+
70
+ **Videos Affected:**
71
+ - `chopin_nocturne` (measure 129)
72
+ - `claire_de_lune` (measure 30)
73
+
74
+ **Root Cause:** `music21.makeMeasures()` creates extremely short rests (2048th notes) when filling gaps between notes. MusicXML export fails because these durations are too short to represent.
75
+
76
+ **Previous Attempts:**
77
+ 1. ❌ Filtered notes < 64th note (0.0625) before `makeMeasures()` - didn't work
78
+ 2. ❌ Removed notes < 64th note after `makeMeasures()` - still had issues
79
+
80
+ **Final Fix:**
81
+ - Increased minimum duration threshold to **128th note** (0.03125)
82
+ - Added logging to show how many notes/rests were removed
83
+ - Applied in `_remove_impossible_durations()` at [pipeline.py:465-502](../backend/pipeline.py#L465-L502)
84
+
85
+ **Status:** ✅ Fixed - more aggressive filtering
86
+
87
+ ---
88
+
89
+ ### 3. Successful Test Analysis
90
+
91
+ **Video:** `simple_melody` (C major scale practice, Easy difficulty)
92
+
93
+ **Results:**
94
+ - ✅ Successfully generated MusicXML
95
+ - **2,588 notes** detected
96
+ - **122 measures** created
97
+ - **245 seconds** duration
98
+ - **99.3% energy** preserved in 'other' stem (excellent separation)
99
+
100
+ **Key Metrics:**
101
+
102
+ | Metric | Value | Assessment |
103
+ |--------|-------|------------|
104
+ | Note density | 5.36 notes/sec | Reasonable for piano |
105
+ | Pitch range | G1 to A6 (62 semitones) | Full piano range |
106
+ | Polyphony | ~1.6 avg, ~6 max | Modest polyphony |
107
+ | Short notes | 271 (21%) under 200ms | Acceptable |
108
+ | Measure warnings | 95/122 (78%) | **High** - timing imperfect |
109
+
110
+ **Measure Timing Issues:**
111
+
112
+ 78% of measures showed duration warnings (range 0.0 - 7.83 beats instead of exactly 4.0). Examples:
113
+ - Measure 1: 0.00 beats (empty)
114
+ - Measure 30: 6.41 beats (overfull)
115
+ - Measure 69: 7.33 beats (very overfull)
116
+ - Measure 77: 7.83 beats (worst case)
117
+
118
+ **Root Causes:**
119
+ 1. **basic-pitch timing** not aligned to musical beats
120
+ 2. **Duration snapping** to nearest valid note value loses precision
121
+ 3. **Tempo detection** may be inaccurate
122
+ 4. **Polyphonic overlaps** creating extra duration
123
+
124
+ **Impact:** MusicXML loads in notation software but rhythms are imperfect. This is expected with ML-based transcription.
125
+
126
+ ---
127
+
128
+ ## Common Patterns
129
+
130
+ ### Pattern 1: Quiet Audio Detection
131
+ - Diagnostic shows RMS energy of 0.0432 (very quiet)
132
+ - 20% silence in audio
133
+ - basic-pitch may struggle with quiet inputs
134
+
135
+ ### Pattern 2: Separation Quality
136
+ - For `simple_melody`: 99.3% energy in 'other' stem ✅
137
+ - Only 0.2% in 'no_other' stem (excellent isolation)
138
+ - Demucs successfully isolated piano
139
+
140
+ ### Pattern 3: Measure Duration Accuracy
141
+ - **Only 22%** of measures have exactly 4.0 beats
142
+ - **78%** show timing deviations
143
+ - Range: -4.0 to +3.83 beats deviation
144
+ - Largest errors in complex sections (likely polyphony)
145
+
146
+ ---
147
+
148
+ ## Recommendations
149
+
150
+ ### Immediate Actions (Phase 2 completion)
151
+
152
+ 1. **Replace unavailable videos** with Creative Commons alternatives
153
+ 2. **Re-run accuracy suite** with bug fixes
154
+ 3. **Document actual baseline** with successful tests
155
+
156
+ ### Phase 3 Improvements (Accuracy Tuning)
157
+
158
+ 1. **Tempo Detection:**
159
+ - Implement better tempo detection (analyze beat patterns)
160
+ - Consider fixed tempo option for practice scales
161
+
162
+ 2. **Quantization:**
163
+ - Improve rhythmic quantization to align with detected beats
164
+ - Consider time signature detection
165
+
166
+ 3. **Post-Processing:**
167
+ - Add measure duration normalization
168
+ - Stretch/compress note timings to fit exact 4.0 beats
169
+
170
+ 4. **Parameter Tuning:**
171
+ - Test different `onset-threshold` values (current: 0.5)
172
+ - Test different `frame-threshold` values (current: 0.4)
173
+ - Experiment with `minimum-note-length`
174
+
175
+ ### Alternative Models (Phase 3 - Optional)
176
+
177
+ Consider testing:
178
+ - **MT3** (Google's Music Transformer) - better rhythm accuracy
179
+ - **htdemucs_6s** - 6-stem model with dedicated piano stem
180
+ - **Omnizart** - specialized for classical music
181
+
182
+ ---
183
+
184
+ ## Success Criteria
185
+
186
+ After fixes and re-run, we expect:
187
+
188
+ - ✅ **Video availability**: 7-8 working videos (replacing blocked ones)
189
+ - ✅ **Code bugs**: 0% failure rate (all fixed)
190
+ - ✅ **MusicXML export**: 100% success for available videos
191
+ - 🎯 **Overall success rate**: 70-80% (from 10%)
192
+
193
+ Measure timing accuracy will remain imperfect (~78% with warnings) but this is expected for MVP. Phase 3 will focus on improving timing accuracy.
194
+
195
+ ---
196
+
197
+ ## Appendix: Error Details
198
+
199
+ ### NoneType Error Stack Trace
200
+ ```
201
+ File "pipeline.py", line 403
202
+ pitch_notes.sort(key=lambda x: (x.quarterLength, x.volume.velocity if ...))
203
+ TypeError: '<' not supported between instances of 'int' and 'NoneType'
204
+ ```
205
+
206
+ ### 2048th Note Error Stack Trace
207
+ ```
208
+ File "music21/musicxml/m21ToXml.py", line 4702
209
+ mxNormalType.text = typeToMusicXMLType(tup.durationNormal.type)
210
+ MusicXMLExportException: In part (Piano), measure (129): Cannot convert "2048th" duration to MusicXML (too short).
211
+ ```
212
+
213
+ ---
214
+
215
+ **Last Updated**: 2024-12-24
216
+ **Next Review**: After accuracy suite re-run
docs/testing/frontend-testing.md ADDED
@@ -0,0 +1,653 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Frontend Testing Guide
2
+
3
+ Comprehensive guide for testing the Rescored frontend.
4
+
5
+ ## Table of Contents
6
+
7
+ - [Setup](#setup)
8
+ - [Running Tests](#running-tests)
9
+ - [Test Structure](#test-structure)
10
+ - [Writing Tests](#writing-tests)
11
+ - [Testing Patterns](#testing-patterns)
12
+ - [Troubleshooting](#troubleshooting)
13
+
14
+ ## Setup
15
+
16
+ ### Install Test Dependencies
17
+
18
+ ```bash
19
+ cd frontend
20
+ npm install
21
+ ```
22
+
23
+ Test dependencies (already in `package.json`):
24
+ - `vitest`: Test framework
25
+ - `@testing-library/react`: React testing utilities
26
+ - `@testing-library/user-event`: User interaction simulation
27
+ - `@testing-library/jest-dom`: DOM matchers
28
+ - `jsdom`: DOM implementation for Node.js
29
+ - `@vitest/ui`: Interactive test UI
30
+ - `@vitest/coverage-v8`: Coverage reporting
31
+
32
+ ### Configuration
33
+
34
+ Test configuration is in `vitest.config.ts`:
35
+
36
+ ```typescript
37
+ export default defineConfig({
38
+ test: {
39
+ globals: true,
40
+ environment: 'jsdom',
41
+ setupFiles: ['./src/tests/setup.ts'],
42
+ coverage: {
43
+ provider: 'v8',
44
+ reporter: ['text', 'html', 'lcov'],
45
+ },
46
+ },
47
+ });
48
+ ```
49
+
50
+ ## Running Tests
51
+
52
+ ### Basic Commands
53
+
54
+ ```bash
55
+ # Run all tests
56
+ npm test
57
+
58
+ # Run in watch mode
59
+ npm test -- --watch
60
+
61
+ # Run with UI
62
+ npm run test:ui
63
+
64
+ # Run with coverage
65
+ npm run test:coverage
66
+
67
+ # Run specific file
68
+ npm test -- src/tests/api/client.test.ts
69
+
70
+ # Run tests matching pattern
71
+ npm test -- --grep "JobSubmission"
72
+ ```
73
+
74
+ ### Watch Mode
75
+
76
+ Watch mode automatically re-runs tests when files change:
77
+
78
+ ```bash
79
+ npm test -- --watch
80
+
81
+ # Watch specific file
82
+ npm test -- --watch src/tests/components/NotationCanvas.test.tsx
83
+ ```
84
+
85
+ ### Coverage Reports
86
+
87
+ ```bash
88
+ # Generate coverage report
89
+ npm run test:coverage
90
+
91
+ # Open HTML report
92
+ open coverage/index.html
93
+ ```
94
+
95
+ ## Test Structure
96
+
97
+ ### Test Files
98
+
99
+ Component tests live alongside components or in `src/tests/`:
100
+
101
+ ```
102
+ frontend/src/
103
+ ├── components/
104
+ │ ├── JobSubmission.tsx
105
+ │ └── JobSubmission.test.tsx # Option 1: Co-located
106
+ ├── tests/
107
+ │ ├── setup.ts # Test configuration
108
+ │ ├── fixtures.ts # Shared test data
109
+ │ ├── components/
110
+ │ │ └── JobSubmission.test.tsx # Option 2: Separate directory
111
+ │ └── api/
112
+ │ └── client.test.ts
113
+ ```
114
+
115
+ ### Test Organization
116
+
117
+ ```typescript
118
+ import { describe, it, expect, vi, beforeEach } from 'vitest';
119
+ import { render, screen } from '@testing-library/react';
120
+ import Component from './Component';
121
+
122
+ describe('Component', () => {
123
+ beforeEach(() => {
124
+ // Setup before each test
125
+ });
126
+
127
+ describe('Rendering', () => {
128
+ it('should render correctly', () => {
129
+ // Test rendering
130
+ });
131
+ });
132
+
133
+ describe('Interactions', () => {
134
+ it('should handle user input', async () => {
135
+ // Test interactions
136
+ });
137
+ });
138
+
139
+ describe('Edge Cases', () => {
140
+ it('should handle empty state', () => {
141
+ // Test edge cases
142
+ });
143
+ });
144
+ });
145
+ ```
146
+
147
+ ## Writing Tests
148
+
149
+ ### Basic Component Test
150
+
151
+ ```typescript
152
+ import { describe, it, expect } from 'vitest';
153
+ import { render, screen } from '@testing-library/react';
154
+ import MyComponent from './MyComponent';
155
+
156
+ describe('MyComponent', () => {
157
+ it('should render text', () => {
158
+ render(<MyComponent text="Hello" />);
159
+ expect(screen.getByText('Hello')).toBeInTheDocument();
160
+ });
161
+
162
+ it('should handle button click', async () => {
163
+ const user = userEvent.setup();
164
+ const handleClick = vi.fn();
165
+
166
+ render(<MyComponent onClick={handleClick} />);
167
+
168
+ const button = screen.getByRole('button');
169
+ await user.click(button);
170
+
171
+ expect(handleClick).toHaveBeenCalledTimes(1);
172
+ });
173
+ });
174
+ ```
175
+
176
+ ### Testing with User Interactions
177
+
178
+ Use `@testing-library/user-event` for realistic interactions:
179
+
180
+ ```typescript
181
+ import userEvent from '@testing-library/user-event';
182
+
183
+ it('should accept user input', async () => {
184
+ const user = userEvent.setup();
185
+ render(<JobSubmission />);
186
+
187
+ const input = screen.getByPlaceholderText(/youtube url/i);
188
+
189
+ // Type into input
190
+ await user.type(input, 'https://www.youtube.com/watch?v=...');
191
+ expect(input).toHaveValue('https://www.youtube.com/watch?v=...');
192
+
193
+ // Click button
194
+ const button = screen.getByRole('button', { name: /submit/i });
195
+ await user.click(button);
196
+
197
+ // Verify action
198
+ await waitFor(() => {
199
+ expect(mockSubmit).toHaveBeenCalled();
200
+ });
201
+ });
202
+ ```
203
+
204
+ ### Testing Async Operations
205
+
206
+ ```typescript
207
+ import { waitFor } from '@testing-library/react';
208
+
209
+ it('should load data', async () => {
210
+ const mockFetch = vi.fn().mockResolvedValue({
211
+ ok: true,
212
+ json: async () => ({ data: 'test' }),
213
+ });
214
+ global.fetch = mockFetch;
215
+
216
+ render(<DataComponent />);
217
+
218
+ await waitFor(() => {
219
+ expect(screen.getByText('test')).toBeInTheDocument();
220
+ });
221
+ });
222
+ ```
223
+
224
+ ### Mocking Dependencies
225
+
226
+ #### Mock API Client
227
+
228
+ ```typescript
229
+ vi.mock('../../api/client', () => ({
230
+ submitTranscription: vi.fn(),
231
+ getJobStatus: vi.fn(),
232
+ downloadScore: vi.fn(),
233
+ }));
234
+
235
+ import { submitTranscription } from '../../api/client';
236
+
237
+ it('should call API', async () => {
238
+ const mockSubmit = vi.mocked(submitTranscription);
239
+ mockSubmit.mockResolvedValue({ job_id: '123', status: 'queued' });
240
+
241
+ // Test component that uses submitTranscription
242
+ // ...
243
+
244
+ expect(mockSubmit).toHaveBeenCalledWith('https://youtube.com/...');
245
+ });
246
+ ```
247
+
248
+ #### Mock Zustand Store
249
+
250
+ ```typescript
251
+ import { renderHook, act } from '@testing-library/react';
252
+ import { useScoreStore } from '../../store/scoreStore';
253
+
254
+ it('should update store', () => {
255
+ const { result } = renderHook(() => useScoreStore());
256
+
257
+ act(() => {
258
+ result.current.setMusicXML('<musicxml>...</musicxml>');
259
+ });
260
+
261
+ expect(result.current.musicXML).toBe('<musicxml>...</musicxml>');
262
+ });
263
+ ```
264
+
265
+ #### Mock VexFlow
266
+
267
+ ```typescript
268
+ // In setup.ts
269
+ vi.mock('vexflow', () => ({
270
+ Flow: {
271
+ Renderer: vi.fn(() => ({
272
+ resize: vi.fn(),
273
+ getContext: vi.fn(() => ({
274
+ clear: vi.fn(),
275
+ setFont: vi.fn(),
276
+ })),
277
+ })),
278
+ Stave: vi.fn(() => ({
279
+ addClef: vi.fn().mockReturnThis(),
280
+ addTimeSignature: vi.fn().mockReturnThis(),
281
+ setContext: vi.fn().mockReturnThis(),
282
+ draw: vi.fn(),
283
+ })),
284
+ },
285
+ }));
286
+ ```
287
+
288
+ ## Testing Patterns
289
+
290
+ ### Testing Form Submission
291
+
292
+ ```typescript
293
+ it('should submit form with valid data', async () => {
294
+ const user = userEvent.setup();
295
+ const onSubmit = vi.fn();
296
+
297
+ render(<Form onSubmit={onSubmit} />);
298
+
299
+ // Fill out form
300
+ await user.type(screen.getByLabelText(/url/i), 'https://youtube.com/...');
301
+
302
+ // Submit
303
+ await user.click(screen.getByRole('button', { name: /submit/i }));
304
+
305
+ // Verify
306
+ await waitFor(() => {
307
+ expect(onSubmit).toHaveBeenCalledWith({
308
+ url: 'https://youtube.com/...',
309
+ });
310
+ });
311
+ });
312
+ ```
313
+
314
+ ### Testing Error States
315
+
316
+ ```typescript
317
+ it('should show error message', async () => {
318
+ const mockFetch = vi.fn().mockRejectedValue(new Error('Network error'));
319
+ global.fetch = mockFetch;
320
+
321
+ render(<Component />);
322
+
323
+ await waitFor(() => {
324
+ expect(screen.getByText(/network error/i)).toBeInTheDocument();
325
+ });
326
+ });
327
+ ```
328
+
329
+ ### Testing Loading States
330
+
331
+ ```typescript
332
+ it('should show loading indicator', async () => {
333
+ const mockFetch = vi.fn(() =>
334
+ new Promise(resolve => setTimeout(() => resolve({ ok: true }), 100))
335
+ );
336
+ global.fetch = mockFetch;
337
+
338
+ render(<Component />);
339
+
340
+ // Should show loading
341
+ expect(screen.getByText(/loading/i)).toBeInTheDocument();
342
+
343
+ // Should hide loading after data loads
344
+ await waitFor(() => {
345
+ expect(screen.queryByText(/loading/i)).not.toBeInTheDocument();
346
+ });
347
+ });
348
+ ```
349
+
350
+ ### Testing WebSocket Connections
351
+
352
+ ```typescript
353
+ it('should handle WebSocket messages', () => {
354
+ const mockWS = {
355
+ addEventListener: vi.fn(),
356
+ send: vi.fn(),
357
+ close: vi.fn(),
358
+ };
359
+
360
+ global.WebSocket = vi.fn(() => mockWS) as any;
361
+
362
+ render(<WebSocketComponent />);
363
+
364
+ // Get message handler
365
+ const messageHandler = mockWS.addEventListener.mock.calls.find(
366
+ call => call[0] === 'message'
367
+ )?.[1];
368
+
369
+ // Simulate message
370
+ messageHandler?.({ data: JSON.stringify({ type: 'progress', progress: 50 }) });
371
+
372
+ // Verify UI updated
373
+ expect(screen.getByText(/50%/)).toBeInTheDocument();
374
+ });
375
+ ```
376
+
377
+ ### Testing Conditional Rendering
378
+
379
+ ```typescript
380
+ it('should render different states', () => {
381
+ const { rerender } = render(<StatusIndicator status="loading" />);
382
+ expect(screen.getByText(/loading/i)).toBeInTheDocument();
383
+
384
+ rerender(<StatusIndicator status="success" />);
385
+ expect(screen.getByText(/success/i)).toBeInTheDocument();
386
+
387
+ rerender(<StatusIndicator status="error" />);
388
+ expect(screen.getByText(/error/i)).toBeInTheDocument();
389
+ });
390
+ ```
391
+
392
+ ### Testing Canvas/VexFlow Components
393
+
394
+ ```typescript
395
+ it('should render notation', () => {
396
+ // Mock canvas context
397
+ const mockContext = {
398
+ fillRect: vi.fn(),
399
+ clearRect: vi.fn(),
400
+ beginPath: vi.fn(),
401
+ stroke: vi.fn(),
402
+ };
403
+
404
+ HTMLCanvasElement.prototype.getContext = vi.fn(() => mockContext) as any;
405
+
406
+ const { container } = render(<NotationCanvas musicXML={sampleXML} />);
407
+
408
+ // Verify canvas or SVG exists
409
+ const canvas = container.querySelector('canvas');
410
+ expect(canvas).toBeInTheDocument();
411
+ });
412
+ ```
413
+
414
+ ### Snapshot Testing
415
+
416
+ Use snapshots for stable UI components:
417
+
418
+ ```typescript
419
+ it('should match snapshot', () => {
420
+ const { container } = render(<StaticComponent />);
421
+ expect(container).toMatchSnapshot();
422
+ });
423
+ ```
424
+
425
+ **Update snapshots:**
426
+ ```bash
427
+ npm test -- -u
428
+ ```
429
+
430
+ ## Testing Custom Hooks
431
+
432
+ ```typescript
433
+ import { renderHook, act } from '@testing-library/react';
434
+ import { useCustomHook } from './useCustomHook';
435
+
436
+ it('should handle state changes', () => {
437
+ const { result } = renderHook(() => useCustomHook());
438
+
439
+ expect(result.current.count).toBe(0);
440
+
441
+ act(() => {
442
+ result.current.increment();
443
+ });
444
+
445
+ expect(result.current.count).toBe(1);
446
+ });
447
+ ```
448
+
449
+ ## Accessibility Testing
450
+
451
+ ```typescript
452
+ it('should be accessible', () => {
453
+ render(<Component />);
454
+
455
+ // Check for proper labels
456
+ expect(screen.getByLabelText(/input field/i)).toBeInTheDocument();
457
+
458
+ // Check for ARIA attributes
459
+ expect(screen.getByRole('button')).toHaveAttribute('aria-label', 'Submit');
460
+
461
+ // Check keyboard navigation
462
+ const button = screen.getByRole('button');
463
+ button.focus();
464
+ expect(button).toHaveFocus();
465
+ });
466
+ ```
467
+
468
+ ## Troubleshooting
469
+
470
+ ### Common Issues
471
+
472
+ **Canvas/VexFlow Errors**
473
+
474
+ ```typescript
475
+ // Mock canvas in setup.ts
476
+ beforeEach(() => {
477
+ HTMLCanvasElement.prototype.getContext = vi.fn(() => ({
478
+ fillRect: vi.fn(),
479
+ // ... other canvas methods
480
+ })) as any;
481
+ });
482
+ ```
483
+
484
+ **WebSocket Errors**
485
+
486
+ ```typescript
487
+ // Mock WebSocket in setup.ts
488
+ global.WebSocket = vi.fn(() => ({
489
+ addEventListener: vi.fn(),
490
+ send: vi.fn(),
491
+ close: vi.fn(),
492
+ readyState: WebSocket.OPEN,
493
+ })) as any;
494
+ ```
495
+
496
+ **Module Import Errors**
497
+
498
+ ```typescript
499
+ // Use vi.mock at top of test file
500
+ vi.mock('external-module', () => ({
501
+ default: vi.fn(),
502
+ namedExport: vi.fn(),
503
+ }));
504
+ ```
505
+
506
+ **Async Test Timeouts**
507
+
508
+ ```typescript
509
+ // Increase timeout for slow tests
510
+ it('slow test', async () => {
511
+ // ...
512
+ }, { timeout: 10000 });
513
+ ```
514
+
515
+ ### Debugging Tests
516
+
517
+ ```bash
518
+ # Run with UI for interactive debugging
519
+ npm run test:ui
520
+
521
+ # Run specific test in watch mode
522
+ npm test -- --watch --grep "test name"
523
+
524
+ # Debug in VS Code
525
+ # Add breakpoint and use "Debug Test" code lens
526
+ ```
527
+
528
+ ### Performance Issues
529
+
530
+ ```bash
531
+ # Identify slow tests
532
+ npm test -- --reporter=verbose
533
+
534
+ # Run tests in parallel (default)
535
+ npm test
536
+
537
+ # Run sequentially if needed
538
+ npm test -- --no-threads
539
+ ```
540
+
541
+ ## Best Practices
542
+
543
+ 1. **Test user behavior, not implementation**: Focus on what users see and do
544
+ 2. **Use accessible queries**: Prefer `getByRole`, `getByLabelText` over `getByTestId`
545
+ 3. **Avoid testing implementation details**: Don't test internal state or methods
546
+ 4. **Keep tests simple**: Each test should verify one thing
547
+ 5. **Use realistic data**: Test with data similar to production
548
+ 6. **Clean up**: Always clean up side effects (timers, listeners)
549
+ 7. **Mock external dependencies**: Don't make real API calls or WebSocket connections
550
+ 8. **Test edge cases**: Empty states, errors, loading states
551
+
552
+ ## Query Priority
553
+
554
+ Use queries in this order (most preferred first):
555
+
556
+ 1. **Accessible Queries**:
557
+ - `getByRole`
558
+ - `getByLabelText`
559
+ - `getByPlaceholderText`
560
+ - `getByText`
561
+
562
+ 2. **Semantic Queries**:
563
+ - `getByAltText`
564
+ - `getByTitle`
565
+
566
+ 3. **Test IDs** (last resort):
567
+ - `getByTestId`
568
+
569
+ Example:
570
+
571
+ ```typescript
572
+ // Good
573
+ const button = screen.getByRole('button', { name: /submit/i });
574
+ const input = screen.getByLabelText(/email/i);
575
+
576
+ // Acceptable
577
+ const image = screen.getByAltText('Logo');
578
+
579
+ // Last resort
580
+ const element = screen.getByTestId('custom-element');
581
+ ```
582
+
583
+ ## Example Test File
584
+
585
+ Complete example showing best practices:
586
+
587
+ ```typescript
588
+ import { describe, it, expect, vi, beforeEach } from 'vitest';
589
+ import { render, screen, waitFor } from '@testing-library/react';
590
+ import userEvent from '@testing-library/user-event';
591
+ import JobSubmission from './JobSubmission';
592
+
593
+ vi.mock('../../api/client', () => ({
594
+ submitTranscription: vi.fn(),
595
+ }));
596
+
597
+ import { submitTranscription } from '../../api/client';
598
+
599
+ describe('JobSubmission', () => {
600
+ beforeEach(() => {
601
+ vi.clearAllMocks();
602
+ });
603
+
604
+ describe('Rendering', () => {
605
+ it('should render input and button', () => {
606
+ render(<JobSubmission />);
607
+
608
+ expect(screen.getByPlaceholderText(/youtube url/i)).toBeInTheDocument();
609
+ expect(screen.getByRole('button', { name: /transcribe/i })).toBeInTheDocument();
610
+ });
611
+ });
612
+
613
+ describe('User Interactions', () => {
614
+ it('should accept and submit valid URL', async () => {
615
+ const user = userEvent.setup();
616
+ const mockSubmit = vi.mocked(submitTranscription);
617
+ mockSubmit.mockResolvedValue({ job_id: '123', status: 'queued' });
618
+
619
+ render(<JobSubmission />);
620
+
621
+ const input = screen.getByPlaceholderText(/youtube url/i);
622
+ const button = screen.getByRole('button', { name: /transcribe/i });
623
+
624
+ await user.type(input, 'https://www.youtube.com/watch?v=...');
625
+ await user.click(button);
626
+
627
+ await waitFor(() => {
628
+ expect(mockSubmit).toHaveBeenCalledWith(
629
+ 'https://www.youtube.com/watch?v=...',
630
+ expect.any(Object)
631
+ );
632
+ });
633
+ });
634
+ });
635
+
636
+ describe('Error Handling', () => {
637
+ it('should show error for invalid URL', async () => {
638
+ const user = userEvent.setup();
639
+ render(<JobSubmission />);
640
+
641
+ const input = screen.getByPlaceholderText(/youtube url/i);
642
+ const button = screen.getByRole('button', { name: /transcribe/i });
643
+
644
+ await user.type(input, 'invalid-url');
645
+ await user.click(button);
646
+
647
+ await waitFor(() => {
648
+ expect(screen.getByText(/invalid/i)).toBeInTheDocument();
649
+ });
650
+ });
651
+ });
652
+ });
653
+ ```
docs/testing/overview.md ADDED
@@ -0,0 +1,315 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Testing Guide
2
+
3
+ Complete testing guide for the Rescored project.
4
+
5
+ ## Quick Start
6
+
7
+ ### Backend Tests
8
+
9
+ ```bash
10
+ cd backend
11
+ pip install -r requirements-test.txt
12
+ pytest --cov
13
+ ```
14
+
15
+ ### Frontend Tests
16
+
17
+ ```bash
18
+ cd frontend
19
+ npm install
20
+ npm test
21
+ ```
22
+
23
+ ## Testing Philosophy
24
+
25
+ Rescored follows these testing principles:
26
+
27
+ 1. **Test behavior, not implementation** - Verify what the code does, not how
28
+ 2. **Write tests that give confidence** - Focus on high-value tests that catch real bugs
29
+ 3. **Keep tests maintainable** - Tests should be easy to understand and modify
30
+ 4. **Test at the right level** - Unit tests for logic, integration tests for workflows
31
+ 5. **Fast feedback loops** - Tests should run quickly to enable rapid development
32
+
33
+ ## Test Suites
34
+
35
+ ### Backend Test Suite (`backend/tests/`)
36
+
37
+ - **Unit Tests** (`test_utils.py`) - URL validation, video availability checks
38
+ - **API Tests** (`test_api.py`) - FastAPI endpoints, WebSocket connections
39
+ - **Pipeline Tests** (`test_pipeline.py`) - Audio processing, transcription, MusicXML generation
40
+ - **Task Tests** (`test_tasks.py`) - Celery workers, job processing, progress updates
41
+
42
+ **Features**: Mocked external dependencies (yt-dlp, Redis, ML models), temporary file handling, parametrized tests, coverage reporting
43
+
44
+ ### Frontend Test Suite (`frontend/src/tests/`)
45
+
46
+ - **API Client Tests** (`api/client.test.ts`) - HTTP requests, WebSocket connections
47
+ - **Component Tests** (`components/`) - JobSubmission, NotationCanvas, PlaybackControls
48
+ - **Store Tests** (`store/useScoreStore.test.ts`) - Zustand state management
49
+
50
+ **Features**: React Testing Library, user event simulation, mocked VexFlow and Tone.js, coverage reporting
51
+
52
+ ## Coverage Goals
53
+
54
+ | Component | Target | Priority |
55
+ |-----------|--------|----------|
56
+ | Backend Utils | 90%+ | High |
57
+ | Backend Pipeline | 85%+ | Critical |
58
+ | Backend API | 80%+ | High |
59
+ | Frontend API Client | 85%+ | Critical |
60
+ | Frontend Components | 75%+ | High |
61
+ | Frontend Store | 80%+ | High |
62
+
63
+ ## Running Tests
64
+
65
+ ### Backend
66
+
67
+ ```bash
68
+ # Run all tests
69
+ pytest
70
+
71
+ # With coverage
72
+ pytest --cov --cov-report=html
73
+
74
+ # Specific tests
75
+ pytest tests/test_utils.py
76
+ pytest tests/test_utils.py::TestValidateYouTubeURL::test_valid_watch_url
77
+
78
+ # By category
79
+ pytest -m unit # Only unit tests
80
+ pytest -m integration # Only integration tests
81
+ pytest -m "not slow" # Exclude slow tests
82
+ pytest -m "not gpu" # Exclude GPU tests
83
+
84
+ # Debugging
85
+ pytest -vv # Verbose output
86
+ pytest -s # Show print statements
87
+ pytest --pdb # Drop into debugger on failure
88
+ pytest --lf # Run last failed tests
89
+ ```
90
+
91
+ ### Frontend
92
+
93
+ ```bash
94
+ # Run all tests
95
+ npm test
96
+
97
+ # Watch mode
98
+ npm test -- --watch
99
+
100
+ # With UI
101
+ npm run test:ui
102
+
103
+ # With coverage
104
+ npm run test:coverage
105
+
106
+ # Specific tests
107
+ npm test -- src/tests/api/client.test.ts
108
+ npm test -- --grep "JobSubmission"
109
+ ```
110
+
111
+ ## Test Structure
112
+
113
+ ### Backend
114
+
115
+ ```
116
+ backend/tests/
117
+ ├── conftest.py # Shared fixtures (temp dirs, mock Redis, sample files)
118
+ ├── test_utils.py # Utility function tests
119
+ ├── test_api.py # API endpoint tests
120
+ ├── test_pipeline.py # Audio processing tests
121
+ └── test_tasks.py # Celery task tests
122
+ ```
123
+
124
+ ### Frontend
125
+
126
+ ```
127
+ frontend/src/tests/
128
+ ├── setup.ts # Test configuration (mocks for VexFlow, Tone.js, WebSocket)
129
+ ├── fixtures.ts # Shared test data (MusicXML, job responses, etc.)
130
+ ├── api/client.test.ts
131
+ ├── components/
132
+ │ ├── JobSubmission.test.tsx
133
+ │ ├── NotationCanvas.test.tsx
134
+ │ └── PlaybackControls.test.tsx
135
+ └── store/useScoreStore.test.ts
136
+ ```
137
+
138
+ ## Common Patterns
139
+
140
+ ### Backend Testing
141
+
142
+ ```python
143
+ # Mock external services
144
+ @patch('pipeline.yt_dlp.YoutubeDL')
145
+ def test_download_audio(mock_ydl_class, temp_storage_dir):
146
+ mock_ydl = MagicMock()
147
+ mock_ydl_class.return_value.__enter__.return_value = mock_ydl
148
+
149
+ result = download_audio("https://youtube.com/...", temp_storage_dir)
150
+
151
+ assert result.exists()
152
+ assert result.suffix == ".wav"
153
+
154
+ # Test API endpoints
155
+ def test_submit_transcription(test_client):
156
+ response = test_client.post(
157
+ "/api/v1/transcribe",
158
+ json={"youtube_url": "https://www.youtube.com/watch?v=..."}
159
+ )
160
+
161
+ assert response.status_code == 201
162
+ assert "job_id" in response.json()
163
+
164
+ # Parametrized tests
165
+ @pytest.mark.parametrize("url,expected_valid", [
166
+ ("https://www.youtube.com/watch?v=dQw4w9WgXcQ", True),
167
+ ("https://vimeo.com/12345", False),
168
+ ])
169
+ def test_url_validation(url, expected_valid):
170
+ is_valid, _ = validate_youtube_url(url)
171
+ assert is_valid == expected_valid
172
+ ```
173
+
174
+ ### Frontend Testing
175
+
176
+ ```typescript
177
+ // Test components with user interaction
178
+ it('should submit form', async () => {
179
+ const user = userEvent.setup();
180
+ const onSubmit = vi.fn();
181
+
182
+ render(<JobSubmission onSubmit={onSubmit} />);
183
+
184
+ const input = screen.getByPlaceholderText(/youtube url/i);
185
+ await user.type(input, 'https://www.youtube.com/watch?v=...');
186
+
187
+ const button = screen.getByRole('button', { name: /submit/i });
188
+ await user.click(button);
189
+
190
+ await waitFor(() => {
191
+ expect(onSubmit).toHaveBeenCalled();
192
+ });
193
+ });
194
+
195
+ // Mock API calls
196
+ vi.mock('../../api/client', () => ({
197
+ submitTranscription: vi.fn(),
198
+ }));
199
+
200
+ it('should call API', async () => {
201
+ const mockSubmit = vi.mocked(submitTranscription);
202
+ mockSubmit.mockResolvedValue({ job_id: '123' });
203
+
204
+ // Test component that uses submitTranscription
205
+ // ...
206
+ });
207
+
208
+ // Test store
209
+ it('should update store', () => {
210
+ const { result } = renderHook(() => useScoreStore());
211
+
212
+ act(() => {
213
+ result.current.setMusicXML('<musicxml>...</musicxml>');
214
+ });
215
+
216
+ expect(result.current.musicXML).toBe('<musicxml>...</musicxml>');
217
+ });
218
+ ```
219
+
220
+ ## Mocking Strategy
221
+
222
+ ### Backend
223
+ - **External Services**: Mock yt-dlp, Redis, Celery
224
+ - **ML Models**: Mock Demucs and basic-pitch for fast tests
225
+ - **File System**: Use temporary directories
226
+
227
+ ### Frontend
228
+ - **API Calls**: Mock fetch with vitest
229
+ - **WebSockets**: Mock WebSocket connections
230
+ - **Browser APIs**: Mock Canvas, Audio, localStorage
231
+ - **Libraries**: Mock VexFlow, Tone.js
232
+
233
+ ## Best Practices
234
+
235
+ ### General
236
+ 1. ✅ Write descriptive test names that explain the scenario
237
+ 2. ✅ Keep tests simple and focused (one thing per test)
238
+ 3. ✅ Use Arrange-Act-Assert structure
239
+ 4. ✅ Make tests independent (no shared state)
240
+ 5. ✅ Clean up resources (files, connections, timers)
241
+ 6. ✅ Mock external dependencies
242
+ 7. ✅ Add tests when fixing bugs
243
+ 8. ✅ Keep test code as clean as production code
244
+
245
+ ### Backend-Specific
246
+ - Use pytest fixtures for shared setup
247
+ - Mock yt-dlp, Redis, Celery, ML models
248
+ - Use temporary directories for file operations
249
+ - Mark slow/GPU tests with `@pytest.mark.slow` and `@pytest.mark.gpu`
250
+ - Test both success and error paths
251
+
252
+ ### Frontend-Specific
253
+ - Test user behavior, not implementation details
254
+ - Use accessible queries: `getByRole`, `getByLabelText` (not `getByTestId`)
255
+ - Mock API calls and WebSocket connections
256
+ - Test loading states and error handling
257
+ - Clean up side effects (timers, event listeners)
258
+
259
+ ## Troubleshooting
260
+
261
+ ### Backend
262
+
263
+ **Import errors**
264
+ ```bash
265
+ export PYTHONPATH="${PYTHONPATH}:$(pwd)"
266
+ ```
267
+
268
+ **Redis connection errors** - Always mock Redis unless testing Redis specifically
269
+
270
+ **GPU tests failing** - Mark with `@pytest.mark.gpu` and skip if unavailable
271
+
272
+ ### Frontend
273
+
274
+ **Canvas errors** - Mock canvas context in `setup.ts`
275
+
276
+ **WebSocket errors** - Mock WebSocket in `setup.ts`
277
+
278
+ **Module import errors** - Use `vi.mock()` at top of test file
279
+
280
+ **Async timeouts** - Increase timeout: `it('test', async () => { ... }, { timeout: 10000 })`
281
+
282
+ ## Test Performance
283
+
284
+ **Benchmarks:**
285
+ - Unit tests: < 100ms each
286
+ - Full backend suite: < 30 seconds
287
+ - Full frontend suite: < 20 seconds
288
+
289
+ **Optimization:**
290
+ - Mock expensive operations (ML inference, network calls)
291
+ - Use test markers to skip slow tests during development
292
+ - Parallelize tests (pytest-xdist for backend, vitest default)
293
+ - Cache expensive fixtures
294
+
295
+ ## CI/CD Integration
296
+
297
+ Tests run automatically on:
298
+ - **Pull Requests** - All tests must pass
299
+ - **Main Branch** - Full suite including slow tests
300
+ - **Nightly** - Extended test suite with real YouTube videos
301
+ - **Pre-release** - E2E tests, performance benchmarks
302
+
303
+ ## Detailed Guides
304
+
305
+ For detailed information, see:
306
+ - **[Backend Testing Guide](./backend-testing.md)** - In-depth backend testing patterns and examples
307
+ - **[Frontend Testing Guide](./frontend-testing.md)** - In-depth frontend testing patterns and examples
308
+ - **[Test Video Collection](./test-videos.md)** - Curated YouTube videos for testing transcription quality
309
+
310
+ ## Resources
311
+
312
+ - [pytest Documentation](https://docs.pytest.org/)
313
+ - [Vitest Documentation](https://vitest.dev/)
314
+ - [React Testing Library](https://testing-library.com/react)
315
+ - [FastAPI Testing](https://fastapi.tiangolo.com/tutorial/testing/)
docs/testing/test-videos.md ADDED
@@ -0,0 +1,371 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Test Video Collection
2
+
3
+ Curated collection of YouTube videos for testing transcription quality and edge cases.
4
+
5
+ ## Table of Contents
6
+
7
+ - [Simple Piano Tests](#simple-piano-tests)
8
+ - [Classical Piano](#classical-piano)
9
+ - [Pop Piano Covers](#pop-piano-covers)
10
+ - [Jazz Piano](#jazz-piano)
11
+ - [Complex/Challenging](#complexchallenging)
12
+ - [Edge Cases](#edge-cases)
13
+ - [Testing Criteria](#testing-criteria)
14
+
15
+ ## Simple Piano Tests
16
+
17
+ Use these for basic functionality and quick iteration.
18
+
19
+ ### 1. Twinkle Twinkle Little Star (Beginner Piano)
20
+ - **Duration**: ~1 minute
21
+ - **Tempo**: Slow (60-80 BPM)
22
+ - **Complexity**: Very simple melody, single notes
23
+ - **Expected Accuracy**: 95%+
24
+ - **Use For**: Smoke tests, basic functionality
25
+
26
+ ### 2. Mary Had a Little Lamb
27
+ - **Duration**: ~1 minute
28
+ - **Tempo**: Moderate (100 BPM)
29
+ - **Complexity**: Simple melody with consistent rhythm
30
+ - **Expected Accuracy**: 90%+
31
+ - **Use For**: Key signature detection, basic transcription
32
+
33
+ ### 3. Happy Birthday (Piano Solo)
34
+ - **Duration**: ~1 minute
35
+ - **Tempo**: Moderate (120 BPM)
36
+ - **Complexity**: Simple melody with occasional harmony
37
+ - **Expected Accuracy**: 85%+
38
+ - **Use For**: Time signature detection (3/4 time)
39
+
40
+ ## Classical Piano
41
+
42
+ Test with well-known classical pieces to verify quality.
43
+
44
+ ### 4. Chopin - Nocturne Op. 9 No. 2
45
+ - **Duration**: 4-5 minutes
46
+ - **Tempo**: Andante (60-70 BPM)
47
+ - **Complexity**: Expressive melody with arpeggiated accompaniment
48
+ - **Expected Accuracy**: 75-80%
49
+ - **Use For**:
50
+ - Pedal sustain handling
51
+ - Rubato tempo changes
52
+ - Expressive timing
53
+
54
+ **Challenges**:
55
+ - Overlapping notes from pedal
56
+ - Tempo fluctuations
57
+ - Decorative grace notes
58
+
59
+ ### 5. Beethoven - Für Elise
60
+ - **Duration**: 3 minutes
61
+ - **Tempo**: Poco moto (120-130 BPM)
62
+ - **Complexity**: Famous melody with consistent rhythm
63
+ - **Expected Accuracy**: 80-85%
64
+ - **Use For**:
65
+ - A minor key signature
66
+ - Repeated patterns
67
+ - Multiple sections
68
+
69
+ **Challenges**:
70
+ - Fast 16th note passages
71
+ - Dynamic contrasts
72
+
73
+ ### 6. Mozart - Piano Sonata K. 545 (1st Movement)
74
+ - **Duration**: 3-4 minutes
75
+ - **Tempo**: Allegro (120-140 BPM)
76
+ - **Complexity**: Clear melody with Alberti bass
77
+ - **Expected Accuracy**: 75-80%
78
+ - **Use For**:
79
+ - C major scale passages
80
+ - Alberti bass pattern recognition
81
+ - Classical form
82
+
83
+ **Challenges**:
84
+ - Fast running passages
85
+ - Hand coordination
86
+
87
+ ## Pop Piano Covers
88
+
89
+ Test with contemporary music to verify modern styles.
90
+
91
+ ### 7. Let It Be (Piano Cover)
92
+ - **Duration**: 3-4 minutes
93
+ - **Tempo**: Moderate (76 BPM)
94
+ - **Complexity**: Block chords with melody
95
+ - **Expected Accuracy**: 70-75%
96
+ - **Use For**:
97
+ - Chord detection
98
+ - Popular music transcription
99
+ - Mixed rhythm patterns
100
+
101
+ **Challenges**:
102
+ - Dense chords
103
+ - Vocal line vs accompaniment
104
+
105
+ ### 8. Someone Like You (Piano Cover)
106
+ - **Duration**: 4-5 minutes
107
+ - **Tempo**: Slow (67 BPM)
108
+ - **Complexity**: Arpeggiated chords with melody
109
+ - **Expected Accuracy**: 70-75%
110
+ - **Use For**:
111
+ - Sustained notes
112
+ - Emotional expression
113
+ - Modern pop harmony
114
+
115
+ **Challenges**:
116
+ - Overlapping arpeggios
117
+ - Pedal sustain
118
+
119
+ ### 9. River Flows in You (Original Piano)
120
+ - **Duration**: 3-4 minutes
121
+ - **Tempo**: Moderato (110 BPM)
122
+ - **Complexity**: Flowing arpeggios with melody
123
+ - **Expected Accuracy**: 75-80%
124
+ - **Use For**:
125
+ - Continuous motion
126
+ - Pattern recognition
127
+ - Popular instrumental
128
+
129
+ **Challenges**:
130
+ - Rapid note sequences
131
+ - Consistent texture
132
+
133
+ ## Jazz Piano
134
+
135
+ Test improvisation and complex harmony.
136
+
137
+ ### 10. Bill Evans - Waltz for Debby
138
+ - **Duration**: 5-7 minutes
139
+ - **Tempo**: Moderate waltz (140-160 BPM)
140
+ - **Complexity**: Jazz voicings, walking bass, improvisation
141
+ - **Expected Accuracy**: 60-70%
142
+ - **Use For**:
143
+ - Jazz harmony
144
+ - 3/4 time signature
145
+ - Complex chord voicings
146
+
147
+ **Challenges**:
148
+ - Extended chords (7ths, 9ths, 11ths)
149
+ - Improvised passages
150
+ - Swing feel
151
+
152
+ ### 11. Oscar Peterson - C Jam Blues
153
+ - **Duration**: 3-4 minutes
154
+ - **Tempo**: Fast (200+ BPM)
155
+ - **Complexity**: Blues progression with virtuosic runs
156
+ - **Expected Accuracy**: 55-65%
157
+ - **Use For**:
158
+ - Fast tempo handling
159
+ - Blues scale
160
+ - Virtuosic passages
161
+
162
+ **Challenges**:
163
+ - Extremely fast notes
164
+ - Grace notes and ornaments
165
+ - Complex rhythm
166
+
167
+ ## Complex/Challenging
168
+
169
+ Stress tests for the transcription system.
170
+
171
+ ### 12. Flight of the Bumblebee (Piano)
172
+ - **Duration**: 1-2 minutes
173
+ - **Tempo**: Presto (170-200 BPM)
174
+ - **Complexity**: Extremely fast chromatic runs
175
+ - **Expected Accuracy**: 50-60%
176
+ - **Use For**:
177
+ - Stress testing
178
+ - Fast passage detection
179
+ - Chromatic scales
180
+
181
+ **Challenges**:
182
+ - Very fast notes (32nd notes)
183
+ - Chromatic passages
184
+ - Continuous motion
185
+
186
+ ### 13. Liszt - La Campanella
187
+ - **Duration**: 4-5 minutes
188
+ - **Tempo**: Allegretto (120 BPM)
189
+ - **Complexity**: Virtuosic with wide leaps and rapid passages
190
+ - **Expected Accuracy**: 55-65%
191
+ - **Use For**:
192
+ - Wide register jumps
193
+ - Repeated notes
194
+ - Virtuosic technique
195
+
196
+ **Challenges**:
197
+ - Octave leaps
198
+ - Repeated staccato notes
199
+ - Ornamentation
200
+
201
+ ### 14. Rachmaninoff - Prelude in C# Minor
202
+ - **Duration**: 3-4 minutes
203
+ - **Tempo**: Lento (60 BPM) to Agitato
204
+ - **Complexity**: Dense chords, dramatic dynamics
205
+ - **Expected Accuracy**: 60-70%
206
+ - **Use For**:
207
+ - Heavy chords
208
+ - Dramatic contrasts
209
+ - Multiple voices
210
+
211
+ **Challenges**:
212
+ - 6+ note chords
213
+ - Extreme dynamics
214
+ - Multiple simultaneous voices
215
+
216
+ ## Edge Cases
217
+
218
+ Special cases to test error handling and boundaries.
219
+
220
+ ### 15. Prepared Piano / Extended Techniques
221
+ - **Use For**: Testing unusual timbres
222
+ - **Expected Accuracy**: 30-50%
223
+ - **Expected Behavior**: Should handle gracefully
224
+
225
+ ### 16. Piano with Background Noise
226
+ - **Use For**: Testing source separation quality
227
+ - **Expected Accuracy**: Variable
228
+ - **Expected Behavior**: Should isolate piano reasonably
229
+
230
+ ### 17. Poor Audio Quality
231
+ - **Use For**: Testing robustness
232
+ - **Expected Accuracy**: Reduced
233
+ - **Expected Behavior**: Should not crash
234
+
235
+ ### 18. Non-Piano Video (Should Fail Gracefully)
236
+ - **Examples**:
237
+ - Drum solo
238
+ - A cappella singing
239
+ - Electronic music
240
+ - **Expected Behavior**: Should complete but with poor results
241
+
242
+ ## Testing Criteria
243
+
244
+ ### Accuracy Metrics
245
+
246
+ **High Priority (Must Work Well)**:
247
+ - Note pitch accuracy: 85%+ for simple pieces
248
+ - Note onset timing: 80%+ within 50ms
249
+ - Note duration: 70%+ within one quantization unit
250
+
251
+ **Medium Priority (Should Work)**:
252
+ - Key signature detection: 80%+ accuracy
253
+ - Time signature detection: 75%+ accuracy
254
+ - Tempo detection: 70%+ within 10 BPM
255
+
256
+ **Low Priority (Nice to Have)**:
257
+ - Dynamic markings: Not implemented in MVP
258
+ - Articulations: Not implemented in MVP
259
+ - Pedal markings: Not implemented in MVP
260
+
261
+ ### Performance Benchmarks
262
+
263
+ | Video Duration | Target Processing Time (GPU) | Max Processing Time (CPU) |
264
+ |---------------|------------------------------|---------------------------|
265
+ | 1 minute | < 30 seconds | < 5 minutes |
266
+ | 3 minutes | < 2 minutes | < 10 minutes |
267
+ | 5 minutes | < 3 minutes | < 15 minutes |
268
+
269
+ ### Success Criteria
270
+
271
+ A transcription is considered successful if:
272
+
273
+ 1. **Job completes without error**: 95%+ success rate
274
+ 2. **Basic pitch accuracy**: 70%+ correct notes for simple pieces, 60%+ for complex
275
+ 3. **Playback sounds recognizable**: User can identify the piece
276
+ 4. **Usable for editing**: Notation is clean enough to edit and correct
277
+
278
+ ### Quality Grades
279
+
280
+ **A (90%+ accuracy)**:
281
+ - Simple melodies
282
+ - Clear recordings
283
+ - Slow to moderate tempo
284
+ - Minimal harmony
285
+
286
+ **B (75-89% accuracy)**:
287
+ - Standard classical pieces
288
+ - Good recordings
289
+ - Moderate tempo
290
+ - Some harmony
291
+
292
+ **C (60-74% accuracy)**:
293
+ - Complex pieces
294
+ - Standard recordings
295
+ - Fast tempo or complex harmony
296
+ - Multiple voices
297
+
298
+ **D (50-59% accuracy)**:
299
+ - Virtuosic pieces
300
+ - Poor recordings
301
+ - Very fast or complex
302
+ - Jazz/improvisation
303
+
304
+ **F (< 50% accuracy)**:
305
+ - Extended techniques
306
+ - Very poor quality
307
+ - Non-piano instruments
308
+ - Extreme complexity
309
+
310
+ ## Using Test Videos
311
+
312
+ ### Manual Testing
313
+
314
+ 1. Submit each video URL through the UI
315
+ 2. Wait for processing to complete
316
+ 3. Check for errors in each pipeline stage
317
+ 4. Download and inspect MusicXML output
318
+ 5. Load in MuseScore or similar to verify quality
319
+ 6. Note accuracy, timing issues, and artifacts
320
+
321
+ ### Automated Testing
322
+
323
+ ```python
324
+ # In tests/test_integration.py
325
+ @pytest.mark.parametrize("video_id,expected_grade", [
326
+ ("simple_melody", "A"),
327
+ ("fur_elise", "B"),
328
+ ("jazz_piece", "C"),
329
+ ])
330
+ def test_transcription_quality(video_id, expected_grade):
331
+ """Test transcription quality meets expectations."""
332
+ result = transcribe_video(video_id)
333
+
334
+ assert result['status'] == 'success'
335
+ accuracy = calculate_accuracy(result['musicxml'])
336
+ assert accuracy >= grade_threshold(expected_grade)
337
+ ```
338
+
339
+ ### Regression Testing
340
+
341
+ Maintain a suite of test videos and track accuracy over time:
342
+
343
+ ```bash
344
+ # Run regression test suite
345
+ python scripts/run_regression_tests.py
346
+
347
+ # Compare with baseline
348
+ python scripts/compare_results.py --baseline v1.0.0 --current HEAD
349
+ ```
350
+
351
+ ## Maintaining Test Collection
352
+
353
+ 1. **Add new test cases** when bugs are found
354
+ 2. **Update expected accuracy** as system improves
355
+ 3. **Remove broken links** and replace with alternatives
356
+ 4. **Document edge cases** that reveal system limitations
357
+ 5. **Share results** with team to track progress
358
+
359
+ ## Test Video Sources
360
+
361
+ When selecting test videos:
362
+
363
+ - ✅ Use videos with clear audio
364
+ - ✅ Prefer solo piano recordings
365
+ - ✅ Choose varied difficulty levels
366
+ - ✅ Include different musical styles
367
+ - ✅ Ensure videos are publicly accessible
368
+ - ✅ Respect copyright and fair use
369
+ - ❌ Avoid videos with talking/commentary
370
+ - ❌ Avoid poor audio quality unless testing robustness
371
+ - ❌ Don't use videos over 15 minutes (MVP limit)
frontend/.env.example ADDED
@@ -0,0 +1 @@
 
 
1
+ VITE_API_URL=http://localhost:8000
frontend/.gitignore ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Logs
2
+ logs
3
+ *.log
4
+ npm-debug.log*
5
+ yarn-debug.log*
6
+ yarn-error.log*
7
+ pnpm-debug.log*
8
+ lerna-debug.log*
9
+
10
+ node_modules
11
+ dist
12
+ dist-ssr
13
+ *.local
14
+
15
+ # Editor directories and files
16
+ .vscode/*
17
+ !.vscode/extensions.json
18
+ .idea
19
+ .DS_Store
20
+ *.suo
21
+ *.ntvs*
22
+ *.njsproj
23
+ *.sln
24
+ *.sw?
frontend/Dockerfile ADDED
@@ -0,0 +1,19 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ FROM node:20-alpine
2
+
3
+ # Set working directory
4
+ WORKDIR /app
5
+
6
+ # Copy package files
7
+ COPY package*.json ./
8
+
9
+ # Install dependencies
10
+ RUN npm install
11
+
12
+ # Copy application code
13
+ COPY . .
14
+
15
+ # Expose Vite dev server port
16
+ EXPOSE 5173
17
+
18
+ # Default command
19
+ CMD ["npm", "run", "dev", "--", "--host", "0.0.0.0"]
frontend/README.md ADDED
@@ -0,0 +1,73 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # React + TypeScript + Vite
2
+
3
+ This template provides a minimal setup to get React working in Vite with HMR and some ESLint rules.
4
+
5
+ Currently, two official plugins are available:
6
+
7
+ - [@vitejs/plugin-react](https://github.com/vitejs/vite-plugin-react/blob/main/packages/plugin-react) uses [Babel](https://babeljs.io/) (or [oxc](https://oxc.rs) when used in [rolldown-vite](https://vite.dev/guide/rolldown)) for Fast Refresh
8
+ - [@vitejs/plugin-react-swc](https://github.com/vitejs/vite-plugin-react/blob/main/packages/plugin-react-swc) uses [SWC](https://swc.rs/) for Fast Refresh
9
+
10
+ ## React Compiler
11
+
12
+ The React Compiler is not enabled on this template because of its impact on dev & build performances. To add it, see [this documentation](https://react.dev/learn/react-compiler/installation).
13
+
14
+ ## Expanding the ESLint configuration
15
+
16
+ If you are developing a production application, we recommend updating the configuration to enable type-aware lint rules:
17
+
18
+ ```js
19
+ export default defineConfig([
20
+ globalIgnores(['dist']),
21
+ {
22
+ files: ['**/*.{ts,tsx}'],
23
+ extends: [
24
+ // Other configs...
25
+
26
+ // Remove tseslint.configs.recommended and replace with this
27
+ tseslint.configs.recommendedTypeChecked,
28
+ // Alternatively, use this for stricter rules
29
+ tseslint.configs.strictTypeChecked,
30
+ // Optionally, add this for stylistic rules
31
+ tseslint.configs.stylisticTypeChecked,
32
+
33
+ // Other configs...
34
+ ],
35
+ languageOptions: {
36
+ parserOptions: {
37
+ project: ['./tsconfig.node.json', './tsconfig.app.json'],
38
+ tsconfigRootDir: import.meta.dirname,
39
+ },
40
+ // other options...
41
+ },
42
+ },
43
+ ])
44
+ ```
45
+
46
+ You can also install [eslint-plugin-react-x](https://github.com/Rel1cx/eslint-react/tree/main/packages/plugins/eslint-plugin-react-x) and [eslint-plugin-react-dom](https://github.com/Rel1cx/eslint-react/tree/main/packages/plugins/eslint-plugin-react-dom) for React-specific lint rules:
47
+
48
+ ```js
49
+ // eslint.config.js
50
+ import reactX from 'eslint-plugin-react-x'
51
+ import reactDom from 'eslint-plugin-react-dom'
52
+
53
+ export default defineConfig([
54
+ globalIgnores(['dist']),
55
+ {
56
+ files: ['**/*.{ts,tsx}'],
57
+ extends: [
58
+ // Other configs...
59
+ // Enable lint rules for React
60
+ reactX.configs['recommended-typescript'],
61
+ // Enable lint rules for React DOM
62
+ reactDom.configs.recommended,
63
+ ],
64
+ languageOptions: {
65
+ parserOptions: {
66
+ project: ['./tsconfig.node.json', './tsconfig.app.json'],
67
+ tsconfigRootDir: import.meta.dirname,
68
+ },
69
+ // other options...
70
+ },
71
+ },
72
+ ])
73
+ ```
frontend/eslint.config.js ADDED
@@ -0,0 +1,23 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ import js from '@eslint/js'
2
+ import globals from 'globals'
3
+ import reactHooks from 'eslint-plugin-react-hooks'
4
+ import reactRefresh from 'eslint-plugin-react-refresh'
5
+ import tseslint from 'typescript-eslint'
6
+ import { defineConfig, globalIgnores } from 'eslint/config'
7
+
8
+ export default defineConfig([
9
+ globalIgnores(['dist']),
10
+ {
11
+ files: ['**/*.{ts,tsx}'],
12
+ extends: [
13
+ js.configs.recommended,
14
+ tseslint.configs.recommended,
15
+ reactHooks.configs.flat.recommended,
16
+ reactRefresh.configs.vite,
17
+ ],
18
+ languageOptions: {
19
+ ecmaVersion: 2020,
20
+ globals: globals.browser,
21
+ },
22
+ },
23
+ ])
frontend/index.html ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ <!doctype html>
2
+ <html lang="en">
3
+ <head>
4
+ <meta charset="UTF-8" />
5
+ <link rel="icon" type="image/svg+xml" href="/vite.svg" />
6
+ <meta name="viewport" content="width=device-width, initial-scale=1.0" />
7
+ <title>frontend</title>
8
+ </head>
9
+ <body>
10
+ <div id="root"></div>
11
+ <script type="module" src="/src/main.tsx"></script>
12
+ </body>
13
+ </html>
frontend/package-lock.json ADDED
The diff for this file is too large to render. See raw diff
 
frontend/package.json ADDED
@@ -0,0 +1,44 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "name": "frontend",
3
+ "private": true,
4
+ "version": "0.0.0",
5
+ "type": "module",
6
+ "scripts": {
7
+ "dev": "vite",
8
+ "build": "tsc -b && vite build",
9
+ "lint": "eslint .",
10
+ "preview": "vite preview",
11
+ "test": "vitest",
12
+ "test:ui": "vitest --ui",
13
+ "test:coverage": "vitest --coverage"
14
+ },
15
+ "dependencies": {
16
+ "@xmldom/xmldom": "^0.8.11",
17
+ "react": "^19.2.0",
18
+ "react-dom": "^19.2.0",
19
+ "tone": "^15.1.3",
20
+ "vexflow": "^4.2.4",
21
+ "zustand": "^5.0.3"
22
+ },
23
+ "devDependencies": {
24
+ "@eslint/js": "^9.39.1",
25
+ "@testing-library/jest-dom": "^6.1.5",
26
+ "@testing-library/react": "^14.1.2",
27
+ "@testing-library/user-event": "^14.5.1",
28
+ "@types/node": "^24.10.1",
29
+ "@types/react": "^19.2.5",
30
+ "@types/react-dom": "^19.2.3",
31
+ "@vitejs/plugin-react": "^5.1.1",
32
+ "@vitest/ui": "^1.1.0",
33
+ "eslint": "^9.39.1",
34
+ "eslint-plugin-react-hooks": "^7.0.1",
35
+ "eslint-plugin-react-refresh": "^0.4.24",
36
+ "globals": "^16.5.0",
37
+ "jsdom": "^23.0.1",
38
+ "typescript": "~5.9.3",
39
+ "typescript-eslint": "^8.46.4",
40
+ "vite": "^7.2.4",
41
+ "vitest": "^1.1.0",
42
+ "@vitest/coverage-v8": "^1.1.0"
43
+ }
44
+ }
frontend/public/vite.svg ADDED
frontend/scripts/debug-parser.cjs ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ // Quick script to test the parser
2
+ const fs = require('fs');
3
+ const { DOMParser } = require('@xmldom/xmldom');
4
+
5
+ const xml = fs.readFileSync('../storage/outputs/497306b6-8e09-41c2-b8c7-0792dbd22022.musicxml', 'utf-8');
6
+ const parser = new DOMParser();
7
+ const doc = parser.parseFromString(xml, 'text/xml');
8
+
9
+ // Check what we're getting
10
+ const beats = doc.getElementsByTagName('beats')[0]?.textContent;
11
+ const beatType = doc.getElementsByTagName('beat-type')[0]?.textContent;
12
+ console.log('Time signature:', beats + '/' + beatType);
13
+ console.log('Divisions:', doc.getElementsByTagName('divisions')[0]?.textContent);
14
+ console.log('Key (fifths):', doc.getElementsByTagName('fifths')[0]?.textContent);
15
+
16
+ const soundEl = doc.getElementsByTagName('sound')[0];
17
+ console.log('Tempo:', soundEl?.getAttribute('tempo'));
18
+
19
+ const measures = doc.getElementsByTagName('measure');
20
+ console.log('\nTotal measures:', measures.length);
21
+ console.log('First 10 measures:');
22
+
23
+ const divisions = parseInt(doc.getElementsByTagName('divisions')[0]?.textContent || '10080');
24
+
25
+ for (let i = 0; i < Math.min(10, measures.length); i++) {
26
+ const m = measures[i];
27
+ const notes = m.getElementsByTagName('note');
28
+ const pitchedNotes = [];
29
+ let totalDuration = 0;
30
+
31
+ for (let n = 0; n < notes.length; n++) {
32
+ const note = notes[n];
33
+ const isRest = note.getElementsByTagName('rest').length > 0;
34
+ const duration = parseInt(note.getElementsByTagName('duration')[0]?.textContent || '0');
35
+ totalDuration += duration;
36
+
37
+ if (!isRest) {
38
+ pitchedNotes.push(note);
39
+ }
40
+ }
41
+
42
+ const expectedDuration = divisions * 4; // 4 beats in 4/4
43
+ const durationMatch = totalDuration === expectedDuration ? '✓' : `✗ (expected ${expectedDuration}, got ${totalDuration})`;
44
+
45
+ console.log(` Measure ${m.getAttribute('number')}: ${notes.length} total notes, ${pitchedNotes.length} pitched notes, duration ${durationMatch}`);
46
+
47
+ // Show first 3 pitched notes
48
+ for (let j = 0; j < Math.min(3, pitchedNotes.length); j++) {
49
+ const note = pitchedNotes[j];
50
+ const pitch = note.getElementsByTagName('step')[0]?.textContent;
51
+ const octave = note.getElementsByTagName('octave')[0]?.textContent;
52
+ const duration = note.getElementsByTagName('duration')[0]?.textContent;
53
+ const type = note.getElementsByTagName('type')[0]?.textContent;
54
+ const alter = note.getElementsByTagName('alter')[0]?.textContent;
55
+ const accidental = alter === '1' ? '#' : alter === '-1' ? 'b' : '';
56
+ console.log(` Note ${j+1}: ${pitch}${accidental}${octave}, duration=${duration}, type=${type}`);
57
+ }
58
+ }
frontend/scripts/test-chord-handling.cjs ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ const fs = require('fs');
2
+ const { DOMParser } = require('@xmldom/xmldom');
3
+
4
+ const xml = fs.readFileSync('../storage/outputs/497306b6-8e09-41c2-b8c7-0792dbd22022.musicxml', 'utf-8');
5
+ const parser = new DOMParser();
6
+ const doc = parser.parseFromString(xml, 'text/xml');
7
+
8
+ // Look at measure 4 which has chords
9
+ const measures = doc.getElementsByTagName('measure');
10
+ const measure4 = measures[3]; // 0-indexed
11
+
12
+ console.log('=== MEASURE 4 ANALYSIS ===');
13
+ const notes = measure4.getElementsByTagName('note');
14
+ console.log('Total note elements:', notes.length);
15
+
16
+ let noteCount = 0;
17
+ let totalDuration = 0;
18
+
19
+ for (let i = 0; i < notes.length; i++) {
20
+ const note = notes[i];
21
+ const isChord = note.getElementsByTagName('chord').length > 0;
22
+ const isRest = note.getElementsByTagName('rest').length > 0;
23
+ const duration = parseInt(note.getElementsByTagName('duration')[0]?.textContent || '0');
24
+
25
+ // Chord notes share duration with previous note
26
+ if (!isChord) {
27
+ totalDuration += duration;
28
+ }
29
+
30
+ if (!isRest) {
31
+ const pitch = note.getElementsByTagName('step')[0]?.textContent;
32
+ const octave = note.getElementsByTagName('octave')[0]?.textContent;
33
+ const type = note.getElementsByTagName('type')[0]?.textContent;
34
+ noteCount++;
35
+ console.log('Note', noteCount, ':', pitch + octave, '(' + type + '), duration=' + duration, ', chord=' + isChord);
36
+ }
37
+ }
38
+
39
+ const divisions = 10080;
40
+ const expected = divisions * 4;
41
+ console.log('\nTotal duration:', totalDuration, '(expected', expected + ')');
42
+ console.log('Duration ratio:', (totalDuration / expected).toFixed(2) + 'x');
frontend/src/App.css ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ * {
2
+ box-sizing: border-box;
3
+ }
4
+
5
+ body {
6
+ margin: 0;
7
+ font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', 'Roboto', 'Oxygen',
8
+ 'Ubuntu', 'Cantarell', 'Fira Sans', 'Droid Sans', 'Helvetica Neue',
9
+ sans-serif;
10
+ -webkit-font-smoothing: antialiased;
11
+ -moz-osx-font-smoothing: grayscale;
12
+ }
13
+
14
+ .app {
15
+ min-height: 100vh;
16
+ background: linear-gradient(135deg, #667eea 0%, #764ba2 100%);
17
+ padding: 2rem;
18
+ }
19
+
20
+ .back-button {
21
+ margin-bottom: 1rem;
22
+ background: white;
23
+ color: #667eea;
24
+ border: 2px solid #667eea;
25
+ }
26
+
27
+ .back-button:hover {
28
+ background: #667eea;
29
+ color: white;
30
+ }
frontend/src/App.tsx ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ /**
2
+ * Main application component.
3
+ */
4
+ import { useState } from 'react';
5
+ import { JobSubmission } from './components/JobSubmission';
6
+ import { ScoreEditor } from './components/ScoreEditor';
7
+ import './App.css';
8
+
9
+ function App() {
10
+ const [currentJobId, setCurrentJobId] = useState<string | null>(null);
11
+
12
+ const handleJobComplete = (jobId: string) => {
13
+ setCurrentJobId(jobId);
14
+ };
15
+
16
+ const handleReset = () => {
17
+ setCurrentJobId(null);
18
+ };
19
+
20
+ return (
21
+ <div className="app">
22
+ {!currentJobId ? (
23
+ <JobSubmission onComplete={handleJobComplete} />
24
+ ) : (
25
+ <div>
26
+ <button className="back-button" onClick={handleReset}>
27
+ ← New Transcription
28
+ </button>
29
+ <ScoreEditor jobId={currentJobId} />
30
+ </div>
31
+ )}
32
+ </div>
33
+ );
34
+ }
35
+
36
+ export default App;
frontend/src/api/client.ts ADDED
@@ -0,0 +1,143 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ /**
2
+ * API client for Rescored backend.
3
+ */
4
+
5
+ const API_BASE_URL = import.meta.env.VITE_API_URL || 'http://localhost:8000';
6
+ const WS_BASE_URL = API_BASE_URL.replace('http', 'ws');
7
+
8
+ export interface TranscribeRequest {
9
+ youtube_url: string;
10
+ options?: {
11
+ instruments: string[];
12
+ };
13
+ }
14
+
15
+ export interface TranscribeResponse {
16
+ job_id: string;
17
+ status: string;
18
+ created_at: string;
19
+ estimated_duration_seconds: number;
20
+ websocket_url: string;
21
+ }
22
+
23
+ export interface JobStatus {
24
+ job_id: string;
25
+ status: 'queued' | 'processing' | 'completed' | 'failed';
26
+ progress: number;
27
+ current_stage: string | null;
28
+ status_message: string | null;
29
+ created_at: string;
30
+ started_at: string | null;
31
+ completed_at: string | null;
32
+ failed_at: string | null;
33
+ error: { message: string; retryable: boolean } | null;
34
+ result_url: string | null;
35
+ }
36
+
37
+ export interface ProgressUpdate {
38
+ type: 'progress' | 'completed' | 'error' | 'heartbeat';
39
+ job_id: string;
40
+ progress?: number;
41
+ stage?: string;
42
+ message?: string;
43
+ result_url?: string;
44
+ error?: { message: string; retryable: boolean };
45
+ timestamp: string;
46
+ }
47
+
48
+ export class RescoredAPI {
49
+ private baseURL = API_BASE_URL;
50
+ private wsBaseURL = WS_BASE_URL;
51
+
52
+ async submitJob(youtubeURL: string, options?: { instruments?: string[] }): Promise<TranscribeResponse> {
53
+ const response = await fetch(`${this.baseURL}/api/v1/transcribe`, {
54
+ method: 'POST',
55
+ headers: {
56
+ 'Content-Type': 'application/json',
57
+ },
58
+ body: JSON.stringify({
59
+ youtube_url: youtubeURL,
60
+ options: options ?? { instruments: ['piano'] },
61
+ }),
62
+ });
63
+
64
+ if (!response.ok) {
65
+ const error = await response.json();
66
+ throw new Error(error.detail || 'Failed to submit job');
67
+ }
68
+
69
+ return response.json();
70
+ }
71
+
72
+ async getJobStatus(jobId: string): Promise<JobStatus> {
73
+ const response = await fetch(`${this.baseURL}/api/v1/jobs/${jobId}`);
74
+
75
+ if (!response.ok) {
76
+ throw new Error('Failed to fetch job status');
77
+ }
78
+
79
+ return response.json();
80
+ }
81
+
82
+ async getScore(jobId: string): Promise<string> {
83
+ const response = await fetch(`${this.baseURL}/api/v1/scores/${jobId}`);
84
+
85
+ if (!response.ok) {
86
+ throw new Error('Failed to fetch score');
87
+ }
88
+
89
+ return response.text();
90
+ }
91
+
92
+ connectWebSocket(
93
+ jobId: string,
94
+ onMessage: (update: ProgressUpdate) => void,
95
+ onError?: (error: Event) => void,
96
+ onClose?: () => void
97
+ ): WebSocket {
98
+ const ws = new WebSocket(`${this.wsBaseURL}/api/v1/jobs/${jobId}/stream`);
99
+
100
+ ws.onmessage = (event) => {
101
+ const update: ProgressUpdate = JSON.parse(event.data);
102
+ onMessage(update);
103
+
104
+ // Send pong for heartbeat
105
+ if (update.type === 'heartbeat') {
106
+ ws.send(JSON.stringify({ type: 'pong', timestamp: new Date().toISOString() }));
107
+ }
108
+ };
109
+
110
+ if (onError) {
111
+ ws.onerror = onError;
112
+ }
113
+
114
+ if (onClose) {
115
+ ws.onclose = onClose;
116
+ }
117
+
118
+ return ws;
119
+ }
120
+
121
+ getScoreURL(jobId: string): string {
122
+ return `${this.baseURL}/api/v1/scores/${jobId}`;
123
+ }
124
+ }
125
+
126
+ export const api = new RescoredAPI();
127
+
128
+ // Compatibility function wrappers for tests
129
+ export async function submitTranscription(
130
+ youtubeURL: string,
131
+ options?: { instruments?: string[] }
132
+ ) {
133
+ // Delegate to class method; include options if provided
134
+ return api.submitJob(youtubeURL, options);
135
+ }
136
+
137
+ export async function getJobStatus(jobId: string) {
138
+ return api.getJobStatus(jobId);
139
+ }
140
+
141
+ export async function downloadScore(jobId: string) {
142
+ return api.getScore(jobId);
143
+ }
frontend/src/assets/react.svg ADDED
frontend/src/components/JobSubmission.css ADDED
@@ -0,0 +1,83 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ .job-submission {
2
+ max-width: 600px;
3
+ margin: 0 auto;
4
+ padding: 2rem;
5
+ }
6
+
7
+ .job-submission h1 {
8
+ font-size: 2rem;
9
+ margin-bottom: 0.5rem;
10
+ }
11
+
12
+ .form-group {
13
+ margin-bottom: 1rem;
14
+ }
15
+
16
+ .form-group label {
17
+ display: block;
18
+ margin-bottom: 0.5rem;
19
+ font-weight: bold;
20
+ }
21
+
22
+ .form-group input {
23
+ width: 100%;
24
+ padding: 0.5rem;
25
+ border: 1px solid #ccc;
26
+ border-radius: 4px;
27
+ font-size: 1rem;
28
+ }
29
+
30
+ button {
31
+ padding: 0.75rem 1.5rem;
32
+ background-color: #007bff;
33
+ color: white;
34
+ border: none;
35
+ border-radius: 4px;
36
+ font-size: 1rem;
37
+ cursor: pointer;
38
+ }
39
+
40
+ button:hover {
41
+ background-color: #0056b3;
42
+ }
43
+
44
+ .progress-container {
45
+ text-align: center;
46
+ }
47
+
48
+ .progress-bar {
49
+ width: 100%;
50
+ height: 30px;
51
+ background-color: #f0f0f0;
52
+ border-radius: 15px;
53
+ overflow: hidden;
54
+ margin: 1rem 0;
55
+ }
56
+
57
+ .progress-fill {
58
+ height: 100%;
59
+ background-color: #28a745;
60
+ transition: width 0.3s ease;
61
+ }
62
+
63
+ .progress-text {
64
+ color: #666;
65
+ font-size: 0.9rem;
66
+ }
67
+
68
+ .success-message,
69
+ .error-message {
70
+ text-align: center;
71
+ padding: 2rem;
72
+ border-radius: 8px;
73
+ }
74
+
75
+ .success-message {
76
+ background-color: #d4edda;
77
+ color: #155724;
78
+ }
79
+
80
+ .error-message {
81
+ background-color: #f8d7da;
82
+ color: #721c24;
83
+ }