
Batch Processing Tests

This directory contains comprehensive tests for the batch processing optimization feature.

Test Files

Unit Tests

test_model_manager.py - Tests for model loading and caching

  • ModelCache class initialization
  • Model loading (Aeon, Paladin, CTransPath, Optimus)
  • GPU type detection (T4 vs A100)
  • Aggressive memory management vs caching
  • Model cleanup functionality
  • Paladin lazy-loading and caching
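The caching tests follow a common pattern: load once, then assert that a second request returns the cached object without re-running the loader. The `ModelCache` class below is a simplified stand-in used only to illustrate that pattern; the real class lives in `mosaic.model_manager` and its API may differ:

```python
# Illustrative stand-in for mosaic.model_manager.ModelCache -- the real
# class may have a different interface. Shown only to sketch the tests.
class ModelCache:
    def __init__(self):
        self._models = {}

    def get_model(self, name, loader):
        # Lazy-load and cache: only invoke the loader on a cache miss.
        if name not in self._models:
            self._models[name] = loader()
        return self._models[name]

    def cleanup(self):
        self._models.clear()


def test_model_is_loaded_once():
    cache = ModelCache()
    calls = []
    loader = lambda: calls.append(1) or object()
    first = cache.get_model("paladin", loader)
    second = cache.get_model("paladin", loader)
    assert first is second   # cached object is reused
    assert len(calls) == 1   # loader ran exactly once


def test_cleanup_empties_cache():
    cache = ModelCache()
    cache.get_model("ctranspath", lambda: object())
    cache.cleanup()
    assert cache._models == {}
```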

Integration Tests

test_batch_analysis.py - Tests for batch processing coordinator

  • End-to-end batch analysis workflow
  • Batch processing with multiple slides
  • Error handling (individual slide failures)
  • Cleanup on errors
  • Progress tracking
  • Multi-slide result aggregation

Regression Tests

test_regression_single_slide.py - Ensures single-slide mode is unchanged

  • Single-slide analysis behavior
  • Gradio UI single-slide path
  • API backward compatibility
  • Function signatures unchanged
  • Return types unchanged
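Signature-stability checks like these typically pin down parameter names and defaults with `inspect.signature`, so an accidental API change fails fast. The function below is a local stand-in, not the actual mosaic entry point (whose real name and signature may differ):

```python
import inspect

# Hypothetical stand-in for the real single-slide entry point; the actual
# function lives in the mosaic package and its true signature may differ.
def analyze_slide(slide_path, model="aeon", num_workers=4):
    return {"slide": slide_path, "model": model}


def test_signature_unchanged():
    # Pin parameter names and defaults so API drift is caught in CI.
    params = inspect.signature(analyze_slide).parameters
    assert list(params) == ["slide_path", "model", "num_workers"]
    assert params["model"].default == "aeon"
    assert params["num_workers"].default == 4


def test_return_type_unchanged():
    assert isinstance(analyze_slide("a.svs"), dict)
```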

Performance Benchmarks

benchmark_batch_performance.py - Performance comparison tool

  • Sequential processing (old method) benchmark
  • Batch processing (new method) benchmark
  • Performance comparison and reporting
  • Memory usage tracking
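When the benchmark needs to run off-GPU (e.g. in CI without CUDA), host-side peak memory can be sampled with the standard library. This is a sketch under that assumption, not necessarily how `benchmark_batch_performance.py` measures memory:

```python
import tracemalloc

def run_with_peak_memory(fn, *args, **kwargs):
    """Run fn and return (result, peak_bytes) for host-side allocations."""
    tracemalloc.start()
    try:
        result = fn(*args, **kwargs)
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, peak

# Example: a toy workload standing in for slide preprocessing.
result, peak = run_with_peak_memory(lambda: [0] * 1_000_000)
print(f"peak host memory: {peak / 1e6:.1f} MB")
```

On GPU, `torch.cuda.max_memory_allocated()` plays the analogous role for device memory.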

Running Tests

Run All Tests

# From repository root
pytest tests/test_model_manager.py tests/test_batch_analysis.py tests/test_regression_single_slide.py -v

Run Specific Test Files

# Unit tests only
pytest tests/test_model_manager.py -v

# Integration tests only
pytest tests/test_batch_analysis.py -v

# Regression tests only
pytest tests/test_regression_single_slide.py -v

Run Specific Test Classes or Functions

# Test specific class
pytest tests/test_model_manager.py::TestModelCache -v

# Test specific function
pytest tests/test_model_manager.py::TestModelCache::test_model_cache_initialization -v

Run with Coverage

pytest tests/ --cov=mosaic.model_manager --cov=mosaic.batch_analysis --cov-report=html

Running Performance Benchmarks

Basic Benchmark (3 slides with default settings)

python tests/benchmark_batch_performance.py --slides slide1.svs slide2.svs slide3.svs

Benchmark with CSV Settings

python tests/benchmark_batch_performance.py --slide-csv test_slides.csv

Benchmark Batch Mode Only (Skip Sequential)

Useful for quick testing when you don't need a comparison:


python tests/benchmark_batch_performance.py --slides slide1.svs slide2.svs --skip-sequential

Save Benchmark Results

python tests/benchmark_batch_performance.py \
    --slide-csv test_slides.csv \
    --output benchmark_results.json

Benchmark Options

  • --slides: List of slide paths (e.g., slide1.svs slide2.svs)
  • --slide-csv: Path to CSV with slide settings
  • --num-workers: Number of CPU workers for data loading (default: 4)
  • --skip-sequential: Skip sequential benchmark (faster)
  • --output: Save results to JSON file

Expected Test Results

Unit Tests

  • test_model_manager.py: Should pass all tests
  • Tests model loading, caching, cleanup
  • Tests GPU detection and adaptive memory management

Integration Tests

  • test_batch_analysis.py: Should pass all tests
  • Tests end-to-end batch workflow
  • Tests error handling and recovery

Regression Tests

  • test_regression_single_slide.py: Should pass all tests
  • Ensures backward compatibility
  • Single-slide behavior unchanged

Performance Benchmarks

Expected performance improvements:

  • Speedup: 1.25x - 1.45x (25-45% faster)
  • Time saved: Depends on batch size and model loading overhead
  • Memory: Similar peak memory to single-slide (~9-15GB on typical slides)
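The speedup and improvement figures in the benchmark report are related as follows (using the numbers from the example output):

```python
sequential, batch = 450.23, 300.45  # seconds, from a 10-slide run

speedup = sequential / batch           # how many times faster
time_saved = sequential - batch        # absolute seconds saved
improvement = time_saved / sequential  # fraction of sequential time cut

print(f"Speedup: {speedup:.2f}x")                # 1.50x
print(f"Time saved: {time_saved:.2f}s")          # 149.78s
print(f"Improvement: {improvement:.1%} faster")  # 33.3% faster
```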

Example output:

```
PERFORMANCE COMPARISON

Number of slides: 10

Sequential processing: 450.23s
Batch processing:      300.45s

Time saved:   149.78s
Speedup:      1.50x
Improvement:  33.3% faster

Sequential peak memory: 12.45 GB
Batch peak memory:      13.12 GB
Memory difference:      +0.67 GB
```


Test Coverage Goals

  • Model Manager: >90% coverage
  • Batch Analysis: >85% coverage
  • Regression Tests: 100% of critical paths
  • Integration Tests: All major workflows
Troubleshooting

Tests Fail Due to Missing Models

If tests fail with "model not found" errors:

```bash
# Download models first
python -m mosaic.gradio_app --help
# This will trigger model download
```

CUDA Out of Memory Errors

If benchmarks fail with OOM:

  • Reduce number of slides in benchmark
  • Use --skip-sequential to reduce memory pressure
  • On a T4 GPU, tests use aggressive memory management automatically

Import Errors

Ensure mosaic package is installed:

pip install -e .

Contributing

When adding new features to batch processing:

  1. Add unit tests to test_model_manager.py or test_batch_analysis.py
  2. Add regression tests if modifying existing functions
  3. Run benchmarks to verify performance improvements
  4. Update this README with new test information

CI/CD Integration

To integrate with CI/CD:

# Example GitHub Actions workflow
- name: Run Batch Processing Tests
  run: |
    pytest tests/test_model_manager.py tests/test_batch_analysis.py tests/test_regression_single_slide.py -v --cov

For performance regression detection:

- name: Performance Benchmark
  run: |
    python tests/benchmark_batch_performance.py --slide-csv ci_test_slides.csv --output benchmark.json
    python scripts/check_performance_regression.py benchmark.json
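`scripts/check_performance_regression.py` is not shown in this repo snapshot; a minimal version could look like the sketch below, assuming the benchmark JSON stores the timings under the (hypothetical) keys `sequential_time` and `batch_time`:

```python
import json
import sys

MIN_SPEEDUP = 1.25  # fail CI if batch mode drops below the expected range


def check(results: dict, min_speedup: float = MIN_SPEEDUP) -> bool:
    """Return True if the measured speedup meets the threshold."""
    # Key names are assumptions; adjust to the benchmark's real schema.
    speedup = results["sequential_time"] / results["batch_time"]
    print(f"measured speedup: {speedup:.2f}x (threshold {min_speedup}x)")
    return speedup >= min_speedup


if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1]) as f:
        data = json.load(f)
    sys.exit(0 if check(data) else 1)
```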