kirudang's picture
Copy files from original watermark leaderboard
40b3335

A newer version of the Gradio SDK is available: 6.1.0

Upgrade
metadata
title: Watermark Leaderboard
emoji: πŸ†
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
short_description: Interactive leaderboard for watermark performance evaluation

Watermark Leaderboard πŸ†

An interactive leaderboard for comparing watermark performance across different models and evaluation settings.

Features

  • Interactive Scatter Plot: Visualize watermark performance with Plotly charts
  • Performance Table: Detailed metrics with sorting and filtering
  • Multiple Evaluation Settings: Attack-free, Watermark Removal, and Stealing Attack
  • Model Support: LLaMA3 and DeepSeek models
  • Dynamic Filtering: Real-time updates based on model and metric selection
  • Flexible Submissions: Submit data for any combination of attack types
  • Pending Approval System: All submissions reviewed before appearing on leaderboard
  • Complete Field Visibility: Administrators see all submission details for review
  • Professional UI: Clean, modern interface with accordion sections
  • Reproducibility: Access to all evaluation codes and guidelines

How to Use

  1. Select Model: Choose between LLaMA3 or DeepSeek
  2. Choose Setting: Pick from Attack-free, Watermark Removal, or Stealing Attack
  3. View Results: Explore the scatter plot and detailed table
  4. Submit Data: Click "Add Your Data" to submit new results
    • Submit any combination of attack types (Attack-free, Watermark Removal, Stealing Attack)
    • All submissions go through approval process before appearing on leaderboard
  5. Administrator Review: Administrators can review pending submissions with full field visibility

Metrics Explained

  • Normalized Utility ↑: Higher values indicate better text quality
  • Detection Rate (%) ↑: Higher values indicate better watermark detection
  • Absolute Utility Degradation ↑: Higher values indicate better resistance to removal attacks
  • Adversary BERT Score ↑: Higher values indicate better performance under adversarial conditions

Contributing

We encourage researchers to contribute their evaluation results. Please follow the guidelines in the "Guidelines" section for submission requirements.

License

MIT License


Last updated: September 2024