--- title: Watermark Leaderboard emoji: 🏆 colorFrom: blue colorTo: green sdk: gradio sdk_version: "4.44.0" app_file: app.py pinned: false license: mit short_description: Interactive leaderboard for watermark performance evaluation --- # Watermark Leaderboard 🏆 An interactive leaderboard for comparing watermark performance across different models and evaluation settings. ## Features - **Interactive Scatter Plot**: Visualize watermark performance with Plotly charts - **Performance Table**: Detailed metrics with sorting and filtering - **Multiple Evaluation Settings**: Attack-free, Watermark Removal, and Stealing Attack - **Model Support**: LLaMA3 and DeepSeek models - **Dynamic Filtering**: Real-time updates based on model and metric selection - **Flexible Submissions**: Submit data for any combination of attack types - **Pending Approval System**: All submissions reviewed before appearing on leaderboard - **Complete Field Visibility**: Administrators see all submission details for review - **Professional UI**: Clean, modern interface with accordion sections - **Reproducibility**: Access to all evaluation codes and guidelines ## How to Use 1. **Select Model**: Choose between LLaMA3 or DeepSeek 2. **Choose Setting**: Pick from Attack-free, Watermark Removal, or Stealing Attack 3. **View Results**: Explore the scatter plot and detailed table 4. **Submit Data**: Click "Add Your Data" to submit new results - Submit any combination of attack types (Attack-free, Watermark Removal, Stealing Attack) - All submissions go through approval process before appearing on leaderboard 5. **Administrator Review**: Administrators can review pending submissions with full field visibility ## Metrics Explained - **Normalized Utility ↑**: Higher values indicate better text quality - **Detection Rate (%) ↑**: Higher values indicate better watermark detection - **Absolute Utility Degradation ↑**: Higher values indicate better resistance to removal attacks - **Adversary BERT Score ↑**: Higher values indicate better performance under adversarial conditions ## Contributing We encourage researchers to contribute their evaluation results. Please follow the guidelines in the "Guidelines" section for submission requirements. ## License MIT License --- *Last updated: September 2024*