Spaces:
Running
Running
metadata
title: SafeLLM Leaderboard
emoji: π‘οΈ
colorFrom: indigo
colorTo: blue
sdk: docker
pinned: true
license: apache-2.0
short_description: Trusted OSS Model Supply Chain Security Rankings
π° SafeLLM Leaderboard
Comprehensive security rankings for machine learning models
π About
This leaderboard displays security rankings for ML models scanned with Palisade, a comprehensive security scanner that detects:
- π― Backdoors & Trojans - Hidden malicious behaviors
- π Pickle RCE - Remote code execution vulnerabilities
- π₯ Buffer Overflows - Memory safety issues
- π Supply Chain Attacks - Compromised dependencies
- π Model Integrity - Tampering detection
- π Tokenizer Hijacking - Malicious configurations
π― Understanding the Scores
Security Score
Lower is better! Calculated as:
Score = (Critical Γ 100) + (High Γ 50) + (Medium Γ 10) + (Low Γ 2)
- 0-49: β Excellent security
- 50-99: π‘ Good security
- 100-199: π Moderate concerns
- 200+: π΄ Significant issues
Risk Levels
| Level | Meaning | Action |
|---|---|---|
| π’ Safe | No significant issues | Deploy with confidence |
| π‘ Low | Minor issues only | Review and monitor |
| π Medium | Some concerns | Fix before production |
| π΄ High | Serious issues | Use with caution |
| β Critical | Critical vulnerabilities | Do NOT use |
π Features
- Interactive Filtering - By risk level, score, and size
- Rich Visualizations - Charts and graphs powered by Plotly
- Detailed Analysis - Threat categories and MITRE ATT&CK mapping
- SARIF Reports - Industry-standard security reports
- Real-time Updates - Auto-refreshes from HuggingFace dataset
π Data Source
All scan results are stored in the public dataset: javelinai/palisade-scan-results
Models are scanned weekly with automated GitHub Actions.
π οΈ Technology Stack
- Scanner: Palisade
- Frontend: Gradio 4.27
- Visualizations: Plotly
- Data: HuggingFace Datasets
- Hosting: HuggingFace Spaces