SafeLLM-leaderboard / README.md
saucam's picture
Update README.md
c2834c2 verified
---
title: SafeLLM Leaderboard
emoji: πŸ›‘οΈ
colorFrom: indigo
colorTo: blue
sdk: docker
pinned: true
license: apache-2.0
short_description: Trusted OSS Model Supply Chain Security Rankings
---
# 🏰 SafeLLM Leaderboard
**Comprehensive security rankings for machine learning models**
[![Dataset](https://img.shields.io/badge/πŸ€—-Dataset-yellow)](https://huggingface.co/datasets/javelinai/palisade-scan-results)
[![Powered by Palisade](https://img.shields.io/badge/Powered%20by-Palisade-blue)](https://github.com/highflame-ai/highflame-palisade)
---
## πŸ“Š About
This leaderboard displays security rankings for ML models scanned with **[Palisade](https://github.com/highflame-ai/palisade)**,
a comprehensive security scanner that detects:
- 🎯 **Backdoors & Trojans** - Hidden malicious behaviors
- πŸ”“ **Pickle RCE** - Remote code execution vulnerabilities
- πŸ’₯ **Buffer Overflows** - Memory safety issues
- πŸ”— **Supply Chain Attacks** - Compromised dependencies
- πŸ” **Model Integrity** - Tampering detection
- 🎭 **Tokenizer Hijacking** - Malicious configurations
## 🎯 Understanding the Scores
### Security Score
**Lower is better!** Calculated as:
```
Score = (Critical Γ— 100) + (High Γ— 50) + (Medium Γ— 10) + (Low Γ— 2)
```
- **0-49**: βœ… Excellent security
- **50-99**: 🟑 Good security
- **100-199**: 🟠 Moderate concerns
- **200+**: πŸ”΄ Significant issues
### Risk Levels
| Level | Meaning | Action |
|-------|---------|--------|
| 🟒 **Safe** | No significant issues | Deploy with confidence |
| 🟑 **Low** | Minor issues only | Review and monitor |
| 🟠 **Medium** | Some concerns | Fix before production |
| πŸ”΄ **High** | Serious issues | Use with caution |
| β›” **Critical** | Critical vulnerabilities | Do NOT use |
## πŸ“ˆ Features
- **Interactive Filtering** - By risk level, score, and size
- **Rich Visualizations** - Charts and graphs powered by Plotly
- **Detailed Analysis** - Threat categories and MITRE ATT&CK mapping
- **SARIF Reports** - Industry-standard security reports
- **Real-time Updates** - Auto-refreshes from HuggingFace dataset
## πŸ” Data Source
All scan results are stored in the public dataset:
**[javelinai/palisade-scan-results](https://huggingface.co/datasets/highflame/palisade-scan-results)**
Models are scanned weekly with automated GitHub Actions.
## πŸ› οΈ Technology Stack
- **Scanner**: [Palisade](https://github.com/highflame-ai/highflame-palisade)
- **Frontend**: Gradio 4.27
- **Visualizations**: Plotly
- **Data**: HuggingFace Datasets
- **Hosting**: HuggingFace Spaces
## πŸ“š Learn More
- [Palisade Documentation](https://github.com/highflame-ai/highflame-palisade)
- [SARIF Specification](https://docs.oasis-open.org/sarif/sarif/v2.1.0/)
- [MITRE ATT&CK for ML](https://atlas.mitre.org/)
## πŸ“ž Support
- πŸ’¬ [Discord](https://discord.gg/javelin)
- πŸ“§ [Email](mailto:support@highflame.com)
- 🐦 [Twitter](https://twitter.com/getjavelin)
---
<div align="center">
**Built with ❀️ by [Highflame](https://highflame.com)**
[Website](https://highflame.com) β€’
[GitHub](https://github.com/highflame-ai) β€’
[Discord](https://discord.gg/javelin)
</div>