title: SHA-Index
emoji: 🕵️
colorFrom: gray
colorTo: blue
sdk: static
pinned: false
🕵️ SHA-Index
The Global Registry of Model Provenance
SHA-Index is a community-driven initiative to map the "DNA" of open-source AI models. By indexing the unique SHA256 hashes of model weights (via Git LFS), we trace the lineage of models across the Hugging Face Hub, identifying original authors and verifying model authenticity.
🛠️ Our Tools
🕵️ Search-SHA (Live Tool)
Search-SHA is our flagship tool. It lets you:
- Trace Origins: Paste a SHA256 hash to find the original repository where it first appeared.
- Live Patrol: Scan the newest uploads on Hugging Face to detect re-uploads and uncredited copies in real time.
- Index Repos: Help grow the database by scanning your favorite models.
💾 The Data
🧬 Model DNA Index
This is the central database powering our tools. It is an open, community-maintained registry of:
- SHA256 Hashes (extracted from LFS pointers)
- Repository IDs
- Creation Timestamps
- Filenames
This dataset is updated automatically by the Search-SHA space.
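As an illustration of the four fields listed above, here is a minimal sketch of what a single registry entry could look like. The class name `IndexRecord`, the field names, and the JSON-lines serialization are assumptions made for this example, not the dataset's actual schema.

```python
import json
from dataclasses import asdict, dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class IndexRecord:
    """One registry entry (hypothetical field names, not the real schema)."""
    sha256: str           # 64 hex characters taken from the LFS pointer
    repo_id: str          # repository ID, e.g. "example-user/example-model"
    created_at: datetime  # creation timestamp recorded for the entry
    filename: str         # path of the weight file inside the repository


def to_json_line(record: IndexRecord) -> str:
    """Serialize a record as one JSON line with an ISO 8601 timestamp."""
    row = asdict(record)
    row["created_at"] = record.created_at.isoformat()
    return json.dumps(row, sort_keys=True)


example = IndexRecord(
    sha256="a" * 64,  # placeholder hash, not a real model file
    repo_id="example-user/example-model",
    created_at=datetime(2024, 1, 1, tzinfo=timezone.utc),
    filename="model.safetensors",
)
print(to_json_line(example))
```

Storing the timestamp with an explicit timezone keeps entries comparable when they are later sorted by age.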
How It Works
We do not download model weights. Instead, we analyze the Git LFS (Large File Storage) pointer files. These tiny metadata files record the SHA256 hash and byte size of the real file, so they act as a fingerprint for the actual weights.
- Scan: We read the `oid sha256:...` line from the pointer file.
- Compare: We check our index for this hash.
- Verify: If the hash is already indexed, the earliest timestamp wins. The repository with that timestamp is considered the "Original Source."
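The Scan, Compare, and Verify steps can be sketched roughly as follows. This is an illustrative example, not the Search-SHA implementation: the function names, the tuple layout of the in-memory index, and the sample repository IDs are all hypothetical. Only the pointer layout (`version`, `oid sha256:`, `size` lines) follows the Git LFS pointer format.

```python
import re
from datetime import datetime, timezone
from typing import Optional

# A Git LFS pointer contains a line of the form "oid sha256:<64 hex chars>".
OID_RE = re.compile(r"^oid sha256:([0-9a-f]{64})$", re.MULTILINE)


def extract_sha256(pointer_text: str) -> Optional[str]:
    """Scan: return the SHA256 from an LFS pointer, or None if absent."""
    match = OID_RE.search(pointer_text)
    return match.group(1) if match else None


def original_source(index, sha256: str) -> Optional[str]:
    """Compare and Verify: among entries with this hash, earliest timestamp wins.

    `index` is a list of (sha256, repo_id, created_at) tuples (assumed layout).
    """
    matches = [entry for entry in index if entry[0] == sha256]
    if not matches:
        return None  # hash has not been indexed yet
    return min(matches, key=lambda entry: entry[2])[1]


# Sample pointer text with a placeholder hash.
pointer = "\n".join([
    "version https://git-lfs.github.com/spec/v1",
    "oid sha256:" + "ab" * 32,
    "size 4096",
]) + "\n"

# Hypothetical index entries: two repositories share the same weights.
index = [
    ("ab" * 32, "copycat/reupload", datetime(2024, 6, 1, tzinfo=timezone.utc)),
    ("ab" * 32, "author/original", datetime(2023, 3, 15, tzinfo=timezone.utc)),
    ("cd" * 32, "other/model", datetime(2022, 1, 1, tzinfo=timezone.utc)),
]

sha = extract_sha256(pointer)
print(original_source(index, sha))  # author/original
```

A linear scan keeps the sketch short; a real index of any size would more plausibly key entries by hash.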
🤝 Contributing
We believe in open provenance. You can contribute by:
- Using Search-SHA to index missing models.
- Reporting bugs or feature requests in our community discussions.
Let's build the source of truth for AI weights.