File size: 521 Bytes
12df2a9
 
 
 
 
 
 
 
 
34aa1e7
 
 
1
2
3
4
5
6
7
8
9
10
11
12
---
title: README
emoji: 🌖
colorFrom: green
colorTo: red
sdk: static
pinned: false
---

The [webshart format](https://github.com/bghira/webshart) is a community-driven, [loosely-organised](https://hf.co/terminusresearch) attempt at pushing a better standard for dataset metadata.

The datasets here are either a converted dataset from a third-party source (such as CC12M) or were created by [SimpleTuner](https://github.com/bghira/SimpleTuner) or [CaptionFlow](https://github.com/bghira/CaptionFlow) community members.