---
title: README
emoji: 🌖
colorFrom: green
colorTo: red
sdk: static
pinned: false
---

The [webshart format](https://github.com/bghira/webshart) is a community-driven, [loosely organised](https://hf.co/terminusresearch) attempt to push a better standard for dataset metadata. The datasets here are either converted from a third-party source (such as CC12M) or created by [SimpleTuner](https://github.com/bghira/SimpleTuner) or [CaptionFlow](https://github.com/bghira/CaptionFlow) community members.