# dolma3-bin-characterization

Bin-characterization analysis artifacts (RQ4 figures, statistics) for the Dolma3 6T corpus across the 24x24 topic x format grid.

## Provenance

This bucket was renamed on 2026-05-24 as part of the TrackStar typo fix + SOC-number cleanup.

| Field | Value |
|---|---|
| Previous name | `HCAI-Lab/soc14-rq4-bin-characterization` |
| SOC ticket(s) | SOC-14 |
| Renamed | 2026-05-24 |

See [`docs/data_home/inventory.json`](https://github.com/eilab-gt/social-data-attribution/blob/main/docs/data_home/inventory.json) for the full inventory, including the `old_names` field on each entry.
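
As a rough illustration of how the `old_names` field could be used to map a previous bucket name to its current one, here is a minimal Python sketch. The inventory layout is an assumption: only the `old_names` field is confirmed by this README, while the top-level list shape, the `name` key, the sample data, and the `resolve_current_name` helper are hypothetical and should be adjusted to the real `inventory.json`.

```python
import json

# Hypothetical inventory layout: a JSON list of entries, each with a current
# "name" and an "old_names" list. Only `old_names` is mentioned in the README;
# the rest of this structure is assumed for illustration.
SAMPLE_INVENTORY = json.loads("""
[
  {
    "name": "dolma3-bin-characterization",
    "old_names": ["HCAI-Lab/soc14-rq4-bin-characterization"]
  }
]
""")


def resolve_current_name(inventory, old_name):
    """Return the current name of the entry that lists old_name, or None."""
    for entry in inventory:
        # Entries without an old_names field are treated as never renamed.
        if old_name in entry.get("old_names", []):
            return entry.get("name")
    return None


print(resolve_current_name(SAMPLE_INVENTORY, "HCAI-Lab/soc14-rq4-bin-characterization"))
```

To run this against the real file, replace `SAMPLE_INVENTORY` with the result of `json.load` on a local copy of `docs/data_home/inventory.json`, after checking that its structure matches the assumptions above.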