Bucket: HCAI-Lab/dolma3-6t-sample-5000-docs (Human-Centered AI Lab)
Path: HCAI-Lab/dolma3-6t-sample-5000-docs/worker_0024
11.1 GB
56,043 files
Updated about 2 months ago
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000097.jsonl.zst
320 kB
xet
about 2 months ago
d470a510
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000153.jsonl.zst
228 kB
xet
about 2 months ago
88822421
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000281.jsonl.zst
343 kB
xet
about 2 months ago
c29b5926
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000347.jsonl.zst
279 kB
xet
about 2 months ago
76a384c1
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000365.jsonl.zst
263 kB
xet
about 2 months ago
187fa359
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000375.jsonl.zst
303 kB
xet
about 2 months ago
eef881d5
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000376.jsonl.zst
305 kB
xet
about 2 months ago
1d8fe9e8
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000423.jsonl.zst
296 kB
xet
about 2 months ago
50a3c7b2
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000448.jsonl.zst
371 kB
xet
about 2 months ago
6adde3d8
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0016__shard_00000450.jsonl.zst
300 kB
xet
about 2 months ago
208a4c67
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000035.jsonl.zst
295 kB
xet
about 2 months ago
a34bdfbe
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000053.jsonl.zst
324 kB
xet
about 2 months ago
f1a9eb0c
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000058.jsonl.zst
251 kB
xet
about 2 months ago
a3a3f8d6
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000060.jsonl.zst
369 kB
xet
about 2 months ago
d478d1c3
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000062.jsonl.zst
337 kB
xet
about 2 months ago
03d74726
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000066.jsonl.zst
298 kB
xet
about 2 months ago
a6a0f43b
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000071.jsonl.zst
297 kB
xet
about 2 months ago
b6e30e30
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000083.jsonl.zst
281 kB
xet
about 2 months ago
deb9bd6f
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000088.jsonl.zst
301 kB
xet
about 2 months ago
1872c23e
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000091.jsonl.zst
286 kB
xet
about 2 months ago
e7931b90
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000101.jsonl.zst
261 kB
xet
about 2 months ago
126b7916
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000102.jsonl.zst
365 kB
xet
about 2 months ago
9634d6c5
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000108.jsonl.zst
463 kB
xet
about 2 months ago
75b5875f
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000110.jsonl.zst
344 kB
xet
about 2 months ago
3108889d
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000117.jsonl.zst
276 kB
xet
about 2 months ago
e0389723
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000118.jsonl.zst
302 kB
xet
about 2 months ago
93c61e36
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000123.jsonl.zst
339 kB
xet
about 2 months ago
3c4222e9
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000124.jsonl.zst
266 kB
xet
about 2 months ago
e286a700
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000133.jsonl.zst
349 kB
xet
about 2 months ago
3d85c8ce
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000137.jsonl.zst
238 kB
xet
about 2 months ago
9ba7267d
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000145.jsonl.zst
271 kB
xet
about 2 months ago
75d280e7
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000150.jsonl.zst
318 kB
xet
about 2 months ago
5d4c900f
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000151.jsonl.zst
365 kB
xet
about 2 months ago
55b37f20
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000153.jsonl.zst
236 kB
xet
about 2 months ago
fb56cf14
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000156.jsonl.zst
304 kB
xet
about 2 months ago
2d52547e
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000177.jsonl.zst
321 kB
xet
about 2 months ago
3aaec01a
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000181.jsonl.zst
354 kB
xet
about 2 months ago
78b2bb48
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000188.jsonl.zst
260 kB
xet
about 2 months ago
321027cb
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000189.jsonl.zst
326 kB
xet
about 2 months ago
c9f023f4
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000205.jsonl.zst
336 kB
xet
about 2 months ago
07260252
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000229.jsonl.zst
291 kB
xet
about 2 months ago
b36a2bcb
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000237.jsonl.zst
342 kB
xet
about 2 months ago
f29f945a
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000246.jsonl.zst
247 kB
xet
about 2 months ago
48e8b591
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000247.jsonl.zst
357 kB
xet
about 2 months ago
102b3de0
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000252.jsonl.zst
287 kB
xet
about 2 months ago
7b0b9fba
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000255.jsonl.zst
359 kB
xet
about 2 months ago
5e6c04c0
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000265.jsonl.zst
326 kB
xet
about 2 months ago
ca7bbc05
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000267.jsonl.zst
438 kB
xet
about 2 months ago
b56a749d
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000271.jsonl.zst
389 kB
xet
about 2 months ago
c8b3e485
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000272.jsonl.zst
332 kB
xet
about 2 months ago
3100ddd5
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000277.jsonl.zst
291 kB
xet
about 2 months ago
b652b077
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000281.jsonl.zst
307 kB
xet
about 2 months ago
e11f82ba
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000286.jsonl.zst
244 kB
xet
about 2 months ago
40d6a67d
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000287.jsonl.zst
274 kB
xet
about 2 months ago
973230f5
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000299.jsonl.zst
314 kB
xet
about 2 months ago
9c52dd42
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000300.jsonl.zst
271 kB
xet
about 2 months ago
923164da
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000312.jsonl.zst
297 kB
xet
about 2 months ago
38b6cf14
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000321.jsonl.zst
278 kB
xet
about 2 months ago
3aa6a5ef
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000323.jsonl.zst
254 kB
xet
about 2 months ago
b793f19c
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000331.jsonl.zst
327 kB
xet
about 2 months ago
871d2a8b
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000333.jsonl.zst
332 kB
xet
about 2 months ago
5126392a
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000337.jsonl.zst
378 kB
xet
about 2 months ago
c8653e32
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000339.jsonl.zst
282 kB
xet
about 2 months ago
6ae1ec97
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000346.jsonl.zst
392 kB
xet
about 2 months ago
62802a3f
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000349.jsonl.zst
374 kB
xet
about 2 months ago
3ae189b7
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000350.jsonl.zst
342 kB
xet
about 2 months ago
c6af7546
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000375.jsonl.zst
440 kB
xet
about 2 months ago
a3cdc447
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000378.jsonl.zst
246 kB
xet
about 2 months ago
03f056f1
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000387.jsonl.zst
361 kB
xet
about 2 months ago
8e5f2c7e
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000421.jsonl.zst
354 kB
xet
about 2 months ago
e153e92b
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000438.jsonl.zst
389 kB
xet
about 2 months ago
5d3a23aa
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000439.jsonl.zst
266 kB
xet
about 2 months ago
f270bbae
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000447.jsonl.zst
402 kB
xet
about 2 months ago
e0b25539
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000453.jsonl.zst
436 kB
xet
about 2 months ago
ea55e2a1
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000455.jsonl.zst
399 kB
xet
about 2 months ago
52c0358c
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000462.jsonl.zst
294 kB
xet
about 2 months ago
7895db9e
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000474.jsonl.zst
362 kB
xet
about 2 months ago
87ea6ac8
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000475.jsonl.zst
306 kB
xet
about 2 months ago
a91b9fb6
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000477.jsonl.zst
279 kB
xet
about 2 months ago
a56c30c0
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000478.jsonl.zst
335 kB
xet
about 2 months ago
8273ef16
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000479.jsonl.zst
339 kB
xet
about 2 months ago
dc593592
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0017__shard_00000500.jsonl.zst
296 kB
xet
about 2 months ago
2594da6f
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000001.jsonl.zst
332 kB
xet
about 2 months ago
6b28f3f5
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000002.jsonl.zst
368 kB
xet
about 2 months ago
d0ef10c2
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000003.jsonl.zst
316 kB
xet
about 2 months ago
e9b7155b
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000004.jsonl.zst
297 kB
xet
about 2 months ago
55d1f7b5
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000007.jsonl.zst
359 kB
xet
about 2 months ago
e4a1c0c8
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000008.jsonl.zst
323 kB
xet
about 2 months ago
64e80967
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000010.jsonl.zst
234 kB
xet
about 2 months ago
fe823b4a
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000012.jsonl.zst
352 kB
xet
about 2 months ago
29cdc638
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000013.jsonl.zst
333 kB
xet
about 2 months ago
895cd45e
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000014.jsonl.zst
280 kB
xet
about 2 months ago
a24e6096
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000016.jsonl.zst
345 kB
xet
about 2 months ago
0e1719b8
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000017.jsonl.zst
301 kB
xet
about 2 months ago
ef89801a
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000018.jsonl.zst
315 kB
xet
about 2 months ago
7fb19594
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000019.jsonl.zst
311 kB
xet
about 2 months ago
08922eb4
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000022.jsonl.zst
251 kB
xet
about 2 months ago
4d8e8c47
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000024.jsonl.zst
303 kB
xet
about 2 months ago
dc0f3942
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000026.jsonl.zst
274 kB
xet
about 2 months ago
96b885c3
soc127__phase1_pool_shared__common_crawl__part_002__data__common_crawl-food_and_dining-0018__shard_00000027.jsonl.zst
351 kB
xet
about 2 months ago
44ff1c63
Total size: 11.1 GB
Files: 56,043
Last updated: Mar 24
Pre-warmed CDN: US, EU