Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0094
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026431.jsonl.zst
59.7 kB
xet
about 2 months ago
ac5c0f00
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026434.jsonl.zst
38.5 kB
xet
about 2 months ago
aa04c213
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026441.jsonl.zst
6.54 kB
xet
about 2 months ago
daa44f65
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026443.jsonl.zst
8.17 kB
xet
about 2 months ago
cf8cb446
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026444.jsonl.zst
21.2 kB
xet
about 2 months ago
66d0ff1e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026445.jsonl.zst
54.7 kB
xet
about 2 months ago
d02a1d96
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026450.jsonl.zst
106 kB
xet
about 2 months ago
5eb8235b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026456.jsonl.zst
145 kB
xet
about 2 months ago
1dcf8092
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026460.jsonl.zst
113 kB
xet
about 2 months ago
6fdc6a06
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026463.jsonl.zst
43.9 kB
xet
about 2 months ago
da7461da
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026468.jsonl.zst
3.92 kB
xet
about 2 months ago
0e2a7ef4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026473.jsonl.zst
17.6 kB
xet
about 2 months ago
bf635435
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026474.jsonl.zst
62.9 kB
xet
about 2 months ago
c0317ce0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026478.jsonl.zst
55.4 kB
xet
about 2 months ago
623046a5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026481.jsonl.zst
32.8 kB
xet
about 2 months ago
ba1bc438
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026490.jsonl.zst
90.1 kB
xet
about 2 months ago
3b1e541a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026491.jsonl.zst
30.7 kB
xet
about 2 months ago
c366d5d0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026492.jsonl.zst
58.4 kB
xet
about 2 months ago
1db3fed3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026493.jsonl.zst
54.3 kB
xet
about 2 months ago
c0d3632a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026498.jsonl.zst
78.3 kB
xet
about 2 months ago
c22e95c9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026508.jsonl.zst
82.1 kB
xet
about 2 months ago
9431007c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026509.jsonl.zst
14.6 kB
xet
about 2 months ago
99308822
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026514.jsonl.zst
33.5 kB
xet
about 2 months ago
b56a7b73
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026519.jsonl.zst
52.7 kB
xet
about 2 months ago
cf84748c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026520.jsonl.zst
35.1 kB
xet
about 2 months ago
351c653a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026523.jsonl.zst
40.6 kB
xet
about 2 months ago
a02d1721
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026526.jsonl.zst
49.7 kB
xet
about 2 months ago
aefe66fc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026530.jsonl.zst
55.2 kB
xet
about 2 months ago
d4ac6c23
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026537.jsonl.zst
41.3 kB
xet
about 2 months ago
d6295fbd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026538.jsonl.zst
69.6 kB
xet
about 2 months ago
37bcef36
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026539.jsonl.zst
53.8 kB
xet
about 2 months ago
ddc22d24
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026541.jsonl.zst
31.8 kB
xet
about 2 months ago
1f402431
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026546.jsonl.zst
428 kB
xet
about 2 months ago
807af94a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026553.jsonl.zst
31.3 kB
xet
about 2 months ago
4fd48737
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026555.jsonl.zst
15.5 kB
xet
about 2 months ago
dc1c9478
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026556.jsonl.zst
4.75 kB
xet
about 2 months ago
f4e59497
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026558.jsonl.zst
29.2 kB
xet
about 2 months ago
c436b5b5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026561.jsonl.zst
44.3 kB
xet
about 2 months ago
61a21fa3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026568.jsonl.zst
54.3 kB
xet
about 2 months ago
b32f3951
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026569.jsonl.zst
73 kB
xet
about 2 months ago
c83e9331
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026570.jsonl.zst
25.2 kB
xet
about 2 months ago
7702230a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026577.jsonl.zst
2.82 kB
xet
about 2 months ago
21f98c2e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026584.jsonl.zst
83.5 kB
xet
about 2 months ago
0ba6832b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026585.jsonl.zst
6.46 kB
xet
about 2 months ago
730b7109
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026586.jsonl.zst
15.3 kB
xet
about 2 months ago
1ffe3981
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026593.jsonl.zst
35.2 kB
xet
about 2 months ago
88725e98
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026600.jsonl.zst
31.6 kB
xet
about 2 months ago
e70fb6af
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026602.jsonl.zst
315 kB
xet
about 2 months ago
79a88b09
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026607.jsonl.zst
25.3 kB
xet
about 2 months ago
aad682d0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026609.jsonl.zst
97.1 kB
xet
about 2 months ago
bd979b1d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026614.jsonl.zst
17.3 kB
xet
about 2 months ago
4dda2135
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026616.jsonl.zst
51.8 kB
xet
about 2 months ago
89647b03
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026618.jsonl.zst
157 kB
xet
about 2 months ago
b8ebf7b7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026622.jsonl.zst
38.7 kB
xet
about 2 months ago
2af78b3d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026626.jsonl.zst
61 kB
xet
about 2 months ago
65b234c4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026631.jsonl.zst
42.7 kB
xet
about 2 months ago
278c640c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026632.jsonl.zst
77.7 kB
xet
about 2 months ago
7ba38161
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026639.jsonl.zst
53.7 kB
xet
about 2 months ago
c0eb0030
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026641.jsonl.zst
39.1 kB
xet
about 2 months ago
b5fc8be1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026646.jsonl.zst
50.5 kB
xet
about 2 months ago
43939c9b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026647.jsonl.zst
23.1 kB
xet
about 2 months ago
0751a432
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026650.jsonl.zst
24.5 kB
xet
about 2 months ago
ed123a9c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026655.jsonl.zst
36.2 kB
xet
about 2 months ago
2a451370
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026656.jsonl.zst
38.6 kB
xet
about 2 months ago
1bee153f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026662.jsonl.zst
72 kB
xet
about 2 months ago
9efe6f1c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026663.jsonl.zst
104 kB
xet
about 2 months ago
70402b92
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026665.jsonl.zst
17.1 kB
xet
about 2 months ago
fe0cd692
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026671.jsonl.zst
37.4 kB
xet
about 2 months ago
2fcf4237
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026672.jsonl.zst
74.8 kB
xet
about 2 months ago
c80bef2d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026678.jsonl.zst
12 kB
xet
about 2 months ago
e6908d92
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026682.jsonl.zst
49.5 kB
xet
about 2 months ago
36233915
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026688.jsonl.zst
75.5 kB
xet
about 2 months ago
0af6b2d2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026689.jsonl.zst
65.4 kB
xet
about 2 months ago
e87bd9bb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026694.jsonl.zst
19.3 kB
xet
about 2 months ago
4afd5035
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026696.jsonl.zst
17.6 kB
xet
about 2 months ago
04a222a6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026704.jsonl.zst
86.5 kB
xet
about 2 months ago
8a9b1d72
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026708.jsonl.zst
51.2 kB
xet
about 2 months ago
9664fabe
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026711.jsonl.zst
91.8 kB
xet
about 2 months ago
728186c0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026714.jsonl.zst
16.9 kB
xet
about 2 months ago
22b8e50e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026719.jsonl.zst
14.7 kB
xet
about 2 months ago
c1c8d18e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026725.jsonl.zst
223 kB
xet
about 2 months ago
2104872e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026727.jsonl.zst
98.7 kB
xet
about 2 months ago
502102ae
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026729.jsonl.zst
13.8 kB
xet
about 2 months ago
adc956b6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026736.jsonl.zst
72.3 kB
xet
about 2 months ago
675e550d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026744.jsonl.zst
11.1 kB
xet
about 2 months ago
3421e266
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026752.jsonl.zst
32.5 kB
xet
about 2 months ago
1c9498f8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026753.jsonl.zst
75.5 kB
xet
about 2 months ago
151fca79
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026756.jsonl.zst
89.6 kB
xet
about 2 months ago
205679e4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026758.jsonl.zst
67 kB
xet
about 2 months ago
f3b8f8dd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026761.jsonl.zst
7.27 kB
xet
about 2 months ago
9870ecfd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026769.jsonl.zst
6.91 kB
xet
about 2 months ago
7050d733
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026771.jsonl.zst
24.1 kB
xet
about 2 months ago
1fb8673c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026774.jsonl.zst
63.1 kB
xet
about 2 months ago
5cb68bd1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026776.jsonl.zst
60.7 kB
xet
about 2 months ago
a1a82a1b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026783.jsonl.zst
140 kB
xet
about 2 months ago
63cef4ab
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026784.jsonl.zst
42.3 kB
xet
about 2 months ago
94e38633
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026787.jsonl.zst
56.4 kB
xet
about 2 months ago
ef128ab2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026789.jsonl.zst
386 kB
xet
about 2 months ago
0bf1ac62
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026792.jsonl.zst
7.66 kB
xet
about 2 months ago
223564b5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00026798.jsonl.zst
31 kB
xet
about 2 months ago
2682cc12
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors