Bucket: HCAI-Lab/dolma3-6t-sample-5000-docs (Human-Centered AI Lab)
Path: HCAI-Lab/dolma3-6t-sample-5000-docs/worker_0089
11.1 GB, 56,043 files, updated about 2 months ago
| Name | Size | Uploaded | Xet hash |
| --- | --- | --- | --- |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018019.jsonl.zst | 23.7 kB | about 2 months ago | 359a69eb |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018020.jsonl.zst | 54.3 kB | about 2 months ago | 81564caa |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018043.jsonl.zst | 8.69 kB | about 2 months ago | 2ac60a03 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018044.jsonl.zst | 22.6 kB | about 2 months ago | 77f9cf0c |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018048.jsonl.zst | 58.1 kB | about 2 months ago | ff702d2c |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018049.jsonl.zst | 8.84 kB | about 2 months ago | 2341667e |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018070.jsonl.zst | 47.9 kB | about 2 months ago | 19f9da3f |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018074.jsonl.zst | 16.1 kB | about 2 months ago | dc43faec |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018076.jsonl.zst | 9.02 kB | about 2 months ago | 90337ba0 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018077.jsonl.zst | 100 kB | about 2 months ago | 1fbe7ef8 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018078.jsonl.zst | 4.59 kB | about 2 months ago | 449f36f7 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018084.jsonl.zst | 22.7 kB | about 2 months ago | 2f5dd835 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018099.jsonl.zst | 75.5 kB | about 2 months ago | c85e04ad |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018104.jsonl.zst | 90.2 kB | about 2 months ago | cd585c10 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018106.jsonl.zst | 56.6 kB | about 2 months ago | 3a819e7a |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018108.jsonl.zst | 2.35 kB | about 2 months ago | 42ca0c29 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018110.jsonl.zst | 19.7 kB | about 2 months ago | 0bdf72b2 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018130.jsonl.zst | 72.7 kB | about 2 months ago | d775b865 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018134.jsonl.zst | 3.44 kB | about 2 months ago | d1166639 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018137.jsonl.zst | 30.8 kB | about 2 months ago | a6fb2b75 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018139.jsonl.zst | 12.9 kB | about 2 months ago | 4a6740d6 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018140.jsonl.zst | 16.5 kB | about 2 months ago | be4f439f |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018143.jsonl.zst | 7.89 kB | about 2 months ago | 1d74ab3c |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018157.jsonl.zst | 36.9 kB | about 2 months ago | b31f6ca7 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018164.jsonl.zst | 4.1 kB | about 2 months ago | 77bbbaa7 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018165.jsonl.zst | 25.1 kB | about 2 months ago | 317adda9 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018166.jsonl.zst | 35.9 kB | about 2 months ago | d5f4d8ee |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018167.jsonl.zst | 5.12 kB | about 2 months ago | 32ca0908 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018188.jsonl.zst | 47.7 kB | about 2 months ago | fd1cc088 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018194.jsonl.zst | 34.5 kB | about 2 months ago | 62ae4bdb |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018195.jsonl.zst | 13 kB | about 2 months ago | 438f5e78 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018196.jsonl.zst | 45.7 kB | about 2 months ago | 2d371f82 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018198.jsonl.zst | 17.6 kB | about 2 months ago | 17cc64fa |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018202.jsonl.zst | 39.8 kB | about 2 months ago | 32a7ef27 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018216.jsonl.zst | 20.3 kB | about 2 months ago | 3249076a |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018221.jsonl.zst | 93.4 kB | about 2 months ago | 1ee07752 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018223.jsonl.zst | 41.9 kB | about 2 months ago | 4c0ad2d6 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018228.jsonl.zst | 13.8 kB | about 2 months ago | bcacbf1d |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018231.jsonl.zst | 8.74 kB | about 2 months ago | 2e510e59 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018244.jsonl.zst | 17 kB | about 2 months ago | 9c1238d1 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018251.jsonl.zst | 19.2 kB | about 2 months ago | e02e9149 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018252.jsonl.zst | 91.8 kB | about 2 months ago | 968f37ff |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018253.jsonl.zst | 19.4 kB | about 2 months ago | 150d437b |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018257.jsonl.zst | 53.4 kB | about 2 months ago | 4a812530 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018261.jsonl.zst | 62.2 kB | about 2 months ago | 8b908f74 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018278.jsonl.zst | 22.6 kB | about 2 months ago | 7ba17f32 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018279.jsonl.zst | 2.46 kB | about 2 months ago | 79e364e7 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018280.jsonl.zst | 540 kB | about 2 months ago | 853f217f |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018285.jsonl.zst | 19.7 kB | about 2 months ago | fede6d17 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018301.jsonl.zst | 29 kB | about 2 months ago | a6527ba5 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018305.jsonl.zst | 50.5 kB | about 2 months ago | 7a269505 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018306.jsonl.zst | 29.1 kB | about 2 months ago | e57b5cd8 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018309.jsonl.zst | 4.79 kB | about 2 months ago | ce9fdd39 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018312.jsonl.zst | 15.2 kB | about 2 months ago | 1463cb27 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018317.jsonl.zst | 30.7 kB | about 2 months ago | 4c56e571 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018328.jsonl.zst | 11 kB | about 2 months ago | 92190548 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018331.jsonl.zst | 3.04 kB | about 2 months ago | cb88b992 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018333.jsonl.zst | 57.7 kB | about 2 months ago | 5721b100 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018335.jsonl.zst | 16.2 kB | about 2 months ago | 5ac7a1a4 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018339.jsonl.zst | 78.3 kB | about 2 months ago | d843d014 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018343.jsonl.zst | 34.5 kB | about 2 months ago | ae6f151d |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018354.jsonl.zst | 10.3 kB | about 2 months ago | 98a5fae0 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018359.jsonl.zst | 10.9 kB | about 2 months ago | d11fbfcd |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018360.jsonl.zst | 24.3 kB | about 2 months ago | b9a7bf61 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018366.jsonl.zst | 19.8 kB | about 2 months ago | 732d0f52 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018368.jsonl.zst | 4.23 kB | about 2 months ago | d8c4c0c7 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018382.jsonl.zst | 22.7 kB | about 2 months ago | 41d429b4 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018385.jsonl.zst | 11.3 kB | about 2 months ago | fe49a691 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018387.jsonl.zst | 274 kB | about 2 months ago | 703f1d73 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018391.jsonl.zst | 7.25 kB | about 2 months ago | 44b99300 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018394.jsonl.zst | 198 kB | about 2 months ago | 1c060b77 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018395.jsonl.zst | 11.9 kB | about 2 months ago | 044db445 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018408.jsonl.zst | 63.5 kB | about 2 months ago | 6b425179 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018417.jsonl.zst | 7.41 kB | about 2 months ago | 6da3512c |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018421.jsonl.zst | 89.2 kB | about 2 months ago | 7c66f121 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018434.jsonl.zst | 28.2 kB | about 2 months ago | e06c47bf |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018441.jsonl.zst | 56.1 kB | about 2 months ago | 4ec9e15e |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018443.jsonl.zst | 21.4 kB | about 2 months ago | 04047b73 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018444.jsonl.zst | 54.5 kB | about 2 months ago | 2a1b000d |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018448.jsonl.zst | 17.5 kB | about 2 months ago | cf9dc6d2 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018450.jsonl.zst | 16.9 kB | about 2 months ago | 81ee6c14 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018461.jsonl.zst | 8.96 kB | about 2 months ago | 6adaa893 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018468.jsonl.zst | 38.8 kB | about 2 months ago | 0455f40c |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018469.jsonl.zst | 31.4 kB | about 2 months ago | 8aa517e0 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018471.jsonl.zst | 104 kB | about 2 months ago | 1b40f3e9 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018476.jsonl.zst | 56.7 kB | about 2 months ago | a6fa6ba7 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018477.jsonl.zst | 17.6 kB | about 2 months ago | e8c536f8 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018489.jsonl.zst | 12.5 kB | about 2 months ago | 0a3f4447 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018495.jsonl.zst | 27.2 kB | about 2 months ago | 6f864085 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018496.jsonl.zst | 6.17 kB | about 2 months ago | d9750038 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018498.jsonl.zst | 33.8 kB | about 2 months ago | 038d0d73 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018504.jsonl.zst | 32.9 kB | about 2 months ago | f853101c |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018517.jsonl.zst | 13.2 kB | about 2 months ago | c2033fa5 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018521.jsonl.zst | 16.9 kB | about 2 months ago | 72555e02 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018530.jsonl.zst | 5.34 kB | about 2 months ago | 4832b234 |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018531.jsonl.zst | 24.1 kB | about 2 months ago | 3aec65fe |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018543.jsonl.zst | 108 kB | about 2 months ago | 2796d9fa |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018548.jsonl.zst | 20.7 kB | about 2 months ago | 304bd5fd |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018549.jsonl.zst | 140 kB | about 2 months ago | c71cba1e |
| soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00018552.jsonl.zst | 128 kB | about 2 months ago | 6afe19ea |
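The file names in the listing above appear to share one pattern: segments joined by double underscores, a trailing `shard_<number>` segment, and a `.jsonl.zst` suffix. The sketch below splits a name into those parts using only the Python standard library. The function name `parse_shard_name` and the returned keys are hypothetical, and what each segment means (pool, source, part, topic) is an assumption read off the names, not something the listing documents.

```python
import re

SUFFIX = ".jsonl.zst"  # extension shared by every file in the listing


def parse_shard_name(name: str) -> dict:
    """Split a listed file name into its '__'-separated segments and shard number.

    Hypothetical helper: the naming convention is inferred from the listing only.
    """
    if not name.endswith(SUFFIX):
        raise ValueError(f"unexpected suffix: {name!r}")
    stem = name[: -len(SUFFIX)]
    parts = stem.split("__")
    # The last segment is expected to look like 'shard_00018019'.
    match = re.fullmatch(r"shard_(\d+)", parts[-1])
    if match is None:
        raise ValueError(f"no shard segment in: {name!r}")
    return {"segments": parts[:-1], "shard": int(match.group(1))}


example = (
    "soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__"
    "olmocr_science_pdfs-health__shard_00018019.jsonl.zst"
)
info = parse_shard_name(example)
print(info["shard"], info["segments"])
```

Parsing the shard number as an integer makes it easy to sort files or detect gaps in the shard sequence; the leading zeros are dropped in the process, so keep the original name if the exact string is needed later.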
Total size: 11.1 GB
Files: 56,043
Last updated: Mar 24
Pre-warmed CDN: US, EU