Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0093
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024901.jsonl.zst
64.8 kB
xet
about 2 months ago
8e8e203a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024905.jsonl.zst
105 kB
xet
about 2 months ago
cd91324d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024906.jsonl.zst
91 kB
xet
about 2 months ago
29a298dd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024911.jsonl.zst
59.3 kB
xet
about 2 months ago
dee8f31f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024916.jsonl.zst
92.6 kB
xet
about 2 months ago
8468f0ee
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024917.jsonl.zst
11.7 kB
xet
about 2 months ago
9ee69262
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024923.jsonl.zst
70.7 kB
xet
about 2 months ago
9779c48d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024926.jsonl.zst
6.81 kB
xet
about 2 months ago
63fc4e19
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024931.jsonl.zst
16.9 kB
xet
about 2 months ago
f631354e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024937.jsonl.zst
76.3 kB
xet
about 2 months ago
61316a3d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024938.jsonl.zst
18.1 kB
xet
about 2 months ago
ef0d01f0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024942.jsonl.zst
34.5 kB
xet
about 2 months ago
21c65560
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024947.jsonl.zst
128 kB
xet
about 2 months ago
7ea12798
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024948.jsonl.zst
25.1 kB
xet
about 2 months ago
c1e49e05
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024953.jsonl.zst
43.2 kB
xet
about 2 months ago
43486273
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024954.jsonl.zst
29.5 kB
xet
about 2 months ago
24958cda
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024958.jsonl.zst
129 kB
xet
about 2 months ago
8de9f93b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024961.jsonl.zst
135 kB
xet
about 2 months ago
83a1b695
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024965.jsonl.zst
25.2 kB
xet
about 2 months ago
29f035df
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024969.jsonl.zst
75.3 kB
xet
about 2 months ago
f74b69f5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024970.jsonl.zst
18 kB
xet
about 2 months ago
bcdf76b6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024973.jsonl.zst
18 kB
xet
about 2 months ago
f2f6620d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024980.jsonl.zst
147 kB
xet
about 2 months ago
d510bc32
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024985.jsonl.zst
167 kB
xet
about 2 months ago
78a2fba9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024986.jsonl.zst
21.5 kB
xet
about 2 months ago
42ebaffc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00024996.jsonl.zst
26.7 kB
xet
about 2 months ago
e5b6d6d0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025000.jsonl.zst
49.2 kB
xet
about 2 months ago
69f3fcec
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025002.jsonl.zst
21.8 kB
xet
about 2 months ago
daafb1d8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025008.jsonl.zst
37.9 kB
xet
about 2 months ago
abed6bb4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025010.jsonl.zst
22.9 kB
xet
about 2 months ago
af26c3cf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025016.jsonl.zst
14.1 kB
xet
about 2 months ago
7fdd6b56
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025017.jsonl.zst
155 kB
xet
about 2 months ago
aef90fcd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025021.jsonl.zst
62.3 kB
xet
about 2 months ago
ae483f1f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025022.jsonl.zst
129 kB
xet
about 2 months ago
549b6019
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025025.jsonl.zst
67.8 kB
xet
about 2 months ago
2f127302
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025032.jsonl.zst
42.9 kB
xet
about 2 months ago
7ccfbd4c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025033.jsonl.zst
36.9 kB
xet
about 2 months ago
9cdc8572
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025036.jsonl.zst
66.1 kB
xet
about 2 months ago
aa1ab8a7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025039.jsonl.zst
230 kB
xet
about 2 months ago
121bb716
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025042.jsonl.zst
54.8 kB
xet
about 2 months ago
8a22591d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025049.jsonl.zst
49.5 kB
xet
about 2 months ago
dd628623
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025050.jsonl.zst
3.97 kB
xet
about 2 months ago
9095f349
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025053.jsonl.zst
56.8 kB
xet
about 2 months ago
c3a9e90b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025054.jsonl.zst
57.8 kB
xet
about 2 months ago
f3c3a9b9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025057.jsonl.zst
31.2 kB
xet
about 2 months ago
c649164a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025064.jsonl.zst
109 kB
xet
about 2 months ago
35c80d8f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025065.jsonl.zst
18.6 kB
xet
about 2 months ago
3f8218a7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025069.jsonl.zst
109 kB
xet
about 2 months ago
b352321d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025070.jsonl.zst
5.71 kB
xet
about 2 months ago
dd99b2a3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025074.jsonl.zst
50 kB
xet
about 2 months ago
42eff913
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025080.jsonl.zst
110 kB
xet
about 2 months ago
2889037f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025081.jsonl.zst
14.6 kB
xet
about 2 months ago
6912547a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025084.jsonl.zst
46.3 kB
xet
about 2 months ago
2b3f6e23
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025085.jsonl.zst
48.8 kB
xet
about 2 months ago
80e13d8f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025088.jsonl.zst
22 kB
xet
about 2 months ago
bf328c54
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025096.jsonl.zst
44.8 kB
xet
about 2 months ago
0da4217f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025098.jsonl.zst
52.5 kB
xet
about 2 months ago
1313a482
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025101.jsonl.zst
14.8 kB
xet
about 2 months ago
d9171c87
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025102.jsonl.zst
70.2 kB
xet
about 2 months ago
0e3736cd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025105.jsonl.zst
13.1 kB
xet
about 2 months ago
f2fd05e9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025111.jsonl.zst
27.6 kB
xet
about 2 months ago
22127ae6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025116.jsonl.zst
47.4 kB
xet
about 2 months ago
5f8a28e5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025117.jsonl.zst
22.4 kB
xet
about 2 months ago
ec63289e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025120.jsonl.zst
88.6 kB
xet
about 2 months ago
f4c444c2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025129.jsonl.zst
18 kB
xet
about 2 months ago
7af72116
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025130.jsonl.zst
83.3 kB
xet
about 2 months ago
16d8376a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025133.jsonl.zst
486 kB
xet
about 2 months ago
d130df9f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025134.jsonl.zst
14.8 kB
xet
about 2 months ago
2973be54
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025136.jsonl.zst
34 kB
xet
about 2 months ago
62dd3ade
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025146.jsonl.zst
31.6 kB
xet
about 2 months ago
0a75e5ee
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025147.jsonl.zst
40 kB
xet
about 2 months ago
fe53da1b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025148.jsonl.zst
18 kB
xet
about 2 months ago
14949e75
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025152.jsonl.zst
54.2 kB
xet
about 2 months ago
321e26d1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025160.jsonl.zst
41.6 kB
xet
about 2 months ago
51b11c8e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025161.jsonl.zst
120 kB
xet
about 2 months ago
1823f005
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025164.jsonl.zst
62.4 kB
xet
about 2 months ago
4f52b6dc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025165.jsonl.zst
46.8 kB
xet
about 2 months ago
08be6563
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025168.jsonl.zst
387 kB
xet
about 2 months ago
ab750844
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025175.jsonl.zst
103 kB
xet
about 2 months ago
1b0cdec0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025176.jsonl.zst
51.8 kB
xet
about 2 months ago
1d9d7365
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025179.jsonl.zst
32.9 kB
xet
about 2 months ago
ef645275
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025181.jsonl.zst
112 kB
xet
about 2 months ago
da734772
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025183.jsonl.zst
39.8 kB
xet
about 2 months ago
b9179126
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025192.jsonl.zst
29 kB
xet
about 2 months ago
5ccca0fd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025193.jsonl.zst
86.3 kB
xet
about 2 months ago
6837095f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025195.jsonl.zst
57.8 kB
xet
about 2 months ago
bbc6e108
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025197.jsonl.zst
58.6 kB
xet
about 2 months ago
8dca5cd8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025200.jsonl.zst
320 kB
xet
about 2 months ago
151328e8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025207.jsonl.zst
51.1 kB
xet
about 2 months ago
2e5a0ab4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025209.jsonl.zst
33.8 kB
xet
about 2 months ago
f8b43473
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025210.jsonl.zst
218 kB
xet
about 2 months ago
b67815b2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025213.jsonl.zst
16.2 kB
xet
about 2 months ago
9b9b5712
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025214.jsonl.zst
15.8 kB
xet
about 2 months ago
1511fa87
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025223.jsonl.zst
47.9 kB
xet
about 2 months ago
ff9d8d90
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025224.jsonl.zst
20.6 kB
xet
about 2 months ago
9f8e71a3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025226.jsonl.zst
86 kB
xet
about 2 months ago
4be540ed
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025227.jsonl.zst
10.2 kB
xet
about 2 months ago
38c06200
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025231.jsonl.zst
75.1 kB
xet
about 2 months ago
28c41e7b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025239.jsonl.zst
25.8 kB
xet
about 2 months ago
4552e8af
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00025240.jsonl.zst
56.6 kB
xet
about 2 months ago
d8ebf54e
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors