Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0090
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020032.jsonl.zst
61.6 kB
xet
about 2 months ago
01eed7ec
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020038.jsonl.zst
41.3 kB
xet
about 2 months ago
2b8c2f0f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020042.jsonl.zst
17.8 kB
xet
about 2 months ago
3df7fe92
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020048.jsonl.zst
57.5 kB
xet
about 2 months ago
aafc4cf6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020053.jsonl.zst
6.3 kB
xet
about 2 months ago
1b9f36bb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020055.jsonl.zst
74.7 kB
xet
about 2 months ago
67cfd2ba
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020060.jsonl.zst
38.9 kB
xet
about 2 months ago
2a39f56f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020061.jsonl.zst
172 kB
xet
about 2 months ago
c818e6b3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020063.jsonl.zst
39.5 kB
xet
about 2 months ago
431c1e66
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020071.jsonl.zst
67.9 kB
xet
about 2 months ago
fda72435
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020074.jsonl.zst
56.9 kB
xet
about 2 months ago
fb610c21
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020081.jsonl.zst
110 kB
xet
about 2 months ago
db7c6078
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020083.jsonl.zst
143 kB
xet
about 2 months ago
3951380b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020087.jsonl.zst
127 kB
xet
about 2 months ago
f361ff55
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020088.jsonl.zst
16.4 kB
xet
about 2 months ago
7d33f352
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020095.jsonl.zst
27.1 kB
xet
about 2 months ago
755aa8df
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020099.jsonl.zst
29 kB
xet
about 2 months ago
cbbcbfae
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020101.jsonl.zst
86.9 kB
xet
about 2 months ago
d10bb417
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020106.jsonl.zst
83.6 kB
xet
about 2 months ago
351d1296
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020107.jsonl.zst
65.9 kB
xet
about 2 months ago
ad493da8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020108.jsonl.zst
38.9 kB
xet
about 2 months ago
a3b3350d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020122.jsonl.zst
15.8 kB
xet
about 2 months ago
ed9426ec
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020126.jsonl.zst
17.2 kB
xet
about 2 months ago
d3bdb2bd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020129.jsonl.zst
17.9 kB
xet
about 2 months ago
db3834d0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020131.jsonl.zst
102 kB
xet
about 2 months ago
85b8ac13
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020132.jsonl.zst
53 kB
xet
about 2 months ago
60de6377
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020143.jsonl.zst
19.7 kB
xet
about 2 months ago
ca6e3c19
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020144.jsonl.zst
30.7 kB
xet
about 2 months ago
6828ae7b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020152.jsonl.zst
30.8 kB
xet
about 2 months ago
e594a567
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020154.jsonl.zst
102 kB
xet
about 2 months ago
9adc3a39
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020158.jsonl.zst
18.1 kB
xet
about 2 months ago
c54ef0a5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020165.jsonl.zst
22.2 kB
xet
about 2 months ago
4912f2c9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020172.jsonl.zst
53.2 kB
xet
about 2 months ago
9635097c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020174.jsonl.zst
95.7 kB
xet
about 2 months ago
0cdf7b6c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020180.jsonl.zst
95.3 kB
xet
about 2 months ago
8576b2d1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020182.jsonl.zst
53.2 kB
xet
about 2 months ago
690e7ad6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020188.jsonl.zst
108 kB
xet
about 2 months ago
cba2c5d6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020197.jsonl.zst
56.5 kB
xet
about 2 months ago
b84a59a6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020198.jsonl.zst
8.93 kB
xet
about 2 months ago
046dfb21
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020202.jsonl.zst
255 kB
xet
about 2 months ago
9d2c75e9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020205.jsonl.zst
41.4 kB
xet
about 2 months ago
31b75c89
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020209.jsonl.zst
52.9 kB
xet
about 2 months ago
de5daccd
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020213.jsonl.zst
21.2 kB
xet
about 2 months ago
53d63ef5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020221.jsonl.zst
9.01 kB
xet
about 2 months ago
30e904cc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020227.jsonl.zst
12.4 kB
xet
about 2 months ago
2d6a60d2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020229.jsonl.zst
30.5 kB
xet
about 2 months ago
a7831f17
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020234.jsonl.zst
56 kB
xet
about 2 months ago
11c464ad
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020236.jsonl.zst
53 kB
xet
about 2 months ago
4dbd2eb9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020242.jsonl.zst
49.9 kB
xet
about 2 months ago
ec117e6c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020244.jsonl.zst
30.5 kB
xet
about 2 months ago
e271a864
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020249.jsonl.zst
19.9 kB
xet
about 2 months ago
6ed6f914
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020252.jsonl.zst
34.6 kB
xet
about 2 months ago
79e48048
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020256.jsonl.zst
55.6 kB
xet
about 2 months ago
b4b49b99
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020257.jsonl.zst
52.8 kB
xet
about 2 months ago
2ef7ae61
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020265.jsonl.zst
72 kB
xet
about 2 months ago
c3063645
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020267.jsonl.zst
18.8 kB
xet
about 2 months ago
9ad3e177
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020273.jsonl.zst
126 kB
xet
about 2 months ago
1c828a09
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020274.jsonl.zst
39.4 kB
xet
about 2 months ago
3c76e8c0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020278.jsonl.zst
27.9 kB
xet
about 2 months ago
f1fc45e1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020279.jsonl.zst
42.8 kB
xet
about 2 months ago
0f70ee6e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020289.jsonl.zst
48 kB
xet
about 2 months ago
7fdc2022
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020294.jsonl.zst
44.3 kB
xet
about 2 months ago
6d05d291
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020296.jsonl.zst
110 kB
xet
about 2 months ago
a35fa802
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020297.jsonl.zst
46.2 kB
xet
about 2 months ago
6094b3e9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020305.jsonl.zst
52.7 kB
xet
about 2 months ago
110ae6f0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020311.jsonl.zst
48.9 kB
xet
about 2 months ago
7f7fcbf0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020315.jsonl.zst
42.9 kB
xet
about 2 months ago
de3dfd81
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020318.jsonl.zst
40.4 kB
xet
about 2 months ago
c302c4c2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020321.jsonl.zst
243 kB
xet
about 2 months ago
0dc8dd7f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020324.jsonl.zst
42.4 kB
xet
about 2 months ago
ecb6d319
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020328.jsonl.zst
23.1 kB
xet
about 2 months ago
0c681884
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020334.jsonl.zst
97.2 kB
xet
about 2 months ago
44a6d0b0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020341.jsonl.zst
95 kB
xet
about 2 months ago
e6f73cc9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020342.jsonl.zst
31.9 kB
xet
about 2 months ago
729fc4cb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020344.jsonl.zst
68.2 kB
xet
about 2 months ago
1cc37390
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020348.jsonl.zst
91 kB
xet
about 2 months ago
ece38a53
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020349.jsonl.zst
86.7 kB
xet
about 2 months ago
5d9d86d9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020356.jsonl.zst
34.4 kB
xet
about 2 months ago
1bddba11
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020361.jsonl.zst
57 kB
xet
about 2 months ago
4aa1e800
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020365.jsonl.zst
13.8 kB
xet
about 2 months ago
5eb48c3d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020367.jsonl.zst
67.4 kB
xet
about 2 months ago
124624a8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020370.jsonl.zst
66.2 kB
xet
about 2 months ago
f8bc627f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020373.jsonl.zst
44.4 kB
xet
about 2 months ago
36e859b7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020386.jsonl.zst
12 kB
xet
about 2 months ago
5d2177fe
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020387.jsonl.zst
138 kB
xet
about 2 months ago
c6985d5d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020390.jsonl.zst
28.2 kB
xet
about 2 months ago
ee564056
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020394.jsonl.zst
61.5 kB
xet
about 2 months ago
bfa79015
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020396.jsonl.zst
60.4 kB
xet
about 2 months ago
108d0164
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020402.jsonl.zst
22.3 kB
xet
about 2 months ago
5e864656
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020409.jsonl.zst
32.9 kB
xet
about 2 months ago
69ba6210
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020411.jsonl.zst
63.9 kB
xet
about 2 months ago
9dbf5234
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020413.jsonl.zst
88.6 kB
xet
about 2 months ago
98b56ef2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020418.jsonl.zst
53.3 kB
xet
about 2 months ago
937ca6ff
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020419.jsonl.zst
21.2 kB
xet
about 2 months ago
5fc001a5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020426.jsonl.zst
31 kB
xet
about 2 months ago
0cbe0344
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020431.jsonl.zst
31.1 kB
xet
about 2 months ago
0940cbb9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020433.jsonl.zst
9.74 kB
xet
about 2 months ago
039b39af
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020437.jsonl.zst
15.9 kB
xet
about 2 months ago
6285ca71
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020441.jsonl.zst
67.9 kB
xet
about 2 months ago
ce7ba552
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00020442.jsonl.zst
227 kB
xet
about 2 months ago
d0a4aa55
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors