Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0087
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010202.jsonl.zst
41.9 kB
xet
about 2 months ago
34019285
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010220.jsonl.zst
110 kB
xet
about 2 months ago
9f9a28da
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010226.jsonl.zst
6.56 kB
xet
about 2 months ago
afd220ae
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010232.jsonl.zst
25.5 kB
xet
about 2 months ago
497ee217
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010233.jsonl.zst
89 kB
xet
about 2 months ago
82d3c3e0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010237.jsonl.zst
19.2 kB
xet
about 2 months ago
05bef2d6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010267.jsonl.zst
46.3 kB
xet
about 2 months ago
a9230b1a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010287.jsonl.zst
39.9 kB
xet
about 2 months ago
c452496c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010291.jsonl.zst
45.9 kB
xet
about 2 months ago
f817c33a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010294.jsonl.zst
39.8 kB
xet
about 2 months ago
5d76b80b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010304.jsonl.zst
30.3 kB
xet
about 2 months ago
2548ec6f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010346.jsonl.zst
48.5 kB
xet
about 2 months ago
e9ba242c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010348.jsonl.zst
44.8 kB
xet
about 2 months ago
a572541d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010351.jsonl.zst
60.3 kB
xet
about 2 months ago
906b0b73
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010356.jsonl.zst
91.4 kB
xet
about 2 months ago
75420f4a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010365.jsonl.zst
62.3 kB
xet
about 2 months ago
7bdfe7d9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010392.jsonl.zst
71.4 kB
xet
about 2 months ago
b24c6aed
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010404.jsonl.zst
92.3 kB
xet
about 2 months ago
c03694cf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010410.jsonl.zst
82.5 kB
xet
about 2 months ago
e82ab941
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010413.jsonl.zst
34.6 kB
xet
about 2 months ago
bbe28b2e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010417.jsonl.zst
61.6 kB
xet
about 2 months ago
2164b0da
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010423.jsonl.zst
182 kB
xet
about 2 months ago
258b11e5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010453.jsonl.zst
29.1 kB
xet
about 2 months ago
149fbd7a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010460.jsonl.zst
214 kB
xet
about 2 months ago
143846e2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010470.jsonl.zst
203 kB
xet
about 2 months ago
1ff0aa96
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010473.jsonl.zst
118 kB
xet
about 2 months ago
cb7b2cc8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010478.jsonl.zst
17.5 kB
xet
about 2 months ago
36cb24a8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010488.jsonl.zst
44.8 kB
xet
about 2 months ago
7d18d1af
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010514.jsonl.zst
15.3 kB
xet
about 2 months ago
9b549ac1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010520.jsonl.zst
4.55 kB
xet
about 2 months ago
52d4b4d7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010524.jsonl.zst
343 kB
xet
about 2 months ago
fdb124fe
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010537.jsonl.zst
129 kB
xet
about 2 months ago
e99797ff
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010553.jsonl.zst
21.9 kB
xet
about 2 months ago
5bb4ed59
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010566.jsonl.zst
50.1 kB
xet
about 2 months ago
3ab86ade
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010582.jsonl.zst
12.1 kB
xet
about 2 months ago
efeb107a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010593.jsonl.zst
17.4 kB
xet
about 2 months ago
b0c13605
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010598.jsonl.zst
20.4 kB
xet
about 2 months ago
65ea552e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010600.jsonl.zst
231 kB
xet
about 2 months ago
df64f01d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010614.jsonl.zst
76.7 kB
xet
about 2 months ago
5d64d268
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010626.jsonl.zst
53.9 kB
xet
about 2 months ago
ce09cdab
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010645.jsonl.zst
149 kB
xet
about 2 months ago
a0e41702
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010653.jsonl.zst
21.9 kB
xet
about 2 months ago
06994de6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010655.jsonl.zst
41.3 kB
xet
about 2 months ago
56b5da02
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010662.jsonl.zst
305 kB
xet
about 2 months ago
f6103514
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010684.jsonl.zst
41.2 kB
xet
about 2 months ago
ee5f964a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010704.jsonl.zst
59.2 kB
xet
about 2 months ago
30d45bcc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010712.jsonl.zst
9.97 kB
xet
about 2 months ago
e5dd1794
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010714.jsonl.zst
59.2 kB
xet
about 2 months ago
4efe16b9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010721.jsonl.zst
45.3 kB
xet
about 2 months ago
8bcc6b6b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010740.jsonl.zst
33.3 kB
xet
about 2 months ago
25ceb89c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010741.jsonl.zst
87.8 kB
xet
about 2 months ago
bb6e0412
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010768.jsonl.zst
13 kB
xet
about 2 months ago
9b1048ff
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010777.jsonl.zst
39 kB
xet
about 2 months ago
740a0ceb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010798.jsonl.zst
17.6 kB
xet
about 2 months ago
84f9d599
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010802.jsonl.zst
59.5 kB
xet
about 2 months ago
57d43308
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010831.jsonl.zst
153 kB
xet
about 2 months ago
c9862dd3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010833.jsonl.zst
62.6 kB
xet
about 2 months ago
57f8cf90
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010840.jsonl.zst
17.7 kB
xet
about 2 months ago
060fb7f3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010843.jsonl.zst
2.46 kB
xet
about 2 months ago
6da44e40
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010854.jsonl.zst
17.6 kB
xet
about 2 months ago
4ef01761
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010863.jsonl.zst
36.3 kB
xet
about 2 months ago
4dc5d7b3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010890.jsonl.zst
216 kB
xet
about 2 months ago
26431ba0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010895.jsonl.zst
58.5 kB
xet
about 2 months ago
04db63fe
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010896.jsonl.zst
19 kB
xet
about 2 months ago
ded400d8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010902.jsonl.zst
15.1 kB
xet
about 2 months ago
7c8e249a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010909.jsonl.zst
52 kB
xet
about 2 months ago
d77be337
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010926.jsonl.zst
150 kB
xet
about 2 months ago
30bbaff6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010950.jsonl.zst
124 kB
xet
about 2 months ago
d8df5571
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010956.jsonl.zst
172 kB
xet
about 2 months ago
7502a802
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010961.jsonl.zst
17.1 kB
xet
about 2 months ago
aa877156
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010962.jsonl.zst
48.5 kB
xet
about 2 months ago
074cec1a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010968.jsonl.zst
58.5 kB
xet
about 2 months ago
8ca6b511
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00010990.jsonl.zst
32 kB
xet
about 2 months ago
7490c5d6
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011008.jsonl.zst
13.6 kB
xet
about 2 months ago
eff4ab5b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011018.jsonl.zst
37.4 kB
xet
about 2 months ago
f8779b77
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011021.jsonl.zst
17.6 kB
xet
about 2 months ago
8d8160b3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011028.jsonl.zst
2.34 kB
xet
about 2 months ago
e6b6abff
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011054.jsonl.zst
11.7 kB
xet
about 2 months ago
754ff276
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011078.jsonl.zst
97.4 kB
xet
about 2 months ago
da43213c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011080.jsonl.zst
14.8 kB
xet
about 2 months ago
d74cbabf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011085.jsonl.zst
87.2 kB
xet
about 2 months ago
f5ec3829
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011086.jsonl.zst
35.3 kB
xet
about 2 months ago
484c21e8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011114.jsonl.zst
5.58 kB
xet
about 2 months ago
19d97ba9
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011130.jsonl.zst
46.4 kB
xet
about 2 months ago
93da292e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011140.jsonl.zst
51.6 kB
xet
about 2 months ago
e9749264
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011149.jsonl.zst
38 kB
xet
about 2 months ago
96abe1f0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011152.jsonl.zst
136 kB
xet
about 2 months ago
bbc99278
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011176.jsonl.zst
39 kB
xet
about 2 months ago
ee589c97
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011189.jsonl.zst
85.4 kB
xet
about 2 months ago
a409b0d5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011201.jsonl.zst
8 kB
xet
about 2 months ago
97ccdbc0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011209.jsonl.zst
79.3 kB
xet
about 2 months ago
13acfa0a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011210.jsonl.zst
74.9 kB
xet
about 2 months ago
089bd742
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011214.jsonl.zst
24 kB
xet
about 2 months ago
4de04038
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011235.jsonl.zst
3.52 kB
xet
about 2 months ago
baccbea2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011250.jsonl.zst
47.7 kB
xet
about 2 months ago
931bbed0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011265.jsonl.zst
38.7 kB
xet
about 2 months ago
9793857d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011269.jsonl.zst
35.6 kB
xet
about 2 months ago
e98f9256
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011276.jsonl.zst
35.3 kB
xet
about 2 months ago
48cc681f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011296.jsonl.zst
85.4 kB
xet
about 2 months ago
d2364fa8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_007__data__olmocr_science_pdfs-health__shard_00011303.jsonl.zst
16.4 kB
xet
about 2 months ago
0f71b946
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors