Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0091
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021804.jsonl.zst
21.6 kB
xet
about 2 months ago
5b392a53
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021807.jsonl.zst
230 kB
xet
about 2 months ago
5d4c058c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021811.jsonl.zst
77.2 kB
xet
about 2 months ago
e49fe45b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021813.jsonl.zst
46.6 kB
xet
about 2 months ago
07f7ccbb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021814.jsonl.zst
57.6 kB
xet
about 2 months ago
814889c7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021821.jsonl.zst
159 kB
xet
about 2 months ago
ad30b1cb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021824.jsonl.zst
145 kB
xet
about 2 months ago
b49a86ec
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021828.jsonl.zst
24.4 kB
xet
about 2 months ago
bc13132c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021829.jsonl.zst
263 kB
xet
about 2 months ago
9b7916fb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021833.jsonl.zst
35 kB
xet
about 2 months ago
2b57d231
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021838.jsonl.zst
26.6 kB
xet
about 2 months ago
bd39d16f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021842.jsonl.zst
76.9 kB
xet
about 2 months ago
60dc7e3a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021845.jsonl.zst
97 kB
xet
about 2 months ago
16900dc4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021848.jsonl.zst
116 kB
xet
about 2 months ago
2a2e2968
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021855.jsonl.zst
45.5 kB
xet
about 2 months ago
ed3c6e70
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021862.jsonl.zst
81.3 kB
xet
about 2 months ago
8ef99e92
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021863.jsonl.zst
66.6 kB
xet
about 2 months ago
fe48cb66
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021871.jsonl.zst
28.1 kB
xet
about 2 months ago
762cb18f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021877.jsonl.zst
35.8 kB
xet
about 2 months ago
b8308440
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021879.jsonl.zst
46.1 kB
xet
about 2 months ago
f25db13b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021880.jsonl.zst
96.4 kB
xet
about 2 months ago
16504b85
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021888.jsonl.zst
343 kB
xet
about 2 months ago
b153bef5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021892.jsonl.zst
48.1 kB
xet
about 2 months ago
f380ba3e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021897.jsonl.zst
41.2 kB
xet
about 2 months ago
bae52217
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021909.jsonl.zst
17.2 kB
xet
about 2 months ago
c8f754b2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021913.jsonl.zst
79.9 kB
xet
about 2 months ago
b4d864ae
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021914.jsonl.zst
84.9 kB
xet
about 2 months ago
c04bd054
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021915.jsonl.zst
29.7 kB
xet
about 2 months ago
ba5f1b43
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021921.jsonl.zst
50 kB
xet
about 2 months ago
a5c1a298
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021926.jsonl.zst
31.4 kB
xet
about 2 months ago
7f54b9c3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021928.jsonl.zst
9.35 kB
xet
about 2 months ago
c4e1fd50
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021931.jsonl.zst
3.02 MB
xet
about 2 months ago
ba2b33cf
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021932.jsonl.zst
141 kB
xet
about 2 months ago
0cebb817
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021936.jsonl.zst
149 kB
xet
about 2 months ago
0710de01
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021942.jsonl.zst
4.07 kB
xet
about 2 months ago
e89073b5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021945.jsonl.zst
7.28 kB
xet
about 2 months ago
383babac
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021949.jsonl.zst
172 kB
xet
about 2 months ago
978f4637
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021954.jsonl.zst
105 kB
xet
about 2 months ago
7106100c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021959.jsonl.zst
37.5 kB
xet
about 2 months ago
824b3a8f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021962.jsonl.zst
65.7 kB
xet
about 2 months ago
6902955a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021965.jsonl.zst
19.4 kB
xet
about 2 months ago
73c82c4b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021966.jsonl.zst
42.8 kB
xet
about 2 months ago
52f60ec2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021970.jsonl.zst
104 kB
xet
about 2 months ago
14e77aa2
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021975.jsonl.zst
58.8 kB
xet
about 2 months ago
5743920b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021980.jsonl.zst
19.4 kB
xet
about 2 months ago
3735ede1
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021982.jsonl.zst
36.7 kB
xet
about 2 months ago
6eac15e8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021983.jsonl.zst
65.5 kB
xet
about 2 months ago
fd0e1a72
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021987.jsonl.zst
116 kB
xet
about 2 months ago
aa5bfb05
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021993.jsonl.zst
46.6 kB
xet
about 2 months ago
cc6eb1b0
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021996.jsonl.zst
25.7 kB
xet
about 2 months ago
3d0da67e
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00021999.jsonl.zst
3.35 kB
xet
about 2 months ago
c3f29e17
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022000.jsonl.zst
981 kB
xet
about 2 months ago
d68bfa77
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022003.jsonl.zst
30 kB
xet
about 2 months ago
2daa6977
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022009.jsonl.zst
49.3 kB
xet
about 2 months ago
98f0d168
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022013.jsonl.zst
41.1 kB
xet
about 2 months ago
c23ac82b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022016.jsonl.zst
8.87 kB
xet
about 2 months ago
5532f6b4
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022017.jsonl.zst
3.25 kB
xet
about 2 months ago
c23c4075
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022025.jsonl.zst
55.6 kB
xet
about 2 months ago
4a890402
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022030.jsonl.zst
184 kB
xet
about 2 months ago
30779e47
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022033.jsonl.zst
17.9 kB
xet
about 2 months ago
c921fa9b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022034.jsonl.zst
120 kB
xet
about 2 months ago
a90b444b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022036.jsonl.zst
85.7 kB
xet
about 2 months ago
64a19eee
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022043.jsonl.zst
43.4 kB
xet
about 2 months ago
44ed57fc
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022050.jsonl.zst
57.8 kB
xet
about 2 months ago
6588834f
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022051.jsonl.zst
106 kB
xet
about 2 months ago
e554bfc8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022052.jsonl.zst
22.2 kB
xet
about 2 months ago
27f5d506
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022059.jsonl.zst
42 kB
xet
about 2 months ago
fce65d82
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022063.jsonl.zst
50 kB
xet
about 2 months ago
09706e5c
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022068.jsonl.zst
620 kB
xet
about 2 months ago
a491a004
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022069.jsonl.zst
48.7 kB
xet
about 2 months ago
11223058
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022076.jsonl.zst
32.1 kB
xet
about 2 months ago
546769bb
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022079.jsonl.zst
40.6 kB
xet
about 2 months ago
6de2103d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022085.jsonl.zst
141 kB
xet
about 2 months ago
2df5ee48
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022086.jsonl.zst
186 kB
xet
about 2 months ago
5ff1f6a7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022093.jsonl.zst
47 kB
xet
about 2 months ago
3d95eb1a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022094.jsonl.zst
33.7 kB
xet
about 2 months ago
133953b8
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022100.jsonl.zst
71.9 kB
xet
about 2 months ago
3380dc25
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022101.jsonl.zst
50.7 kB
xet
about 2 months ago
00fd19e7
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022108.jsonl.zst
15 kB
xet
about 2 months ago
de43085a
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022111.jsonl.zst
55.1 kB
xet
about 2 months ago
40f601e5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022115.jsonl.zst
83.5 kB
xet
about 2 months ago
40d347a5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022117.jsonl.zst
557 kB
xet
about 2 months ago
7c0ff663
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022118.jsonl.zst
42 kB
xet
about 2 months ago
79dc365d
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022123.jsonl.zst
84.7 kB
xet
about 2 months ago
50216c75
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022127.jsonl.zst
30.9 kB
xet
about 2 months ago
72c1f847
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022130.jsonl.zst
28.6 kB
xet
about 2 months ago
5f3d3070
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022131.jsonl.zst
41.2 kB
xet
about 2 months ago
c6bda279
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022134.jsonl.zst
36.2 kB
xet
about 2 months ago
0b7d960b
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022142.jsonl.zst
47.6 kB
xet
about 2 months ago
e6724a73
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022147.jsonl.zst
75.2 kB
xet
about 2 months ago
0b9c0782
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022150.jsonl.zst
58.6 kB
xet
about 2 months ago
5cf3a2ce
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022157.jsonl.zst
53.1 kB
xet
about 2 months ago
f2856ab5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022158.jsonl.zst
42.1 kB
xet
about 2 months ago
213096d5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022161.jsonl.zst
69 kB
xet
about 2 months ago
2a84e244
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022162.jsonl.zst
54.8 kB
xet
about 2 months ago
76127fb3
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022166.jsonl.zst
15.4 kB
xet
about 2 months ago
ee6f2410
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022172.jsonl.zst
161 kB
xet
about 2 months ago
d7070939
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022175.jsonl.zst
13 kB
xet
about 2 months ago
4663ed60
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022179.jsonl.zst
25.6 kB
xet
about 2 months ago
f65e1be5
soc127__phase1_pool_shared__olmocr_science_pdfs__part_008__data__olmocr_science_pdfs-health__shard_00022182.jsonl.zst
36.6 kB
xet
about 2 months ago
5ab63b1a
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors