Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0059
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000484.jsonl.zst
229 kB
xet
about 2 months ago
68d50f6f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000485.jsonl.zst
229 kB
xet
about 2 months ago
97a82ae6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000486.jsonl.zst
279 kB
xet
about 2 months ago
9edcd574
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000487.jsonl.zst
182 kB
xet
about 2 months ago
04856ec1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000488.jsonl.zst
174 kB
xet
about 2 months ago
3f012529
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000489.jsonl.zst
235 kB
xet
about 2 months ago
86b715f6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000490.jsonl.zst
211 kB
xet
about 2 months ago
5bc9dc40
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000491.jsonl.zst
174 kB
xet
about 2 months ago
1d7bd121
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000492.jsonl.zst
221 kB
xet
about 2 months ago
45c09678
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000493.jsonl.zst
121 kB
xet
about 2 months ago
7c5a0ea9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000494.jsonl.zst
216 kB
xet
about 2 months ago
0af67c9d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000495.jsonl.zst
293 kB
xet
about 2 months ago
ca27c0bd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000496.jsonl.zst
209 kB
xet
about 2 months ago
b6b7b377
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000497.jsonl.zst
289 kB
xet
about 2 months ago
d580d0b1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000498.jsonl.zst
258 kB
xet
about 2 months ago
6543dce3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000499.jsonl.zst
252 kB
xet
about 2 months ago
f54a62d0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000500.jsonl.zst
142 kB
xet
about 2 months ago
ac523225
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000501.jsonl.zst
202 kB
xet
about 2 months ago
62eb56c6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000502.jsonl.zst
185 kB
xet
about 2 months ago
17fbc528
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000503.jsonl.zst
214 kB
xet
about 2 months ago
7b081c3e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000504.jsonl.zst
233 kB
xet
about 2 months ago
143ff022
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000505.jsonl.zst
237 kB
xet
about 2 months ago
dad936ca
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000506.jsonl.zst
178 kB
xet
about 2 months ago
3cc176c3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000507.jsonl.zst
330 kB
xet
about 2 months ago
a1566f5a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000508.jsonl.zst
230 kB
xet
about 2 months ago
b0d85ba3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000509.jsonl.zst
178 kB
xet
about 2 months ago
3b1ca720
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000510.jsonl.zst
146 kB
xet
about 2 months ago
6322699d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000511.jsonl.zst
148 kB
xet
about 2 months ago
0eea6b74
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000512.jsonl.zst
160 kB
xet
about 2 months ago
6d017057
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000513.jsonl.zst
234 kB
xet
about 2 months ago
3105c5be
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000514.jsonl.zst
189 kB
xet
about 2 months ago
b16ca056
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000515.jsonl.zst
88.8 kB
xet
about 2 months ago
662641d1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000516.jsonl.zst
148 kB
xet
about 2 months ago
012397f2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000517.jsonl.zst
170 kB
xet
about 2 months ago
45530a5c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000518.jsonl.zst
270 kB
xet
about 2 months ago
86f4bc50
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000519.jsonl.zst
197 kB
xet
about 2 months ago
374a1dc3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000520.jsonl.zst
128 kB
xet
about 2 months ago
ca2242cb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000521.jsonl.zst
185 kB
xet
about 2 months ago
23ffa56b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000522.jsonl.zst
157 kB
xet
about 2 months ago
7857bee9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000523.jsonl.zst
134 kB
xet
about 2 months ago
8d36bb0f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000524.jsonl.zst
153 kB
xet
about 2 months ago
a6befe99
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000525.jsonl.zst
238 kB
xet
about 2 months ago
e870c334
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000526.jsonl.zst
131 kB
xet
about 2 months ago
bb2cf710
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000527.jsonl.zst
169 kB
xet
about 2 months ago
24d2741a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000528.jsonl.zst
212 kB
xet
about 2 months ago
4db00cf0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000529.jsonl.zst
237 kB
xet
about 2 months ago
7196cfc5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000530.jsonl.zst
166 kB
xet
about 2 months ago
9b3adf5e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000531.jsonl.zst
86.5 kB
xet
about 2 months ago
0c137052
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000532.jsonl.zst
168 kB
xet
about 2 months ago
20090e86
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000533.jsonl.zst
148 kB
xet
about 2 months ago
9a3a7fd6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000534.jsonl.zst
136 kB
xet
about 2 months ago
6b1afc83
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000535.jsonl.zst
174 kB
xet
about 2 months ago
d8d1428c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000536.jsonl.zst
194 kB
xet
about 2 months ago
566d3da8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000537.jsonl.zst
172 kB
xet
about 2 months ago
a6b77229
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000538.jsonl.zst
131 kB
xet
about 2 months ago
588dfa80
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000539.jsonl.zst
202 kB
xet
about 2 months ago
778ecc42
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000540.jsonl.zst
126 kB
xet
about 2 months ago
73635a22
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000541.jsonl.zst
169 kB
xet
about 2 months ago
9472d0af
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000542.jsonl.zst
160 kB
xet
about 2 months ago
832a1f91
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000543.jsonl.zst
209 kB
xet
about 2 months ago
afc46843
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000544.jsonl.zst
120 kB
xet
about 2 months ago
9cb38ca3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000545.jsonl.zst
106 kB
xet
about 2 months ago
5f166902
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000546.jsonl.zst
179 kB
xet
about 2 months ago
ff0a0899
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000547.jsonl.zst
246 kB
xet
about 2 months ago
b7fc600c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000548.jsonl.zst
271 kB
xet
about 2 months ago
a857a082
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000549.jsonl.zst
204 kB
xet
about 2 months ago
29cc3780
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000550.jsonl.zst
230 kB
xet
about 2 months ago
0bec0c8f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000551.jsonl.zst
301 kB
xet
about 2 months ago
dbe44341
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000552.jsonl.zst
211 kB
xet
about 2 months ago
2d12c71a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000553.jsonl.zst
173 kB
xet
about 2 months ago
78138e0a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000554.jsonl.zst
155 kB
xet
about 2 months ago
4d29f72d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000555.jsonl.zst
356 kB
xet
about 2 months ago
9864ac94
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000556.jsonl.zst
121 kB
xet
about 2 months ago
97671735
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000557.jsonl.zst
278 kB
xet
about 2 months ago
ea40ddb3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000558.jsonl.zst
199 kB
xet
about 2 months ago
565fac64
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000559.jsonl.zst
187 kB
xet
about 2 months ago
313b38e6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000560.jsonl.zst
111 kB
xet
about 2 months ago
138803ea
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000561.jsonl.zst
169 kB
xet
about 2 months ago
01f82f98
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000562.jsonl.zst
200 kB
xet
about 2 months ago
2363c6e0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000563.jsonl.zst
212 kB
xet
about 2 months ago
4f02ae3b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000564.jsonl.zst
183 kB
xet
about 2 months ago
e6b47ec4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000565.jsonl.zst
158 kB
xet
about 2 months ago
a7d41fd5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000566.jsonl.zst
236 kB
xet
about 2 months ago
9629adf6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000567.jsonl.zst
179 kB
xet
about 2 months ago
b06ac2c6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000568.jsonl.zst
169 kB
xet
about 2 months ago
e6c052ac
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000569.jsonl.zst
139 kB
xet
about 2 months ago
a2e0a3eb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000570.jsonl.zst
154 kB
xet
about 2 months ago
edbce50c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000571.jsonl.zst
271 kB
xet
about 2 months ago
3f16c8c2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000572.jsonl.zst
149 kB
xet
about 2 months ago
dd0595d7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000573.jsonl.zst
127 kB
xet
about 2 months ago
0b9a4a27
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000574.jsonl.zst
275 kB
xet
about 2 months ago
ff6b00fc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000575.jsonl.zst
206 kB
xet
about 2 months ago
9b1cfdf5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000576.jsonl.zst
254 kB
xet
about 2 months ago
abc177a5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000577.jsonl.zst
161 kB
xet
about 2 months ago
62d0929d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000578.jsonl.zst
260 kB
xet
about 2 months ago
28c64328
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000579.jsonl.zst
129 kB
xet
about 2 months ago
edc4bbe5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000580.jsonl.zst
288 kB
xet
about 2 months ago
6dee6243
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000581.jsonl.zst
134 kB
xet
about 2 months ago
7010df88
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000582.jsonl.zst
171 kB
xet
about 2 months ago
26d55986
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000583.jsonl.zst
202 kB
xet
about 2 months ago
fdf19e3e
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors