Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0056
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000625.jsonl.zst
266 kB
xet
about 2 months ago
7daf1660
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000626.jsonl.zst
289 kB
xet
about 2 months ago
431deb5a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000627.jsonl.zst
366 kB
xet
about 2 months ago
e899ba10
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000630.jsonl.zst
282 kB
xet
about 2 months ago
602e4d97
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000631.jsonl.zst
321 kB
xet
about 2 months ago
f81bf264
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000632.jsonl.zst
203 kB
xet
about 2 months ago
4824014b
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000633.jsonl.zst
161 kB
xet
about 2 months ago
6358cbcc
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000634.jsonl.zst
187 kB
xet
about 2 months ago
22bcfbf3
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000636.jsonl.zst
225 kB
xet
about 2 months ago
1a79728f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000637.jsonl.zst
263 kB
xet
about 2 months ago
0f6ed665
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000638.jsonl.zst
239 kB
xet
about 2 months ago
34b76be2
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000639.jsonl.zst
212 kB
xet
about 2 months ago
1e231f0b
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000640.jsonl.zst
280 kB
xet
about 2 months ago
9f276092
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000641.jsonl.zst
217 kB
xet
about 2 months ago
d65659c1
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000642.jsonl.zst
220 kB
xet
about 2 months ago
01889b70
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000643.jsonl.zst
183 kB
xet
about 2 months ago
30fa1bf2
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000644.jsonl.zst
233 kB
xet
about 2 months ago
4be8e646
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000645.jsonl.zst
179 kB
xet
about 2 months ago
f3f6a2cd
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000646.jsonl.zst
295 kB
xet
about 2 months ago
cabe0d6f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000647.jsonl.zst
147 kB
xet
about 2 months ago
64b3366f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000649.jsonl.zst
215 kB
xet
about 2 months ago
70fd9642
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000650.jsonl.zst
244 kB
xet
about 2 months ago
f24e6dfa
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000652.jsonl.zst
195 kB
xet
about 2 months ago
2d5c0613
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000653.jsonl.zst
196 kB
xet
about 2 months ago
ceb5e627
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000654.jsonl.zst
221 kB
xet
about 2 months ago
cc28ba4f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000656.jsonl.zst
167 kB
xet
about 2 months ago
e7c30e5a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000657.jsonl.zst
408 kB
xet
about 2 months ago
e94e4d0c
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000658.jsonl.zst
535 kB
xet
about 2 months ago
f8beaecc
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000659.jsonl.zst
358 kB
xet
about 2 months ago
2f2e698e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000660.jsonl.zst
179 kB
xet
about 2 months ago
9fb69e4c
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000661.jsonl.zst
192 kB
xet
about 2 months ago
e64f4016
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000662.jsonl.zst
243 kB
xet
about 2 months ago
95bd544c
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000664.jsonl.zst
362 kB
xet
about 2 months ago
f46d1810
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000665.jsonl.zst
411 kB
xet
about 2 months ago
77827a1d
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000667.jsonl.zst
214 kB
xet
about 2 months ago
14e62a84
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000668.jsonl.zst
252 kB
xet
about 2 months ago
02283b17
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000669.jsonl.zst
568 kB
xet
about 2 months ago
cbad14dd
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000670.jsonl.zst
220 kB
xet
about 2 months ago
3f6929d5
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000671.jsonl.zst
186 kB
xet
about 2 months ago
359323f0
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000672.jsonl.zst
272 kB
xet
about 2 months ago
7f7b2057
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000673.jsonl.zst
312 kB
xet
about 2 months ago
04683d86
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000675.jsonl.zst
232 kB
xet
about 2 months ago
43a7f26f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000677.jsonl.zst
219 kB
xet
about 2 months ago
097b0dcf
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000678.jsonl.zst
163 kB
xet
about 2 months ago
1920b01a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000679.jsonl.zst
276 kB
xet
about 2 months ago
9ae0e6ec
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000680.jsonl.zst
223 kB
xet
about 2 months ago
584187ba
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000681.jsonl.zst
254 kB
xet
about 2 months ago
5b1b65f6
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000682.jsonl.zst
342 kB
xet
about 2 months ago
6f211805
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000683.jsonl.zst
263 kB
xet
about 2 months ago
edaf019c
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000684.jsonl.zst
256 kB
xet
about 2 months ago
9736a8dc
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000685.jsonl.zst
159 kB
xet
about 2 months ago
915bec80
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000686.jsonl.zst
250 kB
xet
about 2 months ago
820ff5a8
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000687.jsonl.zst
262 kB
xet
about 2 months ago
2ef71437
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000689.jsonl.zst
271 kB
xet
about 2 months ago
c033aa85
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000690.jsonl.zst
149 kB
xet
about 2 months ago
6122f9ef
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000691.jsonl.zst
238 kB
xet
about 2 months ago
afb3351e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000692.jsonl.zst
281 kB
xet
about 2 months ago
14846155
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000693.jsonl.zst
145 kB
xet
about 2 months ago
85762c08
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000694.jsonl.zst
214 kB
xet
about 2 months ago
f119a9d0
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000695.jsonl.zst
171 kB
xet
about 2 months ago
9c660905
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000696.jsonl.zst
245 kB
xet
about 2 months ago
bd3f605a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000699.jsonl.zst
106 kB
xet
about 2 months ago
4de5c5c0
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000700.jsonl.zst
247 kB
xet
about 2 months ago
f39fc457
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000701.jsonl.zst
297 kB
xet
about 2 months ago
0857c40a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000702.jsonl.zst
298 kB
xet
about 2 months ago
403c4047
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000703.jsonl.zst
170 kB
xet
about 2 months ago
1522a30a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000704.jsonl.zst
174 kB
xet
about 2 months ago
6a7ceebc
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000705.jsonl.zst
212 kB
xet
about 2 months ago
461b1533
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000706.jsonl.zst
219 kB
xet
about 2 months ago
edaf98b7
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000707.jsonl.zst
311 kB
xet
about 2 months ago
7e01b778
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000708.jsonl.zst
161 kB
xet
about 2 months ago
9d20e115
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000000.jsonl.zst
229 kB
xet
about 2 months ago
8571fc5a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000001.jsonl.zst
143 kB
xet
about 2 months ago
edd893ff
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000002.jsonl.zst
190 kB
xet
about 2 months ago
5b92a8f8
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000003.jsonl.zst
193 kB
xet
about 2 months ago
5c85348e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000004.jsonl.zst
256 kB
xet
about 2 months ago
700ec4b7
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000005.jsonl.zst
407 kB
xet
about 2 months ago
f762f2f4
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000006.jsonl.zst
399 kB
xet
about 2 months ago
9e91d343
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000007.jsonl.zst
287 kB
xet
about 2 months ago
cdae7b6e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000008.jsonl.zst
140 kB
xet
about 2 months ago
04a3d4db
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000009.jsonl.zst
199 kB
xet
about 2 months ago
0ec85c16
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000010.jsonl.zst
173 kB
xet
about 2 months ago
ad9695bd
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000011.jsonl.zst
172 kB
xet
about 2 months ago
bc756a5e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000012.jsonl.zst
188 kB
xet
about 2 months ago
d0870e27
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000013.jsonl.zst
173 kB
xet
about 2 months ago
fb3daa90
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000014.jsonl.zst
318 kB
xet
about 2 months ago
037fac86
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000015.jsonl.zst
207 kB
xet
about 2 months ago
fa6c40b1
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000016.jsonl.zst
258 kB
xet
about 2 months ago
cde73682
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000017.jsonl.zst
155 kB
xet
about 2 months ago
4d3c376b
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000018.jsonl.zst
364 kB
xet
about 2 months ago
cb75b62d
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000019.jsonl.zst
239 kB
xet
about 2 months ago
6d8f1abb
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000020.jsonl.zst
333 kB
xet
about 2 months ago
79dc71ed
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000021.jsonl.zst
332 kB
xet
about 2 months ago
f4555928
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000022.jsonl.zst
157 kB
xet
about 2 months ago
b83f605e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000023.jsonl.zst
285 kB
xet
about 2 months ago
ada8678f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000024.jsonl.zst
161 kB
xet
about 2 months ago
95986d3b
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000025.jsonl.zst
181 kB
xet
about 2 months ago
8d11e87d
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000026.jsonl.zst
272 kB
xet
about 2 months ago
f7dc8ff6
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000027.jsonl.zst
298 kB
xet
about 2 months ago
2a210772
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0014__shard_00000028.jsonl.zst
176 kB
xet
about 2 months ago
577424ad
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors