Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0055
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000088.jsonl.zst
184 kB
xet
about 2 months ago
1ca291a4
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000089.jsonl.zst
261 kB
xet
about 2 months ago
c41d678a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000091.jsonl.zst
231 kB
xet
about 2 months ago
8c1e49ee
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000092.jsonl.zst
245 kB
xet
about 2 months ago
5ef6733e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000093.jsonl.zst
217 kB
xet
about 2 months ago
f16d9863
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000094.jsonl.zst
150 kB
xet
about 2 months ago
6323fc62
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000095.jsonl.zst
288 kB
xet
about 2 months ago
2c20cd86
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000096.jsonl.zst
277 kB
xet
about 2 months ago
7a6b033f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000097.jsonl.zst
327 kB
xet
about 2 months ago
62cbbaa1
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000098.jsonl.zst
127 kB
xet
about 2 months ago
bf4b640f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000099.jsonl.zst
446 kB
xet
about 2 months ago
d78be080
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000100.jsonl.zst
191 kB
xet
about 2 months ago
b8e0ba26
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000101.jsonl.zst
210 kB
xet
about 2 months ago
980da16e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000103.jsonl.zst
154 kB
xet
about 2 months ago
263ff804
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000104.jsonl.zst
171 kB
xet
about 2 months ago
cb194920
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000105.jsonl.zst
159 kB
xet
about 2 months ago
95ade1e3
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000106.jsonl.zst
308 kB
xet
about 2 months ago
0e0975a1
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000107.jsonl.zst
202 kB
xet
about 2 months ago
0b441e33
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000110.jsonl.zst
251 kB
xet
about 2 months ago
2bdc16cd
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000111.jsonl.zst
294 kB
xet
about 2 months ago
88da4900
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000112.jsonl.zst
201 kB
xet
about 2 months ago
3dabb813
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000113.jsonl.zst
241 kB
xet
about 2 months ago
d0737bb4
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000114.jsonl.zst
387 kB
xet
about 2 months ago
3ff71b5f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000115.jsonl.zst
256 kB
xet
about 2 months ago
0baa7872
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000117.jsonl.zst
224 kB
xet
about 2 months ago
cc564fbf
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000120.jsonl.zst
158 kB
xet
about 2 months ago
205005c3
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000121.jsonl.zst
160 kB
xet
about 2 months ago
d3da6c85
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000122.jsonl.zst
252 kB
xet
about 2 months ago
0ddc7047
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000123.jsonl.zst
313 kB
xet
about 2 months ago
453862ac
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000124.jsonl.zst
173 kB
xet
about 2 months ago
2aaaf8c2
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000125.jsonl.zst
188 kB
xet
about 2 months ago
3a5bc6fd
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000126.jsonl.zst
210 kB
xet
about 2 months ago
240d6321
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000127.jsonl.zst
295 kB
xet
about 2 months ago
045ef9fe
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000128.jsonl.zst
172 kB
xet
about 2 months ago
42117de8
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000129.jsonl.zst
388 kB
xet
about 2 months ago
73a4c5de
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000130.jsonl.zst
267 kB
xet
about 2 months ago
d26a8831
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000132.jsonl.zst
180 kB
xet
about 2 months ago
8064b99a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000133.jsonl.zst
286 kB
xet
about 2 months ago
4e341ab3
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000134.jsonl.zst
162 kB
xet
about 2 months ago
487fecd1
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000135.jsonl.zst
238 kB
xet
about 2 months ago
6f70a6d6
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000137.jsonl.zst
291 kB
xet
about 2 months ago
9349d1e7
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000138.jsonl.zst
342 kB
xet
about 2 months ago
ebff7cb5
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000139.jsonl.zst
204 kB
xet
about 2 months ago
22f4d0c6
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000140.jsonl.zst
123 kB
xet
about 2 months ago
d724decf
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000141.jsonl.zst
99.5 kB
xet
about 2 months ago
f679e330
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000144.jsonl.zst
215 kB
xet
about 2 months ago
35c7a24f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000146.jsonl.zst
183 kB
xet
about 2 months ago
0d15d37f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000147.jsonl.zst
313 kB
xet
about 2 months ago
e7716ef5
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000149.jsonl.zst
206 kB
xet
about 2 months ago
5999dd6c
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000150.jsonl.zst
261 kB
xet
about 2 months ago
14592630
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000151.jsonl.zst
283 kB
xet
about 2 months ago
2456bbc9
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000152.jsonl.zst
254 kB
xet
about 2 months ago
3a0da525
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000153.jsonl.zst
190 kB
xet
about 2 months ago
18aa2019
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000154.jsonl.zst
239 kB
xet
about 2 months ago
1e1713d9
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000155.jsonl.zst
372 kB
xet
about 2 months ago
6b179a4c
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000156.jsonl.zst
399 kB
xet
about 2 months ago
5e88dbb2
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000158.jsonl.zst
161 kB
xet
about 2 months ago
6a9a7e22
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000159.jsonl.zst
209 kB
xet
about 2 months ago
579eca7c
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000160.jsonl.zst
202 kB
xet
about 2 months ago
dbc06ef6
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000161.jsonl.zst
259 kB
xet
about 2 months ago
fa4695fb
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000162.jsonl.zst
128 kB
xet
about 2 months ago
b437fed2
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000163.jsonl.zst
309 kB
xet
about 2 months ago
d5390c56
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000164.jsonl.zst
187 kB
xet
about 2 months ago
62fff426
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000165.jsonl.zst
455 kB
xet
about 2 months ago
350c054a
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000166.jsonl.zst
363 kB
xet
about 2 months ago
fd79f693
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000167.jsonl.zst
154 kB
xet
about 2 months ago
afb1dac6
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000171.jsonl.zst
361 kB
xet
about 2 months ago
44bb9079
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000172.jsonl.zst
334 kB
xet
about 2 months ago
54759644
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000174.jsonl.zst
385 kB
xet
about 2 months ago
19fe50ea
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000175.jsonl.zst
367 kB
xet
about 2 months ago
9b7a5ae3
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000176.jsonl.zst
193 kB
xet
about 2 months ago
379583fc
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000177.jsonl.zst
226 kB
xet
about 2 months ago
4081e6bd
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000178.jsonl.zst
215 kB
xet
about 2 months ago
880d3e85
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000179.jsonl.zst
215 kB
xet
about 2 months ago
b26f3c44
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000180.jsonl.zst
190 kB
xet
about 2 months ago
350a9481
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000181.jsonl.zst
160 kB
xet
about 2 months ago
a7ebc2f6
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000182.jsonl.zst
300 kB
xet
about 2 months ago
8a41d4e1
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000183.jsonl.zst
205 kB
xet
about 2 months ago
26c83397
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000184.jsonl.zst
206 kB
xet
about 2 months ago
721565b8
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000185.jsonl.zst
159 kB
xet
about 2 months ago
cfdce414
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000186.jsonl.zst
283 kB
xet
about 2 months ago
15a7ee55
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000187.jsonl.zst
140 kB
xet
about 2 months ago
a33f4381
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000188.jsonl.zst
326 kB
xet
about 2 months ago
b0fb7cb7
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000189.jsonl.zst
314 kB
xet
about 2 months ago
d5fa9b93
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000190.jsonl.zst
513 kB
xet
about 2 months ago
5051c0fa
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000191.jsonl.zst
631 kB
xet
about 2 months ago
aea7852e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000192.jsonl.zst
194 kB
xet
about 2 months ago
3470b769
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000193.jsonl.zst
232 kB
xet
about 2 months ago
9e514cb7
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000194.jsonl.zst
265 kB
xet
about 2 months ago
cf2cdd92
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000195.jsonl.zst
202 kB
xet
about 2 months ago
6ec78198
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000196.jsonl.zst
406 kB
xet
about 2 months ago
b1f4c8c9
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000197.jsonl.zst
256 kB
xet
about 2 months ago
bdd3c02e
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000198.jsonl.zst
306 kB
xet
about 2 months ago
30400d6f
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000199.jsonl.zst
398 kB
xet
about 2 months ago
0ebb1fa2
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000200.jsonl.zst
250 kB
xet
about 2 months ago
45550677
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000201.jsonl.zst
309 kB
xet
about 2 months ago
44c72a24
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000203.jsonl.zst
305 kB
xet
about 2 months ago
36cc4b48
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000204.jsonl.zst
156 kB
xet
about 2 months ago
0172337b
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000205.jsonl.zst
172 kB
xet
about 2 months ago
dd97a97c
soc127__phase1_pool_shared__common_crawl__part_004__data__common_crawl-science_math_and_technology-0013__shard_00000206.jsonl.zst
137 kB
xet
about 2 months ago
5c5f717c
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors