Bucket: HCAI-Lab/dolma3-6t-sample-5000-docs (Human-Centered AI Lab)
Path: HCAI-Lab/dolma3-6t-sample-5000-docs/worker_0058
11.1 GB
56,043 files
Updated about 2 months ago
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000043.jsonl.zst
189 kB
xet
about 2 months ago
04bb5304
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000044.jsonl.zst
173 kB
xet
about 2 months ago
7a75678f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000045.jsonl.zst
221 kB
xet
about 2 months ago
0469dcfc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000046.jsonl.zst
109 kB
xet
about 2 months ago
43630ea5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000047.jsonl.zst
152 kB
xet
about 2 months ago
22bb5cbd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000048.jsonl.zst
269 kB
xet
about 2 months ago
b200bc01
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000049.jsonl.zst
190 kB
xet
about 2 months ago
911e2d8d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000050.jsonl.zst
262 kB
xet
about 2 months ago
fbfa01eb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000051.jsonl.zst
114 kB
xet
about 2 months ago
89424e0c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000052.jsonl.zst
389 kB
xet
about 2 months ago
e3457c95
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000053.jsonl.zst
174 kB
xet
about 2 months ago
6b3701e2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000054.jsonl.zst
258 kB
xet
about 2 months ago
29a5b018
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000055.jsonl.zst
113 kB
xet
about 2 months ago
0831e8bc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000056.jsonl.zst
277 kB
xet
about 2 months ago
fabb416c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000057.jsonl.zst
161 kB
xet
about 2 months ago
13a413ad
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000058.jsonl.zst
149 kB
xet
about 2 months ago
133def99
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000059.jsonl.zst
206 kB
xet
about 2 months ago
518697e6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000060.jsonl.zst
178 kB
xet
about 2 months ago
794a6566
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000061.jsonl.zst
221 kB
xet
about 2 months ago
bc2bf650
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000062.jsonl.zst
244 kB
xet
about 2 months ago
9e34fc7c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000063.jsonl.zst
202 kB
xet
about 2 months ago
10655696
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000064.jsonl.zst
183 kB
xet
about 2 months ago
0d279cf6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000065.jsonl.zst
166 kB
xet
about 2 months ago
6cc0cfe2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000066.jsonl.zst
161 kB
xet
about 2 months ago
96e2740e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000067.jsonl.zst
496 kB
xet
about 2 months ago
b02cd905
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000068.jsonl.zst
206 kB
xet
about 2 months ago
a82273f8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000069.jsonl.zst
260 kB
xet
about 2 months ago
93bdc9a5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000070.jsonl.zst
168 kB
xet
about 2 months ago
aafe5b9d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000071.jsonl.zst
218 kB
xet
about 2 months ago
4e3325d4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000072.jsonl.zst
180 kB
xet
about 2 months ago
f13b5eac
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000073.jsonl.zst
249 kB
xet
about 2 months ago
68bd2de3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000074.jsonl.zst
245 kB
xet
about 2 months ago
20f0f06d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000075.jsonl.zst
141 kB
xet
about 2 months ago
57b41d97
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000076.jsonl.zst
159 kB
xet
about 2 months ago
85c7fb36
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000077.jsonl.zst
115 kB
xet
about 2 months ago
489d9dff
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000078.jsonl.zst
151 kB
xet
about 2 months ago
33bc1a76
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000079.jsonl.zst
250 kB
xet
about 2 months ago
f127bd21
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000080.jsonl.zst
158 kB
xet
about 2 months ago
796fd1e1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000081.jsonl.zst
47.3 kB
xet
about 2 months ago
40faa1ef
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000082.jsonl.zst
42.1 kB
xet
about 2 months ago
3ef40dca
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000083.jsonl.zst
83.6 kB
xet
about 2 months ago
589665ee
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000084.jsonl.zst
63.3 kB
xet
about 2 months ago
0c5a153b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000085.jsonl.zst
164 kB
xet
about 2 months ago
70bf90ba
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000086.jsonl.zst
234 kB
xet
about 2 months ago
7afbf07a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000087.jsonl.zst
203 kB
xet
about 2 months ago
9c156236
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000088.jsonl.zst
226 kB
xet
about 2 months ago
2b8b63a0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000089.jsonl.zst
139 kB
xet
about 2 months ago
b6c039a6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000090.jsonl.zst
145 kB
xet
about 2 months ago
ac0829b4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000091.jsonl.zst
152 kB
xet
about 2 months ago
3d48b019
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000092.jsonl.zst
299 kB
xet
about 2 months ago
93833dd2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000093.jsonl.zst
172 kB
xet
about 2 months ago
5e5eced3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000094.jsonl.zst
210 kB
xet
about 2 months ago
2f65aaeb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000095.jsonl.zst
156 kB
xet
about 2 months ago
0fd6dfc2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000096.jsonl.zst
136 kB
xet
about 2 months ago
2fd88189
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000097.jsonl.zst
145 kB
xet
about 2 months ago
7c665335
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000098.jsonl.zst
175 kB
xet
about 2 months ago
6c7d2baa
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000099.jsonl.zst
299 kB
xet
about 2 months ago
9402d4e3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000100.jsonl.zst
161 kB
xet
about 2 months ago
cb053f7a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000101.jsonl.zst
151 kB
xet
about 2 months ago
1f61295f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000102.jsonl.zst
266 kB
xet
about 2 months ago
9f16ffe7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000103.jsonl.zst
198 kB
xet
about 2 months ago
32cba475
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000104.jsonl.zst
145 kB
xet
about 2 months ago
cc864a3b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000105.jsonl.zst
246 kB
xet
about 2 months ago
0a7a5cad
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000106.jsonl.zst
216 kB
xet
about 2 months ago
e8600b67
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000107.jsonl.zst
187 kB
xet
about 2 months ago
ef046ade
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000108.jsonl.zst
198 kB
xet
about 2 months ago
48c7070a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000109.jsonl.zst
179 kB
xet
about 2 months ago
b4104e09
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000110.jsonl.zst
312 kB
xet
about 2 months ago
51f0318a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000111.jsonl.zst
251 kB
xet
about 2 months ago
c7dbba79
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000112.jsonl.zst
324 kB
xet
about 2 months ago
75619f52
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000113.jsonl.zst
163 kB
xet
about 2 months ago
4812791c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000114.jsonl.zst
237 kB
xet
about 2 months ago
47ee31c0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000115.jsonl.zst
195 kB
xet
about 2 months ago
869a9b0d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000116.jsonl.zst
181 kB
xet
about 2 months ago
42c9a088
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000117.jsonl.zst
152 kB
xet
about 2 months ago
eca30e09
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000118.jsonl.zst
255 kB
xet
about 2 months ago
a50c8dcf
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000119.jsonl.zst
266 kB
xet
about 2 months ago
cb9e746a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000120.jsonl.zst
263 kB
xet
about 2 months ago
9475c038
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000121.jsonl.zst
184 kB
xet
about 2 months ago
2caba49d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000122.jsonl.zst
187 kB
xet
about 2 months ago
b2c4752a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000123.jsonl.zst
235 kB
xet
about 2 months ago
ffc98eb6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000124.jsonl.zst
160 kB
xet
about 2 months ago
d0df74e8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000125.jsonl.zst
202 kB
xet
about 2 months ago
960c0755
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000126.jsonl.zst
192 kB
xet
about 2 months ago
4e03dd02
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000127.jsonl.zst
308 kB
xet
about 2 months ago
a6f5d9da
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000128.jsonl.zst
177 kB
xet
about 2 months ago
41bb1d31
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000129.jsonl.zst
189 kB
xet
about 2 months ago
1bc012be
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000130.jsonl.zst
122 kB
xet
about 2 months ago
7092f3b8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000131.jsonl.zst
169 kB
xet
about 2 months ago
055b01e0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000132.jsonl.zst
122 kB
xet
about 2 months ago
f843bc55
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000133.jsonl.zst
247 kB
xet
about 2 months ago
5b161a35
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000134.jsonl.zst
207 kB
xet
about 2 months ago
359f180b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000135.jsonl.zst
201 kB
xet
about 2 months ago
f3fa27da
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000136.jsonl.zst
190 kB
xet
about 2 months ago
4c8c3b37
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000137.jsonl.zst
181 kB
xet
about 2 months ago
9bce2435
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000138.jsonl.zst
159 kB
xet
about 2 months ago
e83e2779
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000139.jsonl.zst
168 kB
xet
about 2 months ago
a82a17fd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000140.jsonl.zst
133 kB
xet
about 2 months ago
c4a28473
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000141.jsonl.zst
272 kB
xet
about 2 months ago
80271546
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0015__shard_00000142.jsonl.zst
188 kB
xet
about 2 months ago
400e5977
Total size: 11.1 GB
Files: 56,043
Last updated: Mar 24
Pre-warmed CDN
US
EU