Buckets: HCAI-Lab / dolma3-6t-sample-5000-docs
Human-Centered AI Lab
HCAI-Lab/dolma3-6t-sample-5000-docs/worker_0060
11.1 GB, 56,043 files, updated about 2 months ago
| Name | Size | Uploaded | Xet hash |
| --- | --- | --- | --- |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000094.jsonl.zst | 200 kB | about 2 months ago | d56d7cf3 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000095.jsonl.zst | 114 kB | about 2 months ago | eaebc92a |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000096.jsonl.zst | 320 kB | about 2 months ago | 7059339f |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000097.jsonl.zst | 176 kB | about 2 months ago | 5a8af64b |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000098.jsonl.zst | 236 kB | about 2 months ago | 662aa0f9 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000099.jsonl.zst | 257 kB | about 2 months ago | 6f9f7535 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000100.jsonl.zst | 197 kB | about 2 months ago | 7772e22d |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000101.jsonl.zst | 157 kB | about 2 months ago | 6bb43c47 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000102.jsonl.zst | 79.5 kB | about 2 months ago | 6a23b472 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000103.jsonl.zst | 146 kB | about 2 months ago | 38bca022 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000104.jsonl.zst | 118 kB | about 2 months ago | 380945a8 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000105.jsonl.zst | 128 kB | about 2 months ago | e2bbf91f |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000106.jsonl.zst | 182 kB | about 2 months ago | b0f08390 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000107.jsonl.zst | 204 kB | about 2 months ago | ac88bdac |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000108.jsonl.zst | 188 kB | about 2 months ago | 4c57fb6f |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000109.jsonl.zst | 142 kB | about 2 months ago | 6d9082fc |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000110.jsonl.zst | 89 kB | about 2 months ago | d721fcb1 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000111.jsonl.zst | 179 kB | about 2 months ago | 1d293642 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000112.jsonl.zst | 190 kB | about 2 months ago | 5e1bd7b9 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000113.jsonl.zst | 237 kB | about 2 months ago | 441273e4 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000114.jsonl.zst | 218 kB | about 2 months ago | 5e74e783 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000115.jsonl.zst | 188 kB | about 2 months ago | 6a36639e |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000116.jsonl.zst | 191 kB | about 2 months ago | adbb3a52 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000117.jsonl.zst | 152 kB | about 2 months ago | e5f36c00 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000118.jsonl.zst | 177 kB | about 2 months ago | 403c73b1 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000119.jsonl.zst | 163 kB | about 2 months ago | 37bca16c |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000120.jsonl.zst | 297 kB | about 2 months ago | dbbc6cf7 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000121.jsonl.zst | 209 kB | about 2 months ago | 2d91eef9 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000122.jsonl.zst | 167 kB | about 2 months ago | 2463ae67 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000123.jsonl.zst | 143 kB | about 2 months ago | 63738e05 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000124.jsonl.zst | 78 kB | about 2 months ago | 8cea5507 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000125.jsonl.zst | 117 kB | about 2 months ago | 1952602a |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000126.jsonl.zst | 229 kB | about 2 months ago | e38333dc |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000127.jsonl.zst | 156 kB | about 2 months ago | 12c5f8fe |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000128.jsonl.zst | 137 kB | about 2 months ago | 1f17ec2f |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000129.jsonl.zst | 192 kB | about 2 months ago | 58139860 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000130.jsonl.zst | 153 kB | about 2 months ago | 02511a46 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000131.jsonl.zst | 116 kB | about 2 months ago | 3387213e |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000132.jsonl.zst | 196 kB | about 2 months ago | 6ce93512 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000133.jsonl.zst | 251 kB | about 2 months ago | 61a912b3 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000134.jsonl.zst | 204 kB | about 2 months ago | 104252e8 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000135.jsonl.zst | 228 kB | about 2 months ago | 01a1165e |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000136.jsonl.zst | 182 kB | about 2 months ago | 5b071626 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000137.jsonl.zst | 175 kB | about 2 months ago | 13f57858 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000138.jsonl.zst | 190 kB | about 2 months ago | 63d240aa |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000139.jsonl.zst | 169 kB | about 2 months ago | 679aa195 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000140.jsonl.zst | 151 kB | about 2 months ago | d3d1af13 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000141.jsonl.zst | 147 kB | about 2 months ago | 4823ace4 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000142.jsonl.zst | 156 kB | about 2 months ago | 14ec46a7 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000143.jsonl.zst | 164 kB | about 2 months ago | 464744a9 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000144.jsonl.zst | 178 kB | about 2 months ago | 28214fc2 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000145.jsonl.zst | 208 kB | about 2 months ago | fc8f214d |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000146.jsonl.zst | 131 kB | about 2 months ago | 0bf18516 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000147.jsonl.zst | 114 kB | about 2 months ago | fcf05129 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000148.jsonl.zst | 133 kB | about 2 months ago | 67db79c1 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000149.jsonl.zst | 82.8 kB | about 2 months ago | 61539ae2 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000150.jsonl.zst | 166 kB | about 2 months ago | 90e62304 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000151.jsonl.zst | 96.1 kB | about 2 months ago | b7d589de |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000152.jsonl.zst | 109 kB | about 2 months ago | 763a4c1c |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000153.jsonl.zst | 219 kB | about 2 months ago | 505c9149 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000154.jsonl.zst | 162 kB | about 2 months ago | 53d1a108 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000155.jsonl.zst | 185 kB | about 2 months ago | b02b52c7 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000156.jsonl.zst | 197 kB | about 2 months ago | 5826d016 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000157.jsonl.zst | 180 kB | about 2 months ago | dd205732 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000158.jsonl.zst | 173 kB | about 2 months ago | 1ea40d20 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000159.jsonl.zst | 161 kB | about 2 months ago | b792a565 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000160.jsonl.zst | 91.9 kB | about 2 months ago | 247eb013 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000161.jsonl.zst | 201 kB | about 2 months ago | 06e9b266 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000162.jsonl.zst | 254 kB | about 2 months ago | 5bb1e256 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000163.jsonl.zst | 116 kB | about 2 months ago | cfd0632d |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000164.jsonl.zst | 144 kB | about 2 months ago | 58aaeb45 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000165.jsonl.zst | 213 kB | about 2 months ago | b4ac43f2 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000166.jsonl.zst | 249 kB | about 2 months ago | 8be3c7be |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000167.jsonl.zst | 221 kB | about 2 months ago | f9291cf0 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000168.jsonl.zst | 133 kB | about 2 months ago | 06364756 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000169.jsonl.zst | 174 kB | about 2 months ago | 159fb978 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000170.jsonl.zst | 321 kB | about 2 months ago | 797260fc |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000171.jsonl.zst | 134 kB | about 2 months ago | 27336e2b |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000172.jsonl.zst | 165 kB | about 2 months ago | 57117bf5 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000173.jsonl.zst | 170 kB | about 2 months ago | 2c0b931b |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000174.jsonl.zst | 121 kB | about 2 months ago | 80ea0c8d |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000175.jsonl.zst | 137 kB | about 2 months ago | 966e0c37 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000176.jsonl.zst | 128 kB | about 2 months ago | 1df72bb2 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000177.jsonl.zst | 108 kB | about 2 months ago | d82c26b9 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000178.jsonl.zst | 62.1 kB | about 2 months ago | 4d19a9f0 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000179.jsonl.zst | 131 kB | about 2 months ago | 22a98ff4 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000181.jsonl.zst | 256 kB | about 2 months ago | 18b458ca |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000182.jsonl.zst | 154 kB | about 2 months ago | 8b2b7024 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000183.jsonl.zst | 155 kB | about 2 months ago | 4a514efd |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000184.jsonl.zst | 142 kB | about 2 months ago | ece605a9 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000185.jsonl.zst | 162 kB | about 2 months ago | 5432f950 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000186.jsonl.zst | 190 kB | about 2 months ago | e4660cfa |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000187.jsonl.zst | 154 kB | about 2 months ago | 7e28fb88 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000188.jsonl.zst | 104 kB | about 2 months ago | 0a11a797 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000189.jsonl.zst | 226 kB | about 2 months ago | 6a265745 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000190.jsonl.zst | 220 kB | about 2 months ago | 31723d2d |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000191.jsonl.zst | 150 kB | about 2 months ago | 03ac62e9 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000192.jsonl.zst | 238 kB | about 2 months ago | 1e464ef5 |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000193.jsonl.zst | 222 kB | about 2 months ago | 735f3b4c |
| soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0016__shard_00000194.jsonl.zst | 155 kB | about 2 months ago | e9a5f52b |
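The file names in this listing end in a numbered `shard_NNNNNNNN.jsonl.zst` suffix, and the visible sequence jumps from shard 179 to shard 181. As a minimal sketch, assuming the suffix pattern inferred from the names shown here (it is not a documented naming scheme), the gaps in a set of listed names could be checked like this. The helper names `shard_index` and `missing_shards` are hypothetical.

```python
import re

# Assumption: the shard index is the zero-padded number right before
# ".jsonl.zst", as seen in the listed names. This is inferred, not documented.
SHARD_RE = re.compile(r"__shard_(\d+)\.jsonl\.zst$")


def shard_index(name: str) -> int:
    """Return the integer shard index embedded in a listed file name."""
    match = SHARD_RE.search(name)
    if match is None:
        raise ValueError(f"no shard index in {name!r}")
    return int(match.group(1))


def missing_shards(names: list[str]) -> list[int]:
    """Return indices absent between the smallest and largest index seen."""
    seen = {shard_index(n) for n in names}
    if not seen:
        return []
    return [i for i in range(min(seen), max(seen) + 1) if i not in seen]


# Prefix copied from the listing; the four indices are a small excerpt of it.
PREFIX = (
    "soc127__phase1_pool_shared__common_crawl__part_005__data__"
    "common_crawl-science_math_and_technology-0016__shard_"
)
names = [f"{PREFIX}{i:08d}.jsonl.zst" for i in (178, 179, 181, 182)]
print(missing_shards(names))
```

A gap found this way only says the shard is not among the names supplied. The listing alone does not show whether the shard is absent from the bucket or was simply never part of the sample.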
Total size: 11.1 GB
Files: 56,043
Last updated: Mar 24
Pre-warmed CDN: US, EU