Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0062
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000080.jsonl.zst
164 kB
xet
about 2 months ago
b28142b3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000081.jsonl.zst
93.7 kB
xet
about 2 months ago
2d3869c2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000082.jsonl.zst
139 kB
xet
about 2 months ago
f973e44d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000083.jsonl.zst
188 kB
xet
about 2 months ago
ffd11cf8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000084.jsonl.zst
130 kB
xet
about 2 months ago
2d502f43
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000085.jsonl.zst
130 kB
xet
about 2 months ago
753a98d4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000086.jsonl.zst
209 kB
xet
about 2 months ago
396aad6f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000087.jsonl.zst
164 kB
xet
about 2 months ago
f534f3a3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000088.jsonl.zst
139 kB
xet
about 2 months ago
6ceef51f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000089.jsonl.zst
140 kB
xet
about 2 months ago
3f0b9e88
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000090.jsonl.zst
167 kB
xet
about 2 months ago
a4472a16
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000091.jsonl.zst
168 kB
xet
about 2 months ago
5ed40dea
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000092.jsonl.zst
106 kB
xet
about 2 months ago
88fcb72b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000093.jsonl.zst
149 kB
xet
about 2 months ago
a13e83bb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000094.jsonl.zst
211 kB
xet
about 2 months ago
f7312717
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000095.jsonl.zst
295 kB
xet
about 2 months ago
70ef305c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000096.jsonl.zst
206 kB
xet
about 2 months ago
3edc71c9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000097.jsonl.zst
152 kB
xet
about 2 months ago
8487481b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000098.jsonl.zst
181 kB
xet
about 2 months ago
5f4dc055
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000099.jsonl.zst
146 kB
xet
about 2 months ago
6d9d1a5a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000100.jsonl.zst
86.5 kB
xet
about 2 months ago
d73e532c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000101.jsonl.zst
121 kB
xet
about 2 months ago
1d123ca9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000102.jsonl.zst
87.5 kB
xet
about 2 months ago
f9eb94be
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000103.jsonl.zst
184 kB
xet
about 2 months ago
d7fec264
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000104.jsonl.zst
136 kB
xet
about 2 months ago
57be17b2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000105.jsonl.zst
214 kB
xet
about 2 months ago
af8725c4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000106.jsonl.zst
111 kB
xet
about 2 months ago
c7a101f3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000107.jsonl.zst
327 kB
xet
about 2 months ago
3808c669
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000108.jsonl.zst
280 kB
xet
about 2 months ago
511bb50f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000109.jsonl.zst
206 kB
xet
about 2 months ago
a5be81cc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000110.jsonl.zst
154 kB
xet
about 2 months ago
cfaa28b4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000111.jsonl.zst
143 kB
xet
about 2 months ago
069ca336
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000112.jsonl.zst
146 kB
xet
about 2 months ago
bb6905e6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000113.jsonl.zst
130 kB
xet
about 2 months ago
64b361bd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000114.jsonl.zst
214 kB
xet
about 2 months ago
5cbaee2a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000115.jsonl.zst
198 kB
xet
about 2 months ago
995c5a07
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000116.jsonl.zst
201 kB
xet
about 2 months ago
961112a2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000117.jsonl.zst
136 kB
xet
about 2 months ago
39204069
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000118.jsonl.zst
111 kB
xet
about 2 months ago
44e465b4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000119.jsonl.zst
121 kB
xet
about 2 months ago
a2ba19b0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000120.jsonl.zst
131 kB
xet
about 2 months ago
1d1d10a7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000121.jsonl.zst
109 kB
xet
about 2 months ago
34e5f53e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000122.jsonl.zst
165 kB
xet
about 2 months ago
9d1c6ca6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000123.jsonl.zst
160 kB
xet
about 2 months ago
f9dfbbca
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000124.jsonl.zst
111 kB
xet
about 2 months ago
7921fec3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000125.jsonl.zst
130 kB
xet
about 2 months ago
e1f67719
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000126.jsonl.zst
131 kB
xet
about 2 months ago
ff5d169f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000127.jsonl.zst
134 kB
xet
about 2 months ago
e265c138
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000128.jsonl.zst
198 kB
xet
about 2 months ago
a487a38e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000129.jsonl.zst
249 kB
xet
about 2 months ago
1409f2f5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000130.jsonl.zst
210 kB
xet
about 2 months ago
16cbb667
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000131.jsonl.zst
159 kB
xet
about 2 months ago
0d9f7c30
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000132.jsonl.zst
191 kB
xet
about 2 months ago
fa5d7401
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000133.jsonl.zst
215 kB
xet
about 2 months ago
a5722559
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000134.jsonl.zst
95.7 kB
xet
about 2 months ago
b23c03fe
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000135.jsonl.zst
145 kB
xet
about 2 months ago
44806ec6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000136.jsonl.zst
89.9 kB
xet
about 2 months ago
a3598625
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000137.jsonl.zst
223 kB
xet
about 2 months ago
d97417ee
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000138.jsonl.zst
221 kB
xet
about 2 months ago
31184aa0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000139.jsonl.zst
111 kB
xet
about 2 months ago
48e5604e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000140.jsonl.zst
131 kB
xet
about 2 months ago
cc995ab0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000141.jsonl.zst
182 kB
xet
about 2 months ago
a1de896e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000142.jsonl.zst
171 kB
xet
about 2 months ago
8610e37f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000143.jsonl.zst
239 kB
xet
about 2 months ago
df811f2b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000144.jsonl.zst
236 kB
xet
about 2 months ago
d7d37505
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000145.jsonl.zst
132 kB
xet
about 2 months ago
e95f99c4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000146.jsonl.zst
140 kB
xet
about 2 months ago
f044a0ff
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000147.jsonl.zst
253 kB
xet
about 2 months ago
08822a1f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000148.jsonl.zst
175 kB
xet
about 2 months ago
72f8c63e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000149.jsonl.zst
101 kB
xet
about 2 months ago
d37a29ab
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000150.jsonl.zst
172 kB
xet
about 2 months ago
2945432e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000151.jsonl.zst
169 kB
xet
about 2 months ago
1968209f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000152.jsonl.zst
147 kB
xet
about 2 months ago
c863706f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000153.jsonl.zst
222 kB
xet
about 2 months ago
bbd6da16
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000154.jsonl.zst
232 kB
xet
about 2 months ago
888942a4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000155.jsonl.zst
208 kB
xet
about 2 months ago
2c6211b4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000156.jsonl.zst
157 kB
xet
about 2 months ago
987a5225
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000157.jsonl.zst
83.9 kB
xet
about 2 months ago
0f23ca53
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000158.jsonl.zst
190 kB
xet
about 2 months ago
6823907a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000159.jsonl.zst
89.8 kB
xet
about 2 months ago
7465233c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000160.jsonl.zst
125 kB
xet
about 2 months ago
fc049c5c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000161.jsonl.zst
128 kB
xet
about 2 months ago
8c94c1cb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000162.jsonl.zst
166 kB
xet
about 2 months ago
3ee3a5fe
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000163.jsonl.zst
184 kB
xet
about 2 months ago
efe602d9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000164.jsonl.zst
105 kB
xet
about 2 months ago
950b2f49
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000165.jsonl.zst
184 kB
xet
about 2 months ago
9d78f17d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000166.jsonl.zst
204 kB
xet
about 2 months ago
05700545
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000167.jsonl.zst
199 kB
xet
about 2 months ago
05f45b5f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000168.jsonl.zst
186 kB
xet
about 2 months ago
3861657b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000169.jsonl.zst
127 kB
xet
about 2 months ago
4b56c113
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000170.jsonl.zst
138 kB
xet
about 2 months ago
26764403
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000171.jsonl.zst
252 kB
xet
about 2 months ago
77a332b9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000172.jsonl.zst
117 kB
xet
about 2 months ago
8ea49abd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000173.jsonl.zst
147 kB
xet
about 2 months ago
15a351aa
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000174.jsonl.zst
276 kB
xet
about 2 months ago
9427d11f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000175.jsonl.zst
152 kB
xet
about 2 months ago
4ed0d97c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000176.jsonl.zst
164 kB
xet
about 2 months ago
f246f182
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000177.jsonl.zst
141 kB
xet
about 2 months ago
59c03a6a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000178.jsonl.zst
84.6 kB
xet
about 2 months ago
1295e1fd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000179.jsonl.zst
197 kB
xet
about 2 months ago
c71d05a6
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors