Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0063
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000518.jsonl.zst
155 kB
xet
about 2 months ago
bf6e36cf
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000519.jsonl.zst
100 kB
xet
about 2 months ago
37269d99
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000520.jsonl.zst
107 kB
xet
about 2 months ago
f49b547b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000521.jsonl.zst
129 kB
xet
about 2 months ago
4c89fa50
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000522.jsonl.zst
137 kB
xet
about 2 months ago
ac579768
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000523.jsonl.zst
204 kB
xet
about 2 months ago
e48e0453
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000524.jsonl.zst
104 kB
xet
about 2 months ago
1dd5317c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000525.jsonl.zst
99.5 kB
xet
about 2 months ago
368537e7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000526.jsonl.zst
137 kB
xet
about 2 months ago
e5413827
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000527.jsonl.zst
142 kB
xet
about 2 months ago
48c2bf56
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000528.jsonl.zst
174 kB
xet
about 2 months ago
e6ab93e9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000529.jsonl.zst
128 kB
xet
about 2 months ago
77d21b5e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000530.jsonl.zst
199 kB
xet
about 2 months ago
3afea53c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000531.jsonl.zst
112 kB
xet
about 2 months ago
df5ac6d4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000532.jsonl.zst
163 kB
xet
about 2 months ago
7881651a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000533.jsonl.zst
118 kB
xet
about 2 months ago
69f2c12a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000534.jsonl.zst
176 kB
xet
about 2 months ago
6d40d704
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000535.jsonl.zst
117 kB
xet
about 2 months ago
c3fe58ea
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000536.jsonl.zst
179 kB
xet
about 2 months ago
a7532a61
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000537.jsonl.zst
234 kB
xet
about 2 months ago
16ec2126
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000538.jsonl.zst
142 kB
xet
about 2 months ago
881ed676
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000539.jsonl.zst
165 kB
xet
about 2 months ago
22cf1166
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000540.jsonl.zst
177 kB
xet
about 2 months ago
33ff1333
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000541.jsonl.zst
116 kB
xet
about 2 months ago
8507b0e3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000542.jsonl.zst
149 kB
xet
about 2 months ago
d7a9d3eb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000543.jsonl.zst
142 kB
xet
about 2 months ago
0786c54e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000544.jsonl.zst
109 kB
xet
about 2 months ago
ef7fd49a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000545.jsonl.zst
112 kB
xet
about 2 months ago
28c1e82d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000546.jsonl.zst
196 kB
xet
about 2 months ago
e7056550
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000547.jsonl.zst
122 kB
xet
about 2 months ago
780608bc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000548.jsonl.zst
317 kB
xet
about 2 months ago
ec9d1147
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000549.jsonl.zst
117 kB
xet
about 2 months ago
65e869b8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000550.jsonl.zst
212 kB
xet
about 2 months ago
4c53488b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000551.jsonl.zst
188 kB
xet
about 2 months ago
02150c80
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000552.jsonl.zst
125 kB
xet
about 2 months ago
05be48d6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000553.jsonl.zst
114 kB
xet
about 2 months ago
116dac6a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000554.jsonl.zst
124 kB
xet
about 2 months ago
8afa829c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000555.jsonl.zst
106 kB
xet
about 2 months ago
d0a9021c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000556.jsonl.zst
165 kB
xet
about 2 months ago
4a47d7a7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000557.jsonl.zst
161 kB
xet
about 2 months ago
7ddec7ed
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000558.jsonl.zst
92 kB
xet
about 2 months ago
7cf7127f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000559.jsonl.zst
109 kB
xet
about 2 months ago
76c73f88
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000560.jsonl.zst
142 kB
xet
about 2 months ago
a849c757
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000561.jsonl.zst
189 kB
xet
about 2 months ago
833d838b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000562.jsonl.zst
224 kB
xet
about 2 months ago
431d1353
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000563.jsonl.zst
131 kB
xet
about 2 months ago
18aab349
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000564.jsonl.zst
146 kB
xet
about 2 months ago
83a4d875
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000565.jsonl.zst
126 kB
xet
about 2 months ago
8ed20ad2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000566.jsonl.zst
179 kB
xet
about 2 months ago
213507b2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000567.jsonl.zst
167 kB
xet
about 2 months ago
225f8116
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000568.jsonl.zst
138 kB
xet
about 2 months ago
87551816
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000569.jsonl.zst
124 kB
xet
about 2 months ago
d40266bd
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000570.jsonl.zst
131 kB
xet
about 2 months ago
dfbdf21b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000571.jsonl.zst
142 kB
xet
about 2 months ago
08fbaa1d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000572.jsonl.zst
331 kB
xet
about 2 months ago
da2ade6b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000573.jsonl.zst
189 kB
xet
about 2 months ago
fb6c7e77
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000574.jsonl.zst
229 kB
xet
about 2 months ago
14efc8d8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000575.jsonl.zst
211 kB
xet
about 2 months ago
832e101f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000576.jsonl.zst
69.7 kB
xet
about 2 months ago
e45263c5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000577.jsonl.zst
381 kB
xet
about 2 months ago
2df26c4d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000578.jsonl.zst
134 kB
xet
about 2 months ago
c3515841
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000579.jsonl.zst
217 kB
xet
about 2 months ago
7d4bc3fa
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000580.jsonl.zst
145 kB
xet
about 2 months ago
174e6f1b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000581.jsonl.zst
120 kB
xet
about 2 months ago
21eac046
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000582.jsonl.zst
175 kB
xet
about 2 months ago
4d26278a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000583.jsonl.zst
224 kB
xet
about 2 months ago
72aa370f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000584.jsonl.zst
130 kB
xet
about 2 months ago
e054d77c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000585.jsonl.zst
393 kB
xet
about 2 months ago
fd05d96c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000586.jsonl.zst
163 kB
xet
about 2 months ago
221ff6a0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000587.jsonl.zst
199 kB
xet
about 2 months ago
7931e01e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000588.jsonl.zst
362 kB
xet
about 2 months ago
f7639700
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000589.jsonl.zst
139 kB
xet
about 2 months ago
3e1d4f62
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000590.jsonl.zst
124 kB
xet
about 2 months ago
457d8dda
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000591.jsonl.zst
110 kB
xet
about 2 months ago
86ec399f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000592.jsonl.zst
138 kB
xet
about 2 months ago
e12b0f92
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000593.jsonl.zst
194 kB
xet
about 2 months ago
e1f13530
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000594.jsonl.zst
134 kB
xet
about 2 months ago
a6000646
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000595.jsonl.zst
222 kB
xet
about 2 months ago
09868ea6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000596.jsonl.zst
81.2 kB
xet
about 2 months ago
efaea61b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000597.jsonl.zst
93.2 kB
xet
about 2 months ago
adbb4bb6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000598.jsonl.zst
108 kB
xet
about 2 months ago
9d9f51c4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000599.jsonl.zst
124 kB
xet
about 2 months ago
f1181d46
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000600.jsonl.zst
200 kB
xet
about 2 months ago
fd1073bb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000601.jsonl.zst
114 kB
xet
about 2 months ago
82bbd7a9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000602.jsonl.zst
65.5 kB
xet
about 2 months ago
994b04c3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000603.jsonl.zst
203 kB
xet
about 2 months ago
37c54902
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000605.jsonl.zst
74.3 kB
xet
about 2 months ago
5f04cf8c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000606.jsonl.zst
105 kB
xet
about 2 months ago
7347d0bc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000607.jsonl.zst
182 kB
xet
about 2 months ago
6769fe94
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000608.jsonl.zst
134 kB
xet
about 2 months ago
86233a3d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000609.jsonl.zst
125 kB
xet
about 2 months ago
04b3004b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000610.jsonl.zst
183 kB
xet
about 2 months ago
75b1a69f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000611.jsonl.zst
161 kB
xet
about 2 months ago
8e9970fc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000612.jsonl.zst
104 kB
xet
about 2 months ago
fde75170
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000613.jsonl.zst
216 kB
xet
about 2 months ago
472fbb7b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000614.jsonl.zst
103 kB
xet
about 2 months ago
aad8dd47
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000615.jsonl.zst
164 kB
xet
about 2 months ago
6328722d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000616.jsonl.zst
186 kB
xet
about 2 months ago
8aa31e5d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000617.jsonl.zst
206 kB
xet
about 2 months ago
9db4a7f6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0017__shard_00000618.jsonl.zst
103 kB
xet
about 2 months ago
9cfd5d26
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors