Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Buckets:
HCAI-Lab
/
dolma3-6t-sample-5000-docs
Follow
Human-Centered AI Lab
10
Files
xet
HCAI-Lab/dolma3-6t-sample-5000-docs
/
worker_0067
11.1 GB
56,043 files
Updated about 2 months ago
Ctrl+K
Name
Size
Uploaded
Xet hash
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000344.jsonl.zst
119 kB
xet
about 2 months ago
68f0f3e9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000345.jsonl.zst
87.9 kB
xet
about 2 months ago
0b981b2e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000346.jsonl.zst
122 kB
xet
about 2 months ago
a8015d0e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000347.jsonl.zst
138 kB
xet
about 2 months ago
a35b8070
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000348.jsonl.zst
147 kB
xet
about 2 months ago
2932b756
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000349.jsonl.zst
124 kB
xet
about 2 months ago
50562e13
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000350.jsonl.zst
155 kB
xet
about 2 months ago
87f2b554
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000351.jsonl.zst
146 kB
xet
about 2 months ago
973fea95
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000352.jsonl.zst
189 kB
xet
about 2 months ago
88e6b6a3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000353.jsonl.zst
179 kB
xet
about 2 months ago
f9edbf1b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000354.jsonl.zst
105 kB
xet
about 2 months ago
d440c70c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000355.jsonl.zst
132 kB
xet
about 2 months ago
4e5848e0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000356.jsonl.zst
181 kB
xet
about 2 months ago
50781747
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000357.jsonl.zst
202 kB
xet
about 2 months ago
85d36f9f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000358.jsonl.zst
135 kB
xet
about 2 months ago
24bc40a7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000359.jsonl.zst
165 kB
xet
about 2 months ago
6d75c415
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000360.jsonl.zst
223 kB
xet
about 2 months ago
e3fd10e5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000361.jsonl.zst
140 kB
xet
about 2 months ago
031b60d9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000362.jsonl.zst
117 kB
xet
about 2 months ago
122a9d25
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000363.jsonl.zst
127 kB
xet
about 2 months ago
57eac6e5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000364.jsonl.zst
180 kB
xet
about 2 months ago
86c48191
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000365.jsonl.zst
144 kB
xet
about 2 months ago
3337872a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000366.jsonl.zst
166 kB
xet
about 2 months ago
991be7f7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000367.jsonl.zst
163 kB
xet
about 2 months ago
6d65c6b4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000368.jsonl.zst
120 kB
xet
about 2 months ago
f67a26ec
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000369.jsonl.zst
120 kB
xet
about 2 months ago
4fc8d3f0
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000370.jsonl.zst
126 kB
xet
about 2 months ago
e8f20afb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000371.jsonl.zst
149 kB
xet
about 2 months ago
9e9a9dac
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000372.jsonl.zst
135 kB
xet
about 2 months ago
66b17fed
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000373.jsonl.zst
185 kB
xet
about 2 months ago
a503f879
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000374.jsonl.zst
171 kB
xet
about 2 months ago
456646ec
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000375.jsonl.zst
128 kB
xet
about 2 months ago
dd71c37d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000376.jsonl.zst
210 kB
xet
about 2 months ago
dc1a090b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000377.jsonl.zst
152 kB
xet
about 2 months ago
dbd64876
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000378.jsonl.zst
125 kB
xet
about 2 months ago
b5aafada
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000379.jsonl.zst
149 kB
xet
about 2 months ago
dfd0e2d6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000380.jsonl.zst
190 kB
xet
about 2 months ago
b589d401
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000381.jsonl.zst
104 kB
xet
about 2 months ago
d047e9bb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000383.jsonl.zst
76.6 kB
xet
about 2 months ago
c44ce272
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000384.jsonl.zst
56.7 kB
xet
about 2 months ago
31871409
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000385.jsonl.zst
100 kB
xet
about 2 months ago
4caaedd8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000386.jsonl.zst
134 kB
xet
about 2 months ago
0b8aca34
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000387.jsonl.zst
133 kB
xet
about 2 months ago
8268abc3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000388.jsonl.zst
169 kB
xet
about 2 months ago
d9fdbda6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000389.jsonl.zst
177 kB
xet
about 2 months ago
0737738b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000390.jsonl.zst
147 kB
xet
about 2 months ago
91115675
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000391.jsonl.zst
330 kB
xet
about 2 months ago
025b7d9d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000392.jsonl.zst
177 kB
xet
about 2 months ago
e36da612
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000393.jsonl.zst
185 kB
xet
about 2 months ago
b274ea62
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000394.jsonl.zst
187 kB
xet
about 2 months ago
49f55fd9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000395.jsonl.zst
213 kB
xet
about 2 months ago
a8354734
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000396.jsonl.zst
150 kB
xet
about 2 months ago
cd4ffd83
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000397.jsonl.zst
84 kB
xet
about 2 months ago
565255c8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000398.jsonl.zst
193 kB
xet
about 2 months ago
68022728
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000399.jsonl.zst
149 kB
xet
about 2 months ago
d9f10520
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000400.jsonl.zst
150 kB
xet
about 2 months ago
fb6195c8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000401.jsonl.zst
148 kB
xet
about 2 months ago
de7d9e03
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000402.jsonl.zst
124 kB
xet
about 2 months ago
6970e830
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000403.jsonl.zst
164 kB
xet
about 2 months ago
df006bda
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000404.jsonl.zst
179 kB
xet
about 2 months ago
8ebc7f43
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000405.jsonl.zst
139 kB
xet
about 2 months ago
19882647
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000406.jsonl.zst
124 kB
xet
about 2 months ago
f7427119
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000407.jsonl.zst
113 kB
xet
about 2 months ago
ddaf28ac
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000408.jsonl.zst
161 kB
xet
about 2 months ago
a19368f9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000409.jsonl.zst
162 kB
xet
about 2 months ago
fe9790ac
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000410.jsonl.zst
158 kB
xet
about 2 months ago
fc94e7ba
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000411.jsonl.zst
133 kB
xet
about 2 months ago
9e32f8be
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000412.jsonl.zst
135 kB
xet
about 2 months ago
4a9f9e22
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000413.jsonl.zst
119 kB
xet
about 2 months ago
1f50a827
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000414.jsonl.zst
138 kB
xet
about 2 months ago
973a8f36
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000415.jsonl.zst
87.1 kB
xet
about 2 months ago
e1291a23
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000416.jsonl.zst
151 kB
xet
about 2 months ago
ffeb5caa
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000417.jsonl.zst
108 kB
xet
about 2 months ago
b11a8dae
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000418.jsonl.zst
252 kB
xet
about 2 months ago
197ef5c9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000419.jsonl.zst
160 kB
xet
about 2 months ago
23948093
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000420.jsonl.zst
158 kB
xet
about 2 months ago
fe285719
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000421.jsonl.zst
163 kB
xet
about 2 months ago
aae98450
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000422.jsonl.zst
164 kB
xet
about 2 months ago
d9aff94b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000423.jsonl.zst
187 kB
xet
about 2 months ago
09d0b0f6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000424.jsonl.zst
163 kB
xet
about 2 months ago
563d766e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000425.jsonl.zst
117 kB
xet
about 2 months ago
622dbacc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000426.jsonl.zst
206 kB
xet
about 2 months ago
6e40b825
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000427.jsonl.zst
161 kB
xet
about 2 months ago
3c895251
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000428.jsonl.zst
181 kB
xet
about 2 months ago
3b358adf
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000429.jsonl.zst
211 kB
xet
about 2 months ago
180dc7cf
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000430.jsonl.zst
150 kB
xet
about 2 months ago
7ae8f145
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000431.jsonl.zst
221 kB
xet
about 2 months ago
51567da1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000432.jsonl.zst
228 kB
xet
about 2 months ago
aa37c6c7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000433.jsonl.zst
141 kB
xet
about 2 months ago
576d4661
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000434.jsonl.zst
171 kB
xet
about 2 months ago
7532d9ac
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000435.jsonl.zst
129 kB
xet
about 2 months ago
81a00135
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000436.jsonl.zst
149 kB
xet
about 2 months ago
058a5e97
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000437.jsonl.zst
173 kB
xet
about 2 months ago
01f0be0e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000438.jsonl.zst
151 kB
xet
about 2 months ago
0fda1e7f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000439.jsonl.zst
158 kB
xet
about 2 months ago
6a247f60
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000440.jsonl.zst
106 kB
xet
about 2 months ago
d8a55700
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000441.jsonl.zst
131 kB
xet
about 2 months ago
44336143
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000442.jsonl.zst
138 kB
xet
about 2 months ago
f3888ab3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000443.jsonl.zst
123 kB
xet
about 2 months ago
3d8b07a7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000444.jsonl.zst
175 kB
xet
about 2 months ago
6497aff5
Load more
Sync this bucket
Mount this bucket
Total size
11.1 GB
Files
56,043
Last updated
Mar 24
Pre-warmed CDN
US
EU
US
EU
Contributors