Bucket: HCAI-Lab/dolma3-6t-sample-5000-docs (Human-Centered AI Lab)
Path: HCAI-Lab/dolma3-6t-sample-5000-docs/worker_0066
11.1 GB, 56,043 files, updated about 2 months ago
Name | Size | Uploaded | Xet hash
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000869.jsonl.zst | 160 kB | about 2 months ago | f7b059d7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000870.jsonl.zst | 115 kB | about 2 months ago | 186da870
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000871.jsonl.zst | 166 kB | about 2 months ago | 937cb158
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000872.jsonl.zst | 136 kB | about 2 months ago | 84a7143c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000873.jsonl.zst | 123 kB | about 2 months ago | 2bf1b69b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000874.jsonl.zst | 233 kB | about 2 months ago | 4c2fe119
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000875.jsonl.zst | 164 kB | about 2 months ago | d3a8208a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000876.jsonl.zst | 181 kB | about 2 months ago | f301983a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000877.jsonl.zst | 167 kB | about 2 months ago | 28043c18
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000878.jsonl.zst | 138 kB | about 2 months ago | 39318496
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000879.jsonl.zst | 329 kB | about 2 months ago | c70016d7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000880.jsonl.zst | 89.9 kB | about 2 months ago | 9baabfe3
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000881.jsonl.zst | 70.9 kB | about 2 months ago | 190a3715
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000882.jsonl.zst | 144 kB | about 2 months ago | c10d765f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000883.jsonl.zst | 165 kB | about 2 months ago | 4badf4ce
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000884.jsonl.zst | 111 kB | about 2 months ago | ddb73d32
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000885.jsonl.zst | 112 kB | about 2 months ago | f6ce2dba
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000886.jsonl.zst | 144 kB | about 2 months ago | a1d0893a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000887.jsonl.zst | 102 kB | about 2 months ago | a62ff8e6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000888.jsonl.zst | 88 kB | about 2 months ago | 10608029
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000889.jsonl.zst | 126 kB | about 2 months ago | 1f77c34a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000890.jsonl.zst | 103 kB | about 2 months ago | 2202bbdc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000891.jsonl.zst | 101 kB | about 2 months ago | 1de2b94d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000892.jsonl.zst | 147 kB | about 2 months ago | 87e8b53a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000893.jsonl.zst | 197 kB | about 2 months ago | 0204618f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000894.jsonl.zst | 134 kB | about 2 months ago | 316a2abe
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000895.jsonl.zst | 172 kB | about 2 months ago | e4453b03
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000896.jsonl.zst | 160 kB | about 2 months ago | dbb7d65e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000897.jsonl.zst | 137 kB | about 2 months ago | 62138013
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000898.jsonl.zst | 143 kB | about 2 months ago | 0b156078
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000899.jsonl.zst | 154 kB | about 2 months ago | ce46340e
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000900.jsonl.zst | 112 kB | about 2 months ago | 6360a5ea
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000901.jsonl.zst | 93.4 kB | about 2 months ago | 80339fe4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000902.jsonl.zst | 165 kB | about 2 months ago | 4652e134
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000903.jsonl.zst | 133 kB | about 2 months ago | d2f190c4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000904.jsonl.zst | 110 kB | about 2 months ago | 330c0f94
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000905.jsonl.zst | 179 kB | about 2 months ago | d74f7de7
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000906.jsonl.zst | 154 kB | about 2 months ago | a11b5d5f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000907.jsonl.zst | 184 kB | about 2 months ago | 737d4886
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000908.jsonl.zst | 132 kB | about 2 months ago | c721f07c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000909.jsonl.zst | 147 kB | about 2 months ago | 984531fe
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000910.jsonl.zst | 105 kB | about 2 months ago | 9b6f6f31
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000911.jsonl.zst | 140 kB | about 2 months ago | 12cf907f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000912.jsonl.zst | 106 kB | about 2 months ago | c9a0fd33
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000913.jsonl.zst | 123 kB | about 2 months ago | 2ab819d8
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000914.jsonl.zst | 82.4 kB | about 2 months ago | 717a40ae
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000915.jsonl.zst | 143 kB | about 2 months ago | 03df42e2
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000916.jsonl.zst | 113 kB | about 2 months ago | 01295817
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000917.jsonl.zst | 116 kB | about 2 months ago | f7a2168a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000918.jsonl.zst | 227 kB | about 2 months ago | f83b7de5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000919.jsonl.zst | 64.7 kB | about 2 months ago | 11015417
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000920.jsonl.zst | 154 kB | about 2 months ago | 84b65230
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000921.jsonl.zst | 130 kB | about 2 months ago | 55393220
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000922.jsonl.zst | 117 kB | about 2 months ago | 0ba7780b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000923.jsonl.zst | 175 kB | about 2 months ago | f40608d9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000924.jsonl.zst | 117 kB | about 2 months ago | 935e73e1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000925.jsonl.zst | 154 kB | about 2 months ago | 9a6abe7f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000926.jsonl.zst | 128 kB | about 2 months ago | 551c2e15
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000927.jsonl.zst | 318 kB | about 2 months ago | f780974f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000928.jsonl.zst | 148 kB | about 2 months ago | 6ece7203
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000929.jsonl.zst | 167 kB | about 2 months ago | 51ef1346
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000930.jsonl.zst | 104 kB | about 2 months ago | 8c4f400d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000931.jsonl.zst | 175 kB | about 2 months ago | 56ed7af1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000932.jsonl.zst | 89.9 kB | about 2 months ago | 9f0c4a02
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000933.jsonl.zst | 170 kB | about 2 months ago | 3032dea4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000934.jsonl.zst | 198 kB | about 2 months ago | 10e310cf
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000935.jsonl.zst | 127 kB | about 2 months ago | a5955b56
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000936.jsonl.zst | 255 kB | about 2 months ago | 9465dc75
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000937.jsonl.zst | 122 kB | about 2 months ago | 5969e818
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000938.jsonl.zst | 150 kB | about 2 months ago | ac4b8fd4
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000939.jsonl.zst | 144 kB | about 2 months ago | 689b38a6
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000940.jsonl.zst | 137 kB | about 2 months ago | 59e70072
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000941.jsonl.zst | 179 kB | about 2 months ago | 0a32974d
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000942.jsonl.zst | 131 kB | about 2 months ago | 1c652fcb
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000943.jsonl.zst | 170 kB | about 2 months ago | 98d44d87
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000944.jsonl.zst | 149 kB | about 2 months ago | 06f4af9f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000945.jsonl.zst | 86.4 kB | about 2 months ago | b93aea76
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000946.jsonl.zst | 135 kB | about 2 months ago | 075cff7a
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000947.jsonl.zst | 143 kB | about 2 months ago | a996a3c1
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000948.jsonl.zst | 130 kB | about 2 months ago | 45ad479b
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000949.jsonl.zst | 187 kB | about 2 months ago | 2fba10ad
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000950.jsonl.zst | 112 kB | about 2 months ago | e603f2cc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000951.jsonl.zst | 159 kB | about 2 months ago | a152e440
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000953.jsonl.zst | 62.4 kB | about 2 months ago | c8bc95fe
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000954.jsonl.zst | 86 kB | about 2 months ago | 81947e60
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000955.jsonl.zst | 81.2 kB | about 2 months ago | c7bf4cd9
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000956.jsonl.zst | 139 kB | about 2 months ago | 602b0291
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000957.jsonl.zst | 107 kB | about 2 months ago | 67225d13
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000958.jsonl.zst | 170 kB | about 2 months ago | d605534f
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000959.jsonl.zst | 128 kB | about 2 months ago | 5e223757
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000960.jsonl.zst | 160 kB | about 2 months ago | 1f3d5932
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000961.jsonl.zst | 106 kB | about 2 months ago | 63656d89
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000962.jsonl.zst | 198 kB | about 2 months ago | 4ece647c
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000963.jsonl.zst | 157 kB | about 2 months ago | debebfbc
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000964.jsonl.zst | 117 kB | about 2 months ago | 9e3c3cc5
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0018__shard_00000965.jsonl.zst | 31.1 kB | about 2 months ago | 4a932016
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000000.jsonl.zst | 122 kB | about 2 months ago | 75bc0669
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000001.jsonl.zst | 207 kB | about 2 months ago | 6defa108
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000002.jsonl.zst | 129 kB | about 2 months ago | e7060749
soc127__phase1_pool_shared__common_crawl__part_005__data__common_crawl-science_math_and_technology-0019__shard_00000003.jsonl.zst | 185 kB | about 2 months ago | 85fdd0fc
Total size: 11.1 GB
Files: 56,043
Last updated: Mar 24
Pre-warmed CDN: US, EU