FUXI/yuyan-10b
Tags: PyTorch, Chinese, bert
License: apache-2.0
Branch: main
Path: yuyan-10b/tools/openwebtext
Total size: 62.7 kB
1 contributor
History: 1 commit
Latest commit: Shawn001, "Upload 21 files" (1101a21), almost 3 years ago
File                         Size      Last commit       Updated
README.md                    3.43 kB   Upload 21 files   almost 3 years ago
add_id.py                    2.2 kB    Upload 21 files   almost 3 years ago
blacklist_urls.py            7.34 kB   Upload 21 files   almost 3 years ago
cleanup_dataset.py           4.24 kB   Upload 21 files   almost 3 years ago
cleanup_fix_dataset.py       7.23 kB   Upload 21 files   almost 3 years ago
filter_ngrams.py             18.9 kB   Upload 21 files   almost 3 years ago
find_duplicates.py           12 kB     Upload 21 files   almost 3 years ago
group_duplicate_url.py       3.22 kB   Upload 21 files   almost 3 years ago
merge_jsons.py               1.58 kB   Upload 21 files   almost 3 years ago
remove_group_duplicates.py   2.54 kB   Upload 21 files   almost 3 years ago