applied-ai-018/peacock-data-public-evaluation
peacock-data-public-evaluation/Megatron-DeepSpeed/tools/openwebtext
57.6 kB, 1 contributor, History: 1 commit
Latest commit: applied-ai-018, "Add files using upload-large-folder tool" (ced76ab, verified), about 1 year ago
File                        Scan  Size     Last commit                               Updated
README.md                   Safe  3.43 kB  Add files using upload-large-folder tool  about 1 year ago
add_id.py                   Safe  1.64 kB  Add files using upload-large-folder tool  about 1 year ago
blacklist_urls.py           Safe  6.78 kB  Add files using upload-large-folder tool  about 1 year ago
cleanup_dataset.py          Safe  3.68 kB  Add files using upload-large-folder tool  about 1 year ago
cleanup_fix_dataset.py      Safe  6.66 kB  Add files using upload-large-folder tool  about 1 year ago
filter_ngrams.py            Safe  18.3 kB  Add files using upload-large-folder tool  about 1 year ago
find_duplicates.py          Safe  11.5 kB  Add files using upload-large-folder tool  about 1 year ago
group_duplicate_url.py      Safe  2.66 kB  Add files using upload-large-folder tool  about 1 year ago
merge_jsons.py              Safe  1.02 kB  Add files using upload-large-folder tool  about 1 year ago
remove_group_duplicates.py  Safe  1.98 kB  Add files using upload-large-folder tool  about 1 year ago