toksuite/supertoken_models-llama_bigscience-bloom
Text Generation
Transformers
Safetensors
toksuite/toksuite_pretraining_data
5 languages
llama
toksuite
tokenization
bloom
multilingual
bpe
robustness
research
text-generation-inference
arxiv: 2512.20757
License: mit
main · 4.64 GB · 2 contributors · History: 23 commits
Latest commit: gsaltintas, "Update README.md" (06954d7, verified), 1 day ago
File                                  Scan  Size       Storage  Updated       Last commit message
.gitattributes                        Safe  1.69 kB    -        6 days ago    Upload model-performance-comparison.png
README.md                             -     7.99 kB    -        1 day ago     Update README.md
bigscience--bloom.yaml                Safe  68 Bytes   -        4 months ago  Upload tokenizer file bigscience--bloom.yaml - Upload model files
bigscience--bloom_super_mapping.json  Safe  4.37 MB    -        4 months ago  Upload tokenizer file bigscience--bloom_super_mapping.json - Upload model files
config.json                           Safe  610 Bytes  -        4 months ago  Upload config
generation_config.json                Safe  150 Bytes  -        4 months ago  Upload model files
model-performance-comparison.png      Safe  279 kB     xet      6 days ago    Upload model-performance-comparison.png
model.safetensors                     Safe  4.62 GB    xet      4 months ago  Upload model files
tokenizer.json                        Safe  12.3 MB    xet      4 months ago  Upload tokenizer file bigscience--bloom_vocab.json - Upload model files
tokenizer_config.json                 Safe  76 Bytes   -        4 months ago  Upload tokenizer file bigscience--bloom_info.json - Upload model files
toksuite-logo.png                     Safe  1 MB       xet      8 days ago    Upload toksuite-logo.png