Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
khopilot
/
khmer-tokenizer-v7
like
1
Feature Extraction
Transformers
Khmer
khmer
tokenization
graph-regularization
sentencepiece
nlp
semantic-embeddings
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
khmer-tokenizer-v7
41 MB
1 contributor
History:
29 commits
khopilot
Upload CHANGELOG.md with huggingface_hub
7f966f9
verified
4 months ago
CHANGELOG.md
2.4 kB
Upload CHANGELOG.md with huggingface_hub
4 months ago
CITATION.cff
757 Bytes
Upload CITATION.cff with huggingface_hub
4 months ago
README.md
7.03 kB
Upload README.md with huggingface_hub
4 months ago
edges_pruned.tsv
126 kB
Upload edges_pruned.tsv with huggingface_hub
4 months ago
lexeme_embeddings.pt
38.9 MB
xet
Upload lexeme_embeddings.pt with huggingface_hub
4 months ago
lexeme_subwords_prod8k_v22.tsv
688 kB
Upload lexeme_subwords_prod8k_v22.tsv with huggingface_hub
4 months ago
metrics_corrected.yaml
5.86 kB
Upload metrics_corrected.yaml with huggingface_hub
4 months ago
nodes.tsv
950 kB
Upload nodes.tsv with huggingface_hub
4 months ago
spm_km_8k_prod.model
164 kB
xet
Upload spm_km_8k_prod.model with huggingface_hub
4 months ago
spm_km_8k_prod.vocab
169 kB
Upload spm_km_8k_prod.vocab with huggingface_hub
4 months ago
tokenizer_config.json
410 Bytes
Upload tokenizer_config.json with huggingface_hub
4 months ago