wikilangs/ast
Tags: Feature Extraction, omarkamali/wikipedia-monthly, Asturian, wikilangs, nlp, tokenizer, embeddings, n-gram, markov, wikipedia, monolingual, family-romance_iberian
License: mit
Branch: main
Path: ast/visualizations
Size: 2.45 MB
1 contributor
History: 1 commit
Latest commit: omarkamali, "Upload all models and assets for ast (20251201)", f085801 (verified), about 12 hours ago
Files (each last changed by "Upload all models and assets for ast (20251201)", about 12 hours ago; "xet" marks files the listing flags as xet):

File                                Size      Storage
embedding_isotropy.png              46.6 kB
embedding_norms.png                 34.7 kB
embedding_similarity.png            142 kB    xet
markov_branching.png                37.9 kB
markov_contexts.png                 29.8 kB
markov_entropy.png                  67.7 kB
model_sizes.png                     52.3 kB
nearest_neighbors.png               68.7 kB
ngram_coverage.png                  64.4 kB
ngram_entropy.png                   40.6 kB
ngram_perplexity.png                55.6 kB
ngram_unique.png                    29.2 kB
performance_dashboard.png           274 kB    xet
position_encoding_comparison.png    114 kB    xet
tokenizer_compression.png           49.4 kB
tokenizer_fertility.png             45.1 kB
tokenizer_oov.png                   50.7 kB
tokenizer_total_tokens.png          45.4 kB
top20_words.png                     44.3 kB
tsne_sentences.png                  276 kB    xet
tsne_words.png                      651 kB    xet
vocab_coverage.png                  69.4 kB
vocab_freq_dist.png                 53 kB
zipf_law.png                        109 kB    xet