google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 820k • • 1.38k
mamei16/chonky_distilbert-base-multilingual-cased Token Classification • 0.1B • Updated Nov 14, 2025 • 18 • 4
Text chunking / splitting models Collection It intelligently segments text into meaningful semantic chunks. Could be useful for RAG systems as text-chunking module. • 4 items • Updated 14 days ago • 1
Text chunking / splitting models Collection It intelligently segments text into meaningful semantic chunks. Could be useful for RAG systems as text-chunking module. • 4 items • Updated 14 days ago • 1
mirth/chonky_mmbert_small_multilingual_1 Token Classification • 0.1B • Updated Oct 23, 2025 • 183 • 23
mirth/chonky_mmbert_small_multilingual_1 Token Classification • 0.1B • Updated Oct 23, 2025 • 183 • 23
mirth/chonky_mmbert_small_multilingual_1 Token Classification • 0.1B • Updated Oct 23, 2025 • 183 • 23
mamei16/chonky_distilbert_base_uncased_1.1 Token Classification • 66.4M • Updated Nov 13, 2025 • 14 • 2
mirth/chonky_distilbert_base_uncased_1 Token Classification • 66.4M • Updated Apr 26, 2025 • 29.8k • • 15
mirth/chonky_modernbert_base_1 Token Classification • 0.1B • Updated Apr 26, 2025 • 32.4k • • 6
mirth/chonky_modernbert_large_1 Token Classification • 0.4B • Updated Apr 26, 2025 • 1.91k • • 2
mirth/chonky_modernbert_large_1 Token Classification • 0.4B • Updated Apr 26, 2025 • 1.91k • • 2
mirth/chonky_modernbert_large_1 Token Classification • 0.4B • Updated Apr 26, 2025 • 1.91k • • 2