Sentence Similarity
sentence-transformers
Safetensors
gemma3_text
feature-extraction
dense
Generated from Trainer
dataset_size:2609
loss:MultipleNegativesRankingLoss
Eval Results (legacy)
text-embeddings-inference
Instructions to use TextModel/Embedding-crime-indo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use TextModel/Embedding-crime-indo with sentence-transformers:
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("TextModel/Embedding-crime-indo")

    sentences = [
        "query: Kalau si koruptor ternyata udah nggak punya harta lagi buat bayar uang pengganti, apa konsekuensinya?",
        "passage: Hukumnya adalah tindak pidana yang diancam dengan pidana penjara paling lama 4 tahun atau pidana denda paling banyak kategori IV karena menggunakan ancaman kekerasan. (Pasal 302 KUHP)",
        "passage: Kalau harta bendanya tidak mencukupi, terpidana bisa dipidana penjara yang lamanya tidak melebihi ancaman maksimum pidana pokoknya dan sudah ditentukan langsung di dalam putusan pengadilan.",
        "passage: Penyitaan dan pelelangan harta bila uang pengganti tidak dibayar."
    ]
    embeddings = model.encode(sentences)

    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # [4, 4]
- Notebooks
- Google Colab
- Kaggle
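The snippet above returns a 4 x 4 similarity matrix; in practice the row for the `query:` sentence is what ranks the `passage:` sentences. The following is a minimal sketch of that ranking step, assuming cosine similarity as the scoring function. The vectors are invented toy values, not real model output, and `cosine` and `rank_passages` are hypothetical helper names, not part of sentence-transformers.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_passages(query_vec, passage_vecs):
    """Return passage indices ordered from most to least similar to the query."""
    scores = [cosine(query_vec, p) for p in passage_vecs]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


# Toy 3-dimensional vectors, invented for illustration only (assumption).
query_vec = [1.0, 0.0, 1.0]
passage_vecs = [
    [0.0, 1.0, 0.0],  # orthogonal to the query
    [1.0, 0.1, 0.9],  # nearly parallel to the query
    [0.5, 0.5, 0.0],  # partially aligned with the query
]

ranking = rank_passages(query_vec, passage_vecs)
print(ranking)
```

With real embeddings, `query_vec` would be the first row of `embeddings` and `passage_vecs` the remaining rows.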
{
  "backend": "tokenizers",
  "boi_token": "<start_of_image>",
  "bos_token": "<bos>",
  "clean_up_tokenization_spaces": false,
  "eoi_token": "<end_of_image>",
  "eos_token": "<eos>",
  "image_token": "<image_soft_token>",
  "is_local": false,
  "mask_token": "<mask>",
  "model_max_length": 2048,
  "model_specific_special_tokens": {
    "boi_token": "<start_of_image>",
    "eoi_token": "<end_of_image>",
    "image_token": "<image_soft_token>"
  },
  "pad_token": "<pad>",
  "padding_side": "right",
  "sp_model_kwargs": null,
  "spaces_between_special_tokens": false,
  "tokenizer_class": "GemmaTokenizer",
  "unk_token": "<unk>",
  "use_default_system_prompt": false
}
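As a quick sanity check on the configuration above, the sketch below parses a subset of it with the standard `json` module and confirms that each entry in `model_specific_special_tokens` matches the top-level token of the same name. The embedded text is copied from the configuration; `check_special_tokens` is a hypothetical helper written for this illustration, not a tokenizer API.

```python
import json

# Subset of the tokenizer configuration, embedded so the sketch is self-contained.
CONFIG_TEXT = """
{
  "boi_token": "<start_of_image>",
  "bos_token": "<bos>",
  "eoi_token": "<end_of_image>",
  "eos_token": "<eos>",
  "image_token": "<image_soft_token>",
  "model_max_length": 2048,
  "model_specific_special_tokens": {
    "boi_token": "<start_of_image>",
    "eoi_token": "<end_of_image>",
    "image_token": "<image_soft_token>"
  },
  "pad_token": "<pad>",
  "padding_side": "right",
  "tokenizer_class": "GemmaTokenizer"
}
"""


def check_special_tokens(cfg):
    """Return names whose model-specific token differs from the top-level one."""
    specific = cfg.get("model_specific_special_tokens", {})
    return [name for name, token in specific.items() if cfg.get(name) != token]


config = json.loads(CONFIG_TEXT)
mismatches = check_special_tokens(config)
max_len = config["model_max_length"]

print("mismatched tokens:", mismatches)
print("max length:", max_len, "| padding side:", config["padding_side"])
```

An empty mismatch list means the nested and top-level token definitions agree; inputs longer than `model_max_length` tokens would need truncation or chunking before encoding.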