Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
dicta-il
/
neodictabert-bilingual-embed
like
2
Follow
DICTA: The Israel Center for Text Analysis
166
Sentence Similarity
sentence-transformers
Safetensors
Hebrew
English
neobert
feature-extraction
dense
Generated from Trainer
dataset_size:40680
loss:CachedMultipleNegativesRankingLoss
custom_code
arxiv:
2510.20386
License:
cc-by-4.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
neodictabert-bilingual-embed
730 MB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
Shaltiel
Upload folder using huggingface_hub
1da8625
verified
2 months ago
1_Pooling
Upload folder using huggingface_hub
2 months ago
.gitattributes
Safe
1.52 kB
initial commit
2 months ago
README.md
3.85 kB
Upload folder using huggingface_hub
2 months ago
config.json
1.65 kB
Upload folder using huggingface_hub
2 months ago
config_sentence_transformers.json
Safe
283 Bytes
Upload folder using huggingface_hub
2 months ago
model.safetensors
725 MB
xet
Upload folder using huggingface_hub
2 months ago
modeling_neobert.py
Safe
26.6 kB
Upload folder using huggingface_hub
2 months ago
modules.json
Safe
229 Bytes
Upload folder using huggingface_hub
2 months ago
sentence_bert_config.json
Safe
58 Bytes
Upload folder using huggingface_hub
2 months ago
special_tokens_map.json
Safe
971 Bytes
Upload folder using huggingface_hub
2 months ago
tokenizer.json
3.38 MB
Upload folder using huggingface_hub
2 months ago
tokenizer_config.json
1.26 kB
Upload folder using huggingface_hub
2 months ago
vocab.txt
1.3 MB
Upload folder using huggingface_hub
2 months ago