lightonai/LateOn-Code-pretrain
Sentence Similarity
ONNX
Safetensors
sentence-transformers
lightonai/cornstack
English
code
PyLate
modernbert
ColBERT
feature-extraction
Generated from Trainer
dataset_size:21502474
loss:CachedContrastive
embeddings
retrieval
code search
Eval Results (legacy)
text-embeddings-inference
arxiv: 4 papers
License: apache-2.0
🇪🇺 Region: EU
Branch: main
Size: 1.35 GB
2 contributors
History: 12 commits
Latest commit: 71251a6 by NohTow, "Do not perform query expansion" (verified, 1 day ago)
File                               Size       Storage  Last commit                                   Updated
1_Dense                            -          -        Add new ColBERT model                         about 1 month ago
.gitattributes                     1.52 kB    -        Update .gitattributes                         3 days ago
README.md                          247 kB     -        Create README.md                              2 days ago
config.json                        1.41 kB    -        Add new ColBERT model                         about 1 month ago
config_sentence_transformers.json  762 Bytes  -        Do not perform query expansion                1 day ago
model.onnx                         597 MB     xet      Upload model.onnx with huggingface_hub        24 days ago
model.safetensors                  596 MB     xet      Add new ColBERT model                         about 1 month ago
model_int8.onnx                    150 MB     xet      Upload model_int8.onnx with huggingface_hub   24 days ago
modules.json                       216 Bytes  -        Add new ColBERT model                         about 1 month ago
onnx_config.json                   792 Bytes  -        Upload onnx_config.json with huggingface_hub  24 days ago
sentence_bert_config.json          58 Bytes   -        Add new ColBERT model                         about 1 month ago
special_tokens_map.json            581 Bytes  -        Add new ColBERT model                         about 1 month ago
tokenizer.json                     3.58 MB    -        Add new ColBERT model                         about 1 month ago
tokenizer_config.json              21.4 kB    -        Add new ColBERT model                         about 1 month ago