Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
lightonai
/
LateOn-Code-edge-pretrain
like
3
Follow
LightOn AI
587
Sentence Similarity
ONNX
Safetensors
sentence-transformers
lightonai/cornstack
English
code
PyLate
modernbert
ColBERT
feature-extraction
Generated from Trainer
dataset_size:21502474
loss:CachedContrastive
embeddings
retrieval
code search
Eval Results (legacy)
text-embeddings-inference
arxiv:
4 papers
License:
apache-2.0
🇪🇺 Region: EU
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
LateOn-Code-edge-pretrain
157 MB
Ctrl+K
Ctrl+K
2 contributors
History:
13 commits
NohTow
Create README.md
4ca3a44
verified
about 2 months ago
1_Dense
Add new ColBERT model
3 months ago
2_Dense
Add new ColBERT model
3 months ago
.gitattributes
Safe
1.52 kB
Update .gitattributes
about 2 months ago
README.md
Safe
247 kB
Create README.md
about 2 months ago
config.json
Safe
1.28 kB
Add new ColBERT model
3 months ago
config_sentence_transformers.json
Safe
762 Bytes
Add new ColBERT model
3 months ago
model.onnx
Safe
68 MB
xet
Fix ONNX export: add final projection layer (512->48 dim)
2 months ago
model.safetensors
Safe
67.2 MB
xet
Add new ColBERT model
3 months ago
model_int8.onnx
Safe
17.2 MB
xet
Fix INT8 ONNX export: add final projection layer (512->48 dim)
2 months ago
modules.json
Safe
319 Bytes
Add new ColBERT model
3 months ago
onnx_config.json
Safe
795 Bytes
Upload onnx_config.json with huggingface_hub
2 months ago
sentence_bert_config.json
Safe
57 Bytes
Add new ColBERT model
3 months ago
special_tokens_map.json
Safe
581 Bytes
Add new ColBERT model
3 months ago
tokenizer.json
Safe
3.58 MB
Add new ColBERT model
3 months ago
tokenizer_config.json
Safe
21.4 kB
Add new ColBERT model
3 months ago