sarthak1/codemalt
Feature Extraction
Safetensors
Model2Vec
sentence-transformers
code-search-net/code_search_net
sentence-transformers/codesearchnet
code
distiller
code-search
code-embeddings
distillation
static-embeddings
tokenlearn
License: apache-2.0
454e47c
codemalt
327 MB
2 contributors
History: 12 commits
Latest commit: 454e47c by Sarthak, 11 months ago
feat: overhaul distiller package with unified CLI, enhanced evaluation, and modular structure
| Name | Size | Badges | Last commit message | Updated |
|---|---|---|---|---|
| evaluation | (directory) | | initial commit | 11 months ago |
| mteb_results | (directory) | | feat: added MTEB evaluation scripts | 11 months ago |
| patches | (directory) | | feat: created a cli to manage the complete generation process | 11 months ago |
| src | (directory) | | feat: overhaul distiller package with unified CLI, enhanced evaluation, and modular structure | 11 months ago |
| .gitattributes | 343 Bytes | | fix: migrate binary files to LFS tracking | 11 months ago |
| .gitignore | 171 Bytes | | chore: add env vars | 11 months ago |
| .python-version | 5 Bytes | Safe | initial commit | 11 months ago |
| LICENSE | 11.4 kB | Safe | initial commit | 11 months ago |
| MTEB_evaluate.py | 10.2 kB | | feat: added MTEB evaluation scripts | 11 months ago |
| README.md | 6.75 kB | | feat: added MTEB evaluation scripts | 11 months ago |
| analyze_mteb_results.py | 9.05 kB | | feat: added MTEB evaluation scripts | 11 months ago |
| config.json | 25 Bytes | Safe | feat: 4 stage training, refinement failed for first 3 | 11 months ago |
| distill.py | 3.73 kB | | initial commit | 11 months ago |
| evaluate.py | 16.3 kB | | initial commit | 11 months ago |
| model.safetensors | 311 MB | xet | feat: 4 stage training, refinement failed for first 3 | 11 months ago |
| modules.json | 278 Bytes | Safe | initial commit | 11 months ago |
| pipeline.skops | 3.84 MB | xet | initial commit | 11 months ago |
| pyproject.toml | 2.52 kB | | feat: added MTEB evaluation scripts | 11 months ago |
| tokenizer.json | 11.4 MB | xet | initial commit | 11 months ago |
| train_code_classification.py | 13.2 kB | | initial commit | 11 months ago |
| uv.lock | 337 kB | | feat: added MTEB evaluation scripts | 11 months ago |