MiniLM - CoSQA
Fine-tuned versions of the all-MiniLM model on the CoSQA dataset.

Devy1/MiniLM-cosqa-16 • Sentence Similarity • 22.7M • Updated Sep 30, 2025 • 2
Devy1/MiniLM-cosqa-32 • Sentence Similarity • 22.7M • Updated Sep 30, 2025 • 3
Devy1/MiniLM-cosqa-64 • Sentence Similarity • 22.7M • Updated Sep 30, 2025 • 3
Devy1/MiniLM-cosqa-128 • Sentence Similarity • 22.7M • Updated Sep 30, 2025 • 3

Quantization for Code Generation
Collection of AQLM-quantized models from the paper "Quantizing Large Language Models for Code Generation: A Differentiated Replication".

Quantizing Large Language Models for Code Generation: A Differentiated Replication • Paper • 2503.07103 • Published Mar 10, 2025 • 8
Devy1/CodeLlama-7b-hf-AQLM-8bit-rnd-4x15 • Text Generation • 4B • Updated Mar 7, 2025 • 4
Devy1/CodeLlama-7b-hf-AQLM-4bit-rnd-2x15 • Text Generation • 2B • Updated Mar 7, 2025 • 5
Devy1/CodeLlama-7b-hf-AQLM-3bit-rnd-2x12 • Text Generation • 2B • Updated Mar 7, 2025 • 3