Curated models for AI infrastructure, LLM deployment, and edge computing. Optimized for NVIDIA DGX Spark and Docker Swarm clusters.
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.2M • • 2.02k -
sentence-transformers/all-MiniLM-L6-v2
Sentence Similarity • 22.7M • Updated • 247M • • 4.76k -
BAAI/bge-large-en-v1.5
Feature Extraction • 0.3B • Updated • 15.1M • • 659 -
meta-llama/Llama-3.3-70B-Instruct
Text Generation • 71B • Updated • 795k • • 2.75k