Curated models for AI infrastructure, LLM deployment, and edge computing. Optimized for NVIDIA DGX Spark and Docker Swarm clusters.
- Qwen/Qwen2.5-Coder-32B-Instruct
  Text Generation • 33B • 767k downloads • 2k likes
- sentence-transformers/all-MiniLM-L6-v2
  Sentence Similarity • 22.7M • 206M downloads • 4.58k likes
- BAAI/bge-large-en-v1.5
  Feature Extraction • 6.09M downloads • 633 likes
- meta-llama/Llama-3.3-70B-Instruct
  Text Generation • 611k downloads • 2.68k likes