Curated models for AI infrastructure, LLM deployment, and edge computing. Optimized for NVIDIA DGX Spark and Docker Swarm clusters.
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation ⢠33B ⢠Updated ⢠524k ⢠⢠1.98k -
sentence-transformers/all-MiniLM-L6-v2
Sentence Similarity ⢠22.7M ⢠Updated ⢠150M ⢠⢠4.38k -
BAAI/bge-large-en-v1.5
Feature Extraction ⢠0.3B ⢠Updated ⢠4.32M ⢠⢠622 -
meta-llama/Llama-3.3-70B-Instruct
Text Generation ⢠71B ⢠Updated ⢠722k ⢠⢠2.64k