Collections
Discover the best community collections!
Collections trending this week
-
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Paper • 2310.16818 • Published • 33 -
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Paper • 2401.02954 • Published • 56 -
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Paper • 2401.06066 • Published • 63 -
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Paper • 2401.14196 • Published • 73
-
MaziyarPanahi/calme-3.2-instruct-78b
Text Generation • 78B • Updated • 215 • 209 -
MaziyarPanahi/calme-3.1-instruct-78b
Text Generation • 78B • Updated • 43 • 7 -
MaziyarPanahi/calme-3.1-instruct-3b
Text Generation • 3B • Updated • 29 • 4 -
MaziyarPanahi/calme-3.2-instruct-3b
Text Generation • 3B • Updated • 21 • 3
-
ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF
Text Generation • 0.5B • Updated • 751 • 10 -
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF
Text Generation • 2B • Updated • 3.9k • 18 -
ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF
Text Generation • 3B • Updated • 2.08k • 11 -
ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF
Text Generation • 8B • Updated • 2.2k • 10
-
allenai/tulu-3-sft-mixture
Viewer • Updated • 939k • 18.4k • 251 -
allenai/llama-3.1-tulu-3-8b-preference-mixture
Viewer • Updated • 273k • 1.85k • 26 -
allenai/llama-3.1-tulu-3-70b-preference-mixture
Viewer • Updated • 337k • 72 • 19 -
allenai/llama-3.1-tulu-3-405b-preference-mixture
Viewer • Updated • 361k • 38 • 6
-
apple/aimv2-large-patch14-224
Image Feature Extraction • 0.3B • Updated • 1.01k • 62 -
apple/aimv2-huge-patch14-224
Image Feature Extraction • 0.7B • Updated • 218 • 13 -
apple/aimv2-1B-patch14-224
Image Feature Extraction • 1B • Updated • 148 • 8 -
apple/aimv2-3B-patch14-224
Image Feature Extraction • 3B • Updated • 244 • 4