Collections
Discover the best community collections!
Collections trending this week
-
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Paper • 2401.14196 • Published • 71 -
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Paper • 2401.02954 • Published • 53 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 141 -
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Paper • 2403.05525 • Published • 49
-
Compare Siglip1 Siglip2
🚀 53 • Compare SigLIP1 and SigLIP2 on zero-shot classification
-
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Paper • 2502.14786 • Published • 159 -
google/siglip2-base-patch16-224
Zero-Shot Image Classification • Updated • 427k • 90 -
google/siglip2-base-patch16-256
Zero-Shot Image Classification • 0.4B • Updated • 57.3k • 6
-
Triangle104/Mistral-Nemo-Instruct-2407-abliterated-Q4_K_S-GGUF
12B • Updated • 24 • 1 -
Triangle104/Mistral-Nemo-Instruct-2407-abliterated-Q4_K_M-GGUF
12B • Updated • 13 -
Triangle104/Mistral-Nemo-Instruct-2407-abliterated-Q5_K_S-GGUF
12B • Updated • 10 -
Triangle104/Mistral-Nemo-Instruct-2407-abliterated-Q5_K_M-GGUF
12B • Updated • 18
-
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation • 8B • Updated • 111k • 33 -
mradermacher/DeepSeek-R1-Distill-Llama-8B-Abliterated-GGUF
8B • Updated • 3.26k • 33 -
mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated-GGUF
8B • Updated • 19.8k • 169 -
bartowski/Meta-Llama-3.1-8B-Instruct-abliterated-GGUF
Text Generation • 8B • Updated • 4.43k • 9
-
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Paper • 2501.18585 • Published • 61 -
RWKV-7 "Goose" with Expressive Dynamic State Evolution
Paper • 2503.14456 • Published • 153 -
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Paper • 2503.15265 • Published • 46 -
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
Paper • 2503.15558 • Published • 50