Bjorn Melin
BjornMelin
AI & ML interests
Large Language Models, AI Agents, Multi-Agent Orchestrations, Deep Learning, NLP, Local LLM Optimization.
Recent Activity
updated a collection about 1 month ago: Google
liked a model about 1 month ago: unsloth/gemma-4-26B-A4B-it-GGUF
updated a collection 6 months ago: Rerankers
Organizations
None yet
Datasets
Fine Tuning
- GGUF Model VRAM Calculator
  Running • 65
  📈 Calculate VRAM requirements for LLM models
- Model Memory Utility
  Running on CPU Upgrade • Agents • Featured • 1.01k
  🚀 Calculate GPU memory needed for training Hugging Face models
- Can You Run It? LLM version
  Running • Featured • 1.05k
  🚀 Calculate GPU needs for running LLMs on your hardware
Legendary VL Models
Smol Models
My favorite smaller models under 10B parameters.
- unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
  Text Generation • 8B • Updated • 52k • 397
- nvidia/Llama-3.1-Nemotron-Nano-8B-v1
  Text Generation • 8B • Updated • 14.2k • 222
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
  Text Generation • 8B • Updated • 867k • 859
- Qwen/Qwen2.5-Coder-7B-Instruct
  Text Generation • 8B • Updated • 2.35M • 705
Llama
- MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
  Text Generation • 3B • Updated • 64.3k • 15
- meta-llama/Llama-3.2-3B-Instruct
  Text Generation • 3B • Updated • 2.47M • 2.12k
- meta-llama/Llama-3.1-8B-Instruct
  Text Generation • 8B • Updated • 9.39M • 5.81k
- MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
  Text Generation • 8B • Updated • 79.7k • 34
LLMs
- deepseek-ai/DeepSeek-V3
  Text Generation • 685B • Updated • 1.14M • 4.07k
- sentence-transformers/static-retrieval-mrl-en-v1
  Sentence Similarity • Updated • 57
- internlm/internlm3-8b-instruct
  Text Generation • Updated • 43.9k • 231
- NovaSky-AI/Sky-T1-32B-Preview
  Text Generation • 33B • Updated • 81 • 548
Embedding Models
Single 4090 Laptop GPU
- nvidia/OpenReasoning-Nemotron-32B
  Text Generation • 33B • Updated • 2.51k • 124
- Qwen/Qwen3-32B-AWQ
  Text Generation • 33B • Updated • 872k • 131
- OpenHands/openhands-lm-32b-v0.1
  Text Generation • 33B • Updated • 127 • 391
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
  Text Generation • 15B • Updated • 473k • 643
Leaderboards
Coding Models
Google
- google/gemma-3-27b-it-qat-q4_0-gguf
  Image-Text-to-Text • 27B • Updated • 825 • 399
- unsloth/gemma-3-27b-it-GGUF
  Image-Text-to-Text • 27B • Updated • 26.8k • 199
- google/gemma-3-27b-it
  Image-Text-to-Text • 27B • Updated • 625k • 1.97k
- google/gemma-3n-E4B-it
  Image-Text-to-Text • Updated • 41.3k • 912
Qwen
Rerankers
- Qwen/Qwen3-Reranker-0.6B
  Text Ranking • 0.6B • Updated • 1.54M • 347
- jinaai/jina-reranker-m0
  Text Classification • 2B • Updated • 62.9k • 118
- jinaai/jina-reranker-v2-base-multilingual
  Text Ranking • 0.3B • Updated • 1.45M • 350
- jinaai/jina-embeddings-v2-base-en
  Feature Extraction • 0.1B • Updated • 93k • 732