-
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 339k • 295 -
jinaai/jina-reranker-m0
Text Classification • 2B • Updated • 46.9k • 113 -
jinaai/jina-reranker-v2-base-multilingual
Text Ranking • 0.3B • Updated • 389k • 336 -
jinaai/jina-embeddings-v2-base-en
Feature Extraction • 0.1B • Updated • 173k • 730
Bjorn Melin
BjornMelin
AI & ML interests
Large Language Models, AI Agents, Multi-Agent Orchestrations, Deep Learning, NLP, Local LLM Optimization.
Organizations
None yet
Datasets
Fine Tuning
-
Running62
GGUF Model VRAM Calculator
📈62Calculate VRAM requirements for LLM models
-
Running on CPU UpgradeFeatured999
Model Memory Utility
🚀999Calculate vRAM needed for model training and inference
-
RunningFeatured1.04k
Can You Run It? LLM version
🚀1.04kDetermine GPU requirements for running large language models
Legendary VL Models
Smol Models
My favorite smaller models under 10B parameters.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 50.9k • 365 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 14.5k • • 216 -
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation • 8B • Updated • 410k • • 836 -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 1.16M • • 606
Llama
-
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation • 3B • Updated • 145k • 13 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.53M • • 1.95k -
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 10.1M • • 5.32k -
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation • 8B • Updated • 145k • 32
LLMs
-
deepseek-ai/DeepSeek-V3
Text Generation • 685B • Updated • 813k • • 4.02k -
sentence-transformers/static-retrieval-mrl-en-v1
Sentence Similarity • Updated • 53 -
internlm/internlm3-8b-instruct
Text Generation • 9B • Updated • 9.62k • 229 -
NovaSky-AI/Sky-T1-32B-Preview
Text Generation • 33B • Updated • 69 • • 550
Embedding Models
Single 4090 Laptop GPU
-
nvidia/OpenReasoning-Nemotron-32B
Text Generation • 33B • Updated • 693 • • 122 -
Qwen/Qwen3-32B-AWQ
Text Generation • 33B • Updated • 199k • 120 -
OpenHands/openhands-lm-32b-v0.1
Text Generation • 33B • Updated • 369 • • 392 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
Text Generation • 15B • Updated • 294k • • 600
Leaderboards
-
RunningFeatured140
smolagents LLM leaderboard
🏆140A leaderboard for LLMs powering smolagents
-
RunningFeatured435
LLM Performance Leaderboard
🐨435View LLM performance rankings
-
RunningFeatured194
Low-bit Quantized Open LLM Leaderboard
🏆194Track, rank and evaluate open LLMs and chatbots
-
Running1.44k
UGI Leaderboard
📢1.44kUncensored General Intelligence Leaderboard
Coding Models
Google
Qwen
-
Qwen/Qwen3-30B-A3B-Instruct-2507
Text Generation • 31B • Updated • 1.31M • • 742 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 29.2k • • 394 -
Qwen/Qwen3-32B
Text Generation • 33B • Updated • 1.64M • • 628 -
unsloth/Qwen3-30B-A3B-GGUF
Text Generation • 31B • Updated • 87.9k • 265
Rerankers
-
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 339k • 295 -
jinaai/jina-reranker-m0
Text Classification • 2B • Updated • 46.9k • 113 -
jinaai/jina-reranker-v2-base-multilingual
Text Ranking • 0.3B • Updated • 389k • 336 -
jinaai/jina-embeddings-v2-base-en
Feature Extraction • 0.1B • Updated • 173k • 730
Embedding Models
Datasets
Single 4090 Laptop GPU
-
nvidia/OpenReasoning-Nemotron-32B
Text Generation • 33B • Updated • 693 • • 122 -
Qwen/Qwen3-32B-AWQ
Text Generation • 33B • Updated • 199k • 120 -
OpenHands/openhands-lm-32b-v0.1
Text Generation • 33B • Updated • 369 • • 392 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
Text Generation • 15B • Updated • 294k • • 600
Fine Tuning
-
Running62
GGUF Model VRAM Calculator
📈62Calculate VRAM requirements for LLM models
-
Running on CPU UpgradeFeatured999
Model Memory Utility
🚀999Calculate vRAM needed for model training and inference
-
RunningFeatured1.04k
Can You Run It? LLM version
🚀1.04kDetermine GPU requirements for running large language models
Leaderboards
-
RunningFeatured140
smolagents LLM leaderboard
🏆140A leaderboard for LLMs powering smolagents
-
RunningFeatured435
LLM Performance Leaderboard
🐨435View LLM performance rankings
-
RunningFeatured194
Low-bit Quantized Open LLM Leaderboard
🏆194Track, rank and evaluate open LLMs and chatbots
-
Running1.44k
UGI Leaderboard
📢1.44kUncensored General Intelligence Leaderboard
Legendary VL Models
Coding Models
Smol Models
My favorite smaller models under 10B parameters.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 50.9k • 365 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 14.5k • • 216 -
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation • 8B • Updated • 410k • • 836 -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 1.16M • • 606
Google
Llama
-
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation • 3B • Updated • 145k • 13 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.53M • • 1.95k -
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 10.1M • • 5.32k -
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation • 8B • Updated • 145k • 32
Qwen
-
Qwen/Qwen3-30B-A3B-Instruct-2507
Text Generation • 31B • Updated • 1.31M • • 742 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 29.2k • • 394 -
Qwen/Qwen3-32B
Text Generation • 33B • Updated • 1.64M • • 628 -
unsloth/Qwen3-30B-A3B-GGUF
Text Generation • 31B • Updated • 87.9k • 265
LLMs
-
deepseek-ai/DeepSeek-V3
Text Generation • 685B • Updated • 813k • • 4.02k -
sentence-transformers/static-retrieval-mrl-en-v1
Sentence Similarity • Updated • 53 -
internlm/internlm3-8b-instruct
Text Generation • 9B • Updated • 9.62k • 229 -
NovaSky-AI/Sky-T1-32B-Preview
Text Generation • 33B • Updated • 69 • • 550