Collections
Discover the best community collections!
Collections trending this week
-
nvidia/Nemotron-RL-knowledge-web_search-mcqa
Viewer • Updated • 2.93k • 122 • 9 -
nvidia/Nemotron-RL-agent-workplace_assistant
Viewer • Updated • 1.8k • 204 • 16 -
nvidia/Nemotron-RL-instruction_following
Preview • Updated • 99 • 11 -
nvidia/Nemotron-RL-instruction_following-structured_outputs
Viewer • Updated • 9.95k • 474 • 34
-
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
Paper • 2105.09501 • Published • 1 -
Cross-modal Contrastive Learning for Speech Translation
Paper • 2205.02444 • Published -
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs
Paper • 2210.03052 • Published -
Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning
Paper • 2212.10240 • Published • 1
-
Qwen/Qwen3-Next-80B-A3B-Instruct
Text Generation • 81B • Updated • 1.19M • • 946 -
Qwen/Qwen3-Next-80B-A3B-Thinking
Text Generation • Updated • 88.3k • • 482 -
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation • 81B • Updated • 150k • 81 -
Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation • Updated • 18.9k • 52
-
Qwen3 Coder WebDev
🌍993Generate HTML/React code from a web app description
-
Qwen/Qwen3-Coder-480B-A35B-Instruct
Text Generation • 480B • Updated • 74.8k • • 1.3k -
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation • Updated • 138k • • 148 -
Qwen/Qwen3-Coder-30B-A3B-Instruct
Text Generation • Updated • 820k • • 954
-
ibm-granite/granite-4.0-micro
Text Generation • Updated • 87.6k • 260 -
ibm-granite/granite-4.0-micro-base
Text Generation • 3B • Updated • 1.75k • 37 -
ibm-granite/granite-4.0-h-micro
Text Generation • Updated • 16.5k • 136 -
ibm-granite/granite-4.0-h-micro-base
Text Generation • 3B • Updated • 4.16k • 33
-
nvidia/Nemotron-RL-knowledge-web_search-mcqa
Viewer • Updated • 2.93k • 122 • 9 -
nvidia/Nemotron-RL-agent-workplace_assistant
Viewer • Updated • 1.8k • 204 • 16 -
nvidia/Nemotron-RL-instruction_following
Preview • Updated • 99 • 11 -
nvidia/Nemotron-RL-instruction_following-structured_outputs
Viewer • Updated • 9.95k • 474 • 34
-
Qwen/Qwen3-Next-80B-A3B-Instruct
Text Generation • 81B • Updated • 1.19M • • 946 -
Qwen/Qwen3-Next-80B-A3B-Thinking
Text Generation • Updated • 88.3k • • 482 -
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation • 81B • Updated • 150k • 81 -
Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation • Updated • 18.9k • 52
-
Qwen3 Coder WebDev
🌍993Generate HTML/React code from a web app description
-
Qwen/Qwen3-Coder-480B-A35B-Instruct
Text Generation • 480B • Updated • 74.8k • • 1.3k -
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation • Updated • 138k • • 148 -
Qwen/Qwen3-Coder-30B-A3B-Instruct
Text Generation • Updated • 820k • • 954
-
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
Paper • 2105.09501 • Published • 1 -
Cross-modal Contrastive Learning for Speech Translation
Paper • 2205.02444 • Published -
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs
Paper • 2210.03052 • Published -
Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning
Paper • 2212.10240 • Published • 1
-
ibm-granite/granite-4.0-micro
Text Generation • Updated • 87.6k • 260 -
ibm-granite/granite-4.0-micro-base
Text Generation • 3B • Updated • 1.75k • 37 -
ibm-granite/granite-4.0-h-micro
Text Generation • Updated • 16.5k • 136 -
ibm-granite/granite-4.0-h-micro-base
Text Generation • 3B • Updated • 4.16k • 33