Image-Text-to-Text
Transformers
Safetensors
qwen3_5_moe
Merge
evolutionary-merge
darwin
darwin-v5
model-mri
reasoning
advanced-reasoning
chain-of-thought
thinking
qwen3.5
qwen
Mixture of Experts
mixture-of-experts
claude-opus
distillation
multimodal
vision-language
gpqa
benchmark
open-source
apache-2.0
layer-wise-merge
moe-merge
dead-expert-revival
coding-agent
tool-calling
long-context
262k-context
conversational