1. Introduction
GRM-2.6-Opus is a merge of OrionLLM/GRM-2.6-Plus and rico03/Qwen3.6-27B-Claude-Opus-Reasoning-Distilled.
GRM-2.6-Opus is a general-purpose AI model optimized for difficult, high-complexity tasks. It is designed to deliver stronger performance for its size while remaining practical, efficient, and accessible for advanced local and research-oriented use.
Compared with GRM-2.6-Plus, the model follows an Opus-style reasoning format, producing more structured, organized, and deliberate reasoning. The merge improves its handling of terminal agents, coding workflows, and complex problem-solving tasks by drawing on the reasoning and agentic behavior of the Claude Opus-style distilled model.
GRM-2.6-Opus demonstrates improvements over the original GRM-2.6-Plus, especially in structured reasoning, coding, agent workflows, and high-difficulty STEM evaluation.
2. Key Capabilities
- Opus-Style Structured Reasoning: GRM-2.6-Opus uses a more organized reasoning format, helping it produce clearer and more reliable solutions for complex tasks.
- Improved Terminal Agent Ability: The model is better suited for terminal-based agents, tool-style workflows, debugging, code execution planning, and multi-step technical tasks.
- Stronger Coding Performance: The merge improves code reasoning, implementation planning, and difficult programming task handling.
- Enhanced General-Purpose Intelligence: GRM-2.6-Opus remains useful across research, STEM, chat, coding, local agents, and advanced problem-solving.
- Improved Over GRM-2.6-Plus: The model builds on the original GRM-2.6-Plus and adds stronger structured reasoning behavior through the Opus-style distilled merge.
3. Performance
GRM-2.6-Opus is designed to be a highly capable 27B local AI model for complex reasoning, coding, everyday chat, and agentic workflows. It aims for strong performance relative to its size, making it an option for users who want capable reasoning without depending solely on much larger models.
Its core strength is practical intelligence: structured reasoning, strong task understanding, improved coding behavior, stable responses, and the ability to handle difficult problems across multiple domains.
Detailed Benchmarks
| Benchmark | GRM-2.6-Opus | GRM-2.6-Plus | Qwen3.6-27B | google/gemma-4-31B-it | GPT-5.4-Mini | Claude-4.5-Haiku |
|---|---|---|---|---|---|---|
| Knowledge & STEM | | | | | | |
| GPQA Diamond | 89.2 | 88.3 | 87.8 | 84.3 | 88.0 | 73.0 |
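
To make the GPQA Diamond row easier to read, the short sketch below computes the point margin between GRM-2.6-Opus and each other model in the table. The scores are copied from the row above; the dictionary name `gpqa_diamond` and the helper `margins` are illustrative and not part of any released tooling. It covers a single benchmark, so the margins should not be read as an overall ranking.

```python
# GPQA Diamond scores copied from the benchmark table above.
gpqa_diamond = {
    "GRM-2.6-Opus": 89.2,
    "GRM-2.6-Plus": 88.3,
    "Qwen3.6-27B": 87.8,
    "google/gemma-4-31B-it": 84.3,
    "GPT-5.4-Mini": 88.0,
    "Claude-4.5-Haiku": 73.0,
}


def margins(scores, reference):
    """Return the reference model's score minus each other model's score, in points."""
    ref = scores[reference]
    return {
        name: round(ref - score, 1)
        for name, score in scores.items()
        if name != reference
    }


# Hypothetical helper usage: margins of GRM-2.6-Opus over every other model.
deltas = margins(gpqa_diamond, "GRM-2.6-Opus")

# Print from the smallest margin to the largest.
for name, delta in sorted(deltas.items(), key=lambda kv: kv[1]):
    print(f"{name}: +{delta:.1f} points")
```

On this row, the smallest margin is over GRM-2.6-Plus and the largest is over Claude-4.5-Haiku.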
GRM-2.6-Opus is developed by OrionLLM and released under the Apache 2.0 License.