Orivus-Coder-v1-3B-Instruct GGUF

Orivus-Coder-v1 is a local code-generation model fine-tuned for the Orivus Edge runtime.

It is designed to execute structured software engineering tasks using real repository context, architectural constraints, and deterministic outputs.

Model Details

  • Base model: Qwen/Qwen2.5-Coder-3B-Instruct
  • Format: GGUF
  • Quantization: Q4_K_M
  • Prompt format: ChatML
  • Runtime: llama.cpp-compatible environments

Intended Use

This model is optimized for:

  • code generation
  • refactoring
  • implementation of structured tasks
  • execution of architecture-driven plans
  • local AI workflows inside Orivus Edge

This model is not intended to:

  • define architecture
  • perform risk analysis
  • decide migration strategy

Those responsibilities belong to Orivus-Architect.

Prompt Format (ChatML)

The model expects strict ChatML formatting:

<|im_start|>system
You are Orivus-Coder, a precise software engineering execution model. Follow the provided plan, respect architectural constraints, do not invent files or dependencies, and return implementation-ready output.<|im_end|>
<|im_start|>user
{user_prompt}<|im_end|>
<|im_start|>assistant

Runtime Configuration (llama.cpp)

{
  "n_ctx": 8192,
  "temperature": 0.2,
  "top_p": 0.95
}

Orivus Edge Integration

Orivus-Coder operates as the execution engine within Orivus Edge.

Flow:

  1. Orivus-Architect analyzes the system (OODA: Observe, Orient, Decide)
  2. A structured handoff is generated
  3. Orivus-Coder executes the implementation

This separation ensures:

  • deterministic execution
  • controlled changes
  • reduced hallucination
  • architecture consistency

Capabilities

  • follows structured instructions
  • respects file boundaries
  • avoids inventing dependencies
  • produces implementation-ready output
  • works with partial context (context-aware)

Limitations

  • requires correct prompt structure
  • depends on quality of handoff input
  • not optimized for open-ended chat
  • limited reasoning depth compared to larger models

Available Files

  • qwen2.5-coder-3b-instruct.Q4_K_M.gguf

Notes

This model is fine-tuned for local-first execution and integration into controlled engineering systems. It is designed to operate without cloud dependencies as part of the Orivus Edge runtime.

Downloads last month
246
GGUF
Model size
3B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for orivus/Orivus-Coder-v1-3B-Instruct-GGUF

Base model

Qwen/Qwen2.5-3B
Quantized
(98)
this model