CyberNative-AI/Qwen36_27B_CyberNative_20260529_8bitLoRA64_32k

Fine-tuned from Qwen/Qwen3.6-27B on CyberNative text-only ShareGPT-style data.

Training settings:

  • Date: 20260529
  • Context: 32768 tokens
  • Base loading during training: 8-bit
  • LoRA: r=128, alpha=128, dropout=0.05
  • Epochs: 1
  • Learning rate: 0.0002
  • Warmup ratio: 0.05
  • Weight decay: 0.01
  • Effective batch size: 4
  • Train-on-assistant-only labels: True
  • Assistant reasoning: preserved from thinking / reasoning fields as <think>...</think> when present
  • MTP heads: not custom-trained; base MTP/speculative decoding capability preserved for serving.
  • Dataset: already processed combo.jsonl / CyberNative ShareGPT-style JSONL

Recommended serving note:

  • Enable Qwen MTP/speculative decoding in vLLM/SGLang at inference time if your serving stack supports it.
Downloads last month
24
Safetensors
Model size
27B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for CyberNative-AI/Qwen36_27B_CyberNative_20260529_8bitLoRA64_32k

Base model

Qwen/Qwen3.6-27B
Adapter
(129)
this model