You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

EdgeAI Docs Qwen2.5 Coder 7B Instruct (LoRA Adapter)

This repository contains a LoRA adapter (not full model weights) trained for an offline Edge AI + MCP documentation assistant workflow.

Base model:

  • Qwen/Qwen2.5-Coder-7B-Instruct

Intended use

  • Use this adapter with a local RAG pipeline.
  • Keep retrieval output as the factual source.
  • Use the adapter for response behavior: format, citation style, and grounded answering.

Training summary

  • Train examples: 115
  • Eval examples: 13
  • Max steps: 30
  • Precision/load strategy: QLoRA 4-bit (NF4), bf16 compute
  • Final eval loss: 0.0641
  • Device: cuda (8GB VRAM class local GPU profile)

Files

  • adapter_model.safetensors: trained LoRA adapter weights
  • adapter_config.json: PEFT adapter config
  • tokenizer.json, tokenizer_config.json, chat_template.jinja: tokenizer/chat formatting assets
  • run_summary.json, trainer_train_metrics.json, training_args.bin: training metadata/artifacts

Quick start

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig
from peft import PeftModel

base_model = "Qwen/Qwen2.5-Coder-7B-Instruct"
adapter_repo = "eoinedge/EdgeAI-Docs-Qwen2.5-Coder-7B-Instruct"

bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    base_model,
    quantization_config=bnb,
    device_map="auto",
)
model = PeftModel.from_pretrained(model, adapter_repo)
tokenizer = AutoTokenizer.from_pretrained(base_model)

Notes

  • This adapter is optimized for docs-assistant behavior, not as a standalone factual memory.
  • For best results, pair with MCP tools + document retrieval context.
Downloads last month
22
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for eoinedge/EdgeAI-Docs-Qwen2.5-Coder-7B-Instruct

Base model

Qwen/Qwen2.5-7B
Adapter
(378)
this model