Text Generation
PEFT
Safetensors
English
lora
qwen2.5
qwen2.5-coder
code
reasoning
pedagogy
fine-tuned
conversational
Instructions to use mechramc/codek-qwen2.5-coder-7b-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use mechramc/codek-qwen2.5-coder-7b-lora with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("unsloth/Qwen2.5-Coder-7B-Instruct") model = PeftModel.from_pretrained(base_model, "mechramc/codek-qwen2.5-coder-7b-lora") - Notebooks
- Google Colab
- Kaggle
| base_model: Qwen/Qwen2.5-Coder-7B-Instruct | |
| library_name: peft | |
| pipeline_tag: text-generation | |
| tags: | |
| - lora | |
| - peft | |
| - qwen2.5 | |
| - qwen2.5-coder | |
| - code | |
| - reasoning | |
| - pedagogy | |
| - fine-tuned | |
| language: | |
| - en | |
| license: apache-2.0 | |
| datasets: | |
| - mechramc/codek-v1 | |
| # CodeK LoRA v0 -- Qwen2.5-Coder-7B-Instruct | |
| A LoRA adapter fine-tuned on the **CodeK v1** dataset: a reasoning-first, pedagogical | |
| coding dataset. Teaches decomposition, bug diagnosis, contrast reasoning, and | |
| hypothesis-driven thinking about code. | |
| ## v0 Eval Results (Pass 2 ground-truth, 50 seeds) | |
| | Model | Pass@1 | | |
| |-------|--------| | |
| | Base (Qwen2.5-Coder-7B-Instruct) | 64% | | |
| | **LoRA checkpoint-800** | **58%** | | |
| 6% regression on bug diagnosis. LoRA wins on 2/50 seeds (more direct, correct), | |
| base wins on 5/50 (LoRA misidentifies function or pattern-matches to training data). | |
| See `BASELINE_V0.md` in the dataset repo for full analysis. | |
| ## Training | |
| | Setting | Value | | |
| |---------|-------| | |
| | Base model | `Qwen/Qwen2.5-Coder-7B-Instruct` | | |
| | Method | LoRA (RS-LoRA) | | |
| | Rank / Alpha | 16 / 32 | | |
| | Dropout | 0.05 | | |
| | Epochs | 3 | | |
| | Batch (effective) | 8 | | |
| | Learning rate | 2e-4 | | |
| | Train pairs | 2,351 | | |
| | Best eval loss | 0.0583 (step 528) | | |
| | Checkpoint used | checkpoint-800 (eval loss 0.061) | | |
| | Hardware | RunPod A100 80GB, 59 min | | |
| ## Dataset | |
| [mechramc/codek-v1](https://huggingface.co/datasets/mechramc/codek-v1) -- | |
| 201 seeds, 4 augmentation passes, 2,613 ShareGPT pairs. | |
| ## Usage | |
| ```python | |
| from peft import PeftModel | |
| from transformers import AutoModelForCausalLM, AutoTokenizer | |
| base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-Coder-7B-Instruct") | |
| model = PeftModel.from_pretrained(base, "mechramc/codek-qwen2.5-coder-7b-lora") | |
| tokenizer = AutoTokenizer.from_pretrained("mechramc/codek-qwen2.5-coder-7b-lora") | |
| ``` | |