Monibee-Fudgekins commited on
Commit
bf4ec71
·
verified ·
1 Parent(s): 1f949a2

Update model card (smoke run, pass@1 0.00%)

Browse files
Files changed (1) hide show
  1. README.md +72 -14
README.md CHANGED
@@ -1,22 +1,80 @@
1
  ---
 
2
  base_model: unsloth/gemma-2-9b-it-bnb-4bit
 
 
 
 
3
  tags:
4
- - text-generation-inference
5
- - transformers
6
- - unsloth
7
- - gemma2
8
- - trl
9
- license: apache-2.0
10
- language:
11
- - en
 
 
 
 
 
 
 
 
 
12
  ---
13
 
14
- # Uploaded model
15
 
16
- - **Developed by:** Monibee-Fudgekins
17
- - **License:** apache-2.0
18
- - **Finetuned from model :** unsloth/gemma-2-9b-it-bnb-4bit
 
19
 
20
- This gemma2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth)
 
 
21
 
22
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: gemma
3
  base_model: unsloth/gemma-2-9b-it-bnb-4bit
4
+ datasets:
5
+ - sahil2801/CodeAlpaca-20k
6
+ library_name: transformers
7
+ pipeline_tag: text-generation
8
  tags:
9
+ - code
10
+ - coding-assistant
11
+ - qlora
12
+ - unsloth
13
+ model-index:
14
+ - name: gemma-coder-dev
15
+ results:
16
+ - task:
17
+ type: text-generation
18
+ name: Code generation
19
+ dataset:
20
+ type: code-eval
21
+ name: remote-agent-dev-platform coding_eval
22
+ metrics:
23
+ - type: pass@1
24
+ value: 0.0000
25
+ name: pass@1 (Python/JS/React/Go/Java/Swift)
26
  ---
27
 
28
+ # gemma-coder-dev
29
 
30
+ Coding-focused fine-tune of [`unsloth/gemma-2-9b-it-bnb-4bit`](https://huggingface.co/unsloth/gemma-2-9b-it-bnb-4bit),
31
+ produced automatically by the weekly retrain pipeline in
32
+ [remote-agent-dev-platform](https://github.com/Monibee-Fudgekins/remote-agent-dev-platform). **Last updated: 2026-06-16 17:36 UTC** · run mode: `smoke` ·
33
+ promoted: **False**.
34
 
35
+ ## Model description
36
+ QLoRA fine-tune of unsloth/gemma-2-9b-it-bnb-4bit specialized for coding assistance. It is the default
37
+ agent model for the remote-agent-dev-platform (served via vLLM on Modal).
38
 
39
+ ## Intended uses & limitations
40
+ - **Intended:** code generation and assistance in Python, JavaScript/React, Go, Java,
41
+ and Swift, inside a sandboxed agent that runs/tests the output.
42
+ - **Not intended:** safety-critical use, or running generated code unreviewed.
43
+ - **Limitations:** a small, free-tier-trained model — it can produce incorrect or
44
+ insecure code. Always review and test. Quality tracks the training data, which is
45
+ still being built out.
46
+
47
+ ## Training data
48
+ - Dataset: [`sahil2801/CodeAlpaca-20k`](https://huggingface.co/datasets/sahil2801/CodeAlpaca-20k)
49
+
50
+ ## Training procedure
51
+ - Method: QLoRA (Unsloth), 4-bit base, LoRA r=16 / alpha=32, lr 2e-4 (cosine),
52
+ max steps 10, max seq len 2048, optimizer adamw_8bit.
53
+ - Hardware: Kaggle free GPU (weekly GitHub Actions run).
54
+
55
+ ## Evaluation
56
+ Sandboxed multi-language **pass@1** harness (`finetune/evaluate.py`): the model
57
+ completes functions that are then compiled/run against unit tests. Languages whose
58
+ toolchain is unavailable are skipped.
59
+
60
+ **Overall pass@1: 0.00%** over 3 executed problems
61
+ (0 skipped). Promotion threshold: 46%.
62
+
63
+ | language | passed / run | pass@1 |
64
+ |---|---|---|
65
+ | python | 0/3 | 0.00% |
66
+
67
+ ## How to use
68
+ ```python
69
+ from transformers import AutoModelForCausalLM, AutoTokenizer
70
+ tok = AutoTokenizer.from_pretrained("Monibee-Fudgekins/gemma-coder-dev")
71
+ model = AutoModelForCausalLM.from_pretrained("Monibee-Fudgekins/gemma-coder-dev", device_map="auto")
72
+ msgs = [{"role": "user", "content": "Write a Python function that reverses a string."}]
73
+ ids = tok.apply_chat_template(msgs, add_generation_prompt=True, return_tensors="pt").to(model.device)
74
+ print(tok.decode(model.generate(ids, max_new_tokens=256)[0]))
75
+ ```
76
+
77
+
78
+ ## Provenance
79
+ Generated by `finetune/kaggle/run.py` in [https://github.com/Monibee-Fudgekins/remote-agent-dev-platform](https://github.com/Monibee-Fudgekins/remote-agent-dev-platform); see that repo for
80
+ the full training + eval pipeline.