Monibee-Fudgekins committed on
Commit
b2d5724
verified
1 Parent(s): 33a3e16

Update model card (smoke run, pass@1 0.00%)

Files changed (1)
  1. README.md +25 -25
README.md CHANGED
@@ -1,40 +1,40 @@
  ---
  license: gemma
- base_model: unsloth/gemma-2-9b-it-bnb-4bit
  datasets:
- - sahil2801/CodeAlpaca-20k
  library_name: transformers
  pipeline_tag: text-generation
  tags:
- - code
- - coding-assistant
- - qlora
- - unsloth
  model-index:
- - name: gemma-coder-dev
- results:
- - task:
- type: text-generation
- name: Code generation
- dataset:
- name: remote-agent-dev-platform coding_eval
- type: code-eval
- metrics:
- - type: pass@1
- value: 0.0
- name: pass@1 (Python/JS/React/Go/Java/Swift)
  ---

  # gemma-coder-dev

- Coding-focused fine-tune of [`unsloth/gemma-2-9b-it-bnb-4bit`](https://huggingface.co/unsloth/gemma-2-9b-it-bnb-4bit),
- produced automatically by the weekly retrain pipeline in
- [remote-agent-dev-platform](https://github.com/Monibee-Fudgekins/remote-agent-dev-platform). **Last updated: 2026-06-16 17:51 UTC** · run mode: `smoke` ·
- promoted: **False**.

  ## Model description
- QLoRA fine-tune of unsloth/gemma-2-9b-it-bnb-4bit specialized for coding assistance. It is the default
- agent model for the remote-agent-dev-platform (served via vLLM on Modal).

  ## Intended uses & limitations
  - **Intended:** code generation and assistance in Python, JavaScript/React, Go, Java,
@@ -49,7 +49,7 @@ agent model for the remote-agent-dev-platform (served via vLLM on Modal).
 
  ## Training procedure
  - Method: QLoRA (Unsloth), 4-bit base, LoRA r=16 / alpha=32, lr 2e-4 (cosine),
- max steps 10, max seq len 2048, optimizer adamw_8bit.
  - Hardware: Kaggle free GPU (weekly GitHub Actions run).

  ## Evaluation
 
  ---
  license: gemma
+ base_model: google/gemma-4-26B-A4B-it
  datasets:
+ - sahil2801/CodeAlpaca-20k
  library_name: transformers
  pipeline_tag: text-generation
  tags:
+ - code
+ - coding-assistant
+ - qlora
+ - unsloth
  model-index:
+ - name: gemma-coder-dev
+ results:
+ - task:
+ type: text-generation
+ name: Code generation
+ dataset:
+ type: code-eval
+ name: remote-agent-dev-platform coding_eval
+ metrics:
+ - type: pass@1
+ value: 0.0000
+ name: pass@1 (Python/JS/React/Go/Java/Swift)
  ---

  # gemma-coder-dev

+ Coding-focused fine-tune of [`google/gemma-4-26B-A4B-it`](https://huggingface.co/google/gemma-4-26B-A4B-it)
+ (**Gemma 4 26B A4B**, an MoE with ~4B active params), produced automatically by the
+ weekly retrain pipeline in [remote-agent-dev-platform](https://github.com/Monibee-Fudgekins/remote-agent-dev-platform).
+ **Last updated: 2026-06-16 21:07 UTC** · run mode: `smoke` · promoted: **False**.

  ## Model description
+ QLoRA fine-tune of google/gemma-4-26B-A4B-it specialized for coding assistance. It is the
+ default agent model for the remote-agent-dev-platform (served via vLLM on Modal).

  ## Intended uses & limitations
  - **Intended:** code generation and assistance in Python, JavaScript/React, Go, Java,
 
 
  ## Training procedure
  - Method: QLoRA (Unsloth), 4-bit base, LoRA r=16 / alpha=32, lr 2e-4 (cosine),
+ max steps 10, max seq len 1024, optimizer adamw_8bit.
  - Hardware: Kaggle free GPU (weekly GitHub Actions run).
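
The hyperparameters in the training-procedure bullets can be written down as a small config, which also makes the implied LoRA scaling and learning-rate schedule easy to inspect. The following is a minimal sketch using only the standard library, not the pipeline's actual training code: `SmokeRunConfig` and `cosine_lr` are hypothetical names, the scaling assumes standard (not rank-stabilized) LoRA, and the schedule assumes no warmup and decay to zero, none of which the card confirms.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class SmokeRunConfig:
    """Hypothetical container for the values listed in the card."""
    lora_r: int = 16
    lora_alpha: int = 32
    learning_rate: float = 2e-4
    max_steps: int = 10
    max_seq_len: int = 1024
    optimizer: str = "adamw_8bit"

    @property
    def lora_scale(self) -> float:
        # Standard LoRA multiplies the low-rank update by alpha / r.
        # Assumption: rank-stabilized scaling (alpha / sqrt(r)) is not used.
        return self.lora_alpha / self.lora_r


def cosine_lr(step: int, cfg: SmokeRunConfig) -> float:
    """Learning rate at `step` under a plain cosine decay.

    Assumptions: no warmup, decay from the peak rate to zero over max_steps.
    """
    clamped = min(max(step, 0), cfg.max_steps)
    progress = clamped / cfg.max_steps
    return 0.5 * cfg.learning_rate * (1.0 + math.cos(math.pi * progress))


cfg = SmokeRunConfig()
# One value per optimizer step boundary, from step 0 through step max_steps.
schedule = [cosine_lr(step, cfg) for step in range(cfg.max_steps + 1)]
```

With only 10 steps, the learning rate falls from its peak to zero almost immediately, which is consistent with a smoke run that exercises the pipeline rather than training the model in any meaningful way.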
 
  ## Evaluation
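
For context on the pass@1 value reported in the model-index metadata, the sketch below shows the commonly used definition of the metric. It is an illustration only: the evaluation harness in remote-agent-dev-platform is not shown in this commit, so `pass_at_k` and `mean_pass_at_1` are hypothetical helpers, and the inputs in the usage note are made up rather than taken from the real run.

```python
from math import comb
from typing import Sequence


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: samples generated, c: samples that passed the tests, k: budget.
    """
    if n < 1 or not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require n >= 1, 0 <= c <= n and 1 <= k <= n")
    # Probability that a random size-k subset contains no passing sample,
    # subtracted from 1. comb() returns 0 when k exceeds n - c.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_1(passed: Sequence[bool]) -> float:
    """pass@1 with a single sample per problem: the fraction solved."""
    if not passed:
        raise ValueError("need at least one problem")
    return sum(pass_at_k(1, int(ok), 1) for ok in passed) / len(passed)
```

Under this definition, `mean_pass_at_1([False, False, False])` is `0.0` and `mean_pass_at_1([True, False, False, True])` is `0.5`. A reported pass@1 of 0.0 therefore means that no evaluated problem was solved on the first attempt.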