josephmayo committed on
Commit
a6515dc
·
verified ·
1 Parent(s): cea398a

Update model card

Files changed (1)
  1. README.md +58 -6
README.md CHANGED
@@ -1,6 +1,58 @@
1
- # Holo-3.1-4B-Coding-Repair38
2
-
3
- Merged coding SFT model from `Hcompany/Holo-3.1-4B` and adapter `josephmayo/Holo-3.1-4B-Coding-Repair38-LoRA`.
4
-
5
- This release is published after an explicit override of the original gate. The held-out
6
- HumanEval+MBPP probe improved from 24/80 to 31/80, a 29.17% relative lift.
1
+ ---
2
+ base_model: Hcompany/Holo-3.1-4B
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - coding
7
+ - merged
8
+ - safetensors
9
+ - python
10
+ ---
11
+
12
+ # Holo-3.1-4B Coding Merged Model
13
+
14
+ ## Overview
15
+
16
+ This repository contains a merged Transformers checkpoint produced from `Hcompany/Holo-3.1-4B` and the companion coding LoRA adapter. It is intended for users who prefer loading a standard merged model rather than applying a PEFT adapter at runtime.
17
+
18
+ ## What Is Included
19
+
20
+ - Merged model weights in sharded `safetensors` format.
21
+ - Model configuration and generation configuration.
22
+ - Tokenizer and chat template files.
23
+ - A model card summarizing the measured coding adaptation result.
24
+
25
+ ## Training And Evaluation Summary
26
+
27
+ The underlying adapter was trained with supervised fine-tuning on curated coding instruction data. Evaluation used an 80-task held-out greedy decoding probe drawn from HumanEval-style and MBPP-style tasks.
28
+
29
+ Measured result on the held-out probe:
30
+
31
+ - Base model: 24 / 80 tasks passed.
32
+ - Adapted model: 31 / 80 tasks passed.
33
+ - Relative lift over the measured base result: 29.17%.
34
+
35
+ The merged model should match the adapter-applied behavior, subject to normal numerical and runtime differences.
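
The relative lift quoted in the card follows directly from the two pass counts. A minimal arithmetic sketch is shown here; the function names are illustrative and are not taken from the repository's evaluation code.

```python
def pass_rate(passed: int, total: int) -> float:
    """Percentage of probe tasks passed."""
    return 100 * passed / total


def relative_lift(base_passed: int, adapted_passed: int) -> float:
    """Percentage improvement of the adapted pass count over the base pass count."""
    if base_passed <= 0:
        raise ValueError("base_passed must be positive")
    return 100 * (adapted_passed - base_passed) / base_passed


base, adapted, total = 24, 31, 80  # counts reported in the model card
print(f"base pass rate:    {pass_rate(base, total):.2f}%")        # 30.00%
print(f"adapted pass rate: {pass_rate(adapted, total):.2f}%")     # 38.75%
print(f"relative lift:     {relative_lift(base, adapted):.2f}%")  # 29.17%
```

In absolute terms the gain is 7 tasks, or 8.75 percentage points, on an 80-task probe.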
36
+
37
+ ## Intended Use
38
+
39
+ Use this checkpoint for coding assistance experiments, Python function generation, lightweight algorithmic problem solving, and local inference workflows that expect standard Transformers model files.
40
+
41
+ ## Known Limitations
42
+
43
+ - The evaluation probe is small and does not cover all programming languages or repository-scale workflows.
44
+ - The model can produce incorrect code, incomplete reasoning, or solutions that fail edge cases.
45
+ - Generated code should be reviewed, tested, and sandboxed where appropriate.
46
+ - The checkpoint inherits limitations and licensing terms from the base model and adaptation data sources.
47
+
48
+ ## File List
49
+
50
+ - `model-00001-of-00009.safetensors` through `model-00009-of-00009.safetensors`: merged model shards.
51
+ - `model.safetensors.index.json`: shard index.
52
+ - `config.json`, `generation_config.json`: model configuration files.
53
+ - `tokenizer.json`, `tokenizer_config.json`, `chat_template.jinja`: tokenizer/chat assets.
54
+ - `README.md`: this model card.
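
One way to confirm that a local download matches the file list is to compare the shard names referenced by `model.safetensors.index.json` against the files on disk. The sketch below assumes the usual sharded-checkpoint index layout, a JSON object with a `weight_map` that maps tensor names to shard filenames; the helper name `missing_shards` and the directory name in the usage comment are hypothetical.

```python
import json
from pathlib import Path


def missing_shards(checkpoint_dir: str) -> list[str]:
    """Return shard filenames listed in the index that are absent on disk."""
    root = Path(checkpoint_dir)
    index = json.loads((root / "model.safetensors.index.json").read_text())
    # Each tensor name maps to the shard file that stores it; deduplicate the shards.
    shards = sorted(set(index["weight_map"].values()))
    return [name for name in shards if not (root / name).is_file()]


# Usage (hypothetical local path):
#   missing = missing_shards("./holo-coding-merged")
#   if missing:
#       print("incomplete download, missing:", missing)
```

An empty result only shows that every referenced shard file exists; it does not verify file contents or checksums.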
55
+
56
+ ## Reproducibility And Provenance
57
+
58
+ The model was produced by merging a PEFT LoRA coding adapter into `Hcompany/Holo-3.1-4B` and saving the result as sharded `safetensors`. Companion evaluation and training provenance artifacts are available in the LoRA repository.
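
The merge step described in the card can be sketched with the public `transformers` and `peft` APIs. This is an illustration under stated assumptions, not the author's actual script: it assumes the base model loads through `AutoModelForCausalLM`, that the adapter is the `josephmayo/Holo-3.1-4B-Coding-Repair38-LoRA` repository named in the previous README, and that default dtype and shard-size settings are acceptable. The function name and the output directory in the usage comment are hypothetical.

```python
def merge_lora(base_id: str, adapter_id: str, out_dir: str) -> None:
    """Merge a PEFT LoRA adapter into its base model and save safetensors shards."""
    # Heavy dependencies are imported lazily so this module imports without them.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    base = AutoModelForCausalLM.from_pretrained(base_id)
    # Attach the adapter, then fold the LoRA weights into the base weights.
    merged = PeftModel.from_pretrained(base, adapter_id).merge_and_unload()
    # safe_serialization=True writes .safetensors files (sharded when large).
    merged.save_pretrained(out_dir, safe_serialization=True)
    # Save the tokenizer alongside so the output directory is self-contained.
    AutoTokenizer.from_pretrained(base_id).save_pretrained(out_dir)


# Usage (downloads both repositories; the output path is hypothetical):
#   merge_lora(
#       "Hcompany/Holo-3.1-4B",
#       "josephmayo/Holo-3.1-4B-Coding-Repair38-LoRA",
#       "./holo-coding-merged",
#   )
```

The number and size of the resulting shards depend on the library's default `max_shard_size`, so a rerun may not reproduce the nine-shard layout listed in the card.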