kramster commited on
Commit
3143b60
·
verified ·
1 Parent(s): c524803

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +68 -0
README.md ADDED
@@ -0,0 +1,68 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - mistral
7
+ - alpaca
8
+ - fine-tuning
9
+ - code
10
+ - crud
11
+ - sft
12
+ - vllm
13
+ datasets:
14
+ - kramster/crud-code-tests
15
+ base_model: mistralai/Mistral-7B-Instruct-v0.2
16
+ ---
17
+
18
+ # 🧠 Evolve Mistral: Fine-Tuned Mistral-7B-Instruct on CRUD Coding Tasks
19
+
20
+ This model is a fine-tuned version of [`mistralai/Mistral-7B-Instruct-v0.2`](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2), adapted for reasoning about structured CRUD-based code inputs and instruction-following tasks. It was trained on a dataset in [Alpaca](https://github.com/tatsu-lab/stanford_alpaca)-style format, using supervised fine-tuning (SFT).
21
+
22
+ ---
23
+
24
+ ## 📂 Dataset
25
+
26
+ The model was trained on:
27
+
28
+ **[`kramster/crud-code-tests`](https://huggingface.co/datasets/kramster/crud-code-tests)**
29
+ A dataset of instruction-based code snippets focusing on Create, Read, Update, and Delete operations in various programming contexts. It uses the Alpaca-style JSON format with fields: `instruction`, `input`, and `output`.
30
+
31
+ ---
32
+
33
+ ## 🏗️ Training Setup
34
+
35
+ | Detail | Value |
36
+ |---------------------|-------|
37
+ | Base model | `mistralai/Mistral-7B-Instruct-v0.2` |
38
+ | LoRA Config | r=32, alpha=16 |
39
+ | Framework | Axolotl + DeepSpeed + LoRA |
40
+ | Training Steps | 51 |
41
+ | Epochs | ~3.94 |
42
+ | Mixed Precision | bfloat16 |
43
+ | GPU | NVIDIA H100 80GB |
44
+ | Training Duration | 10m 26s |
45
+ | Final Train Loss | 0.0909 |
46
+ | Final Eval Loss | 0.1012 |
47
+ | FLOPs used | 347.6 trillion |
48
+
49
+
50
+ ---
51
+
52
+ ## 🧪 Evaluation Summary
53
+
54
+ - **Eval runtime:** 2.84s
55
+ - **Eval samples/sec:** 2.11
56
+ - **Eval steps/sec:** 1.05
57
+ - **Gradient norm (final):** 0.064
58
+ - **Final LR:** 2.93e-7
59
+
60
+ ---
61
+
62
+ ## 🧠 Example Usage
63
+
64
+ vllm-api-server \
65
+ --model kramster/evolve-mistral \
66
+ --max-model-len 64000 \
67
+ --rope-scaling '{"rope_type":"yarn","factor":4.0,"original_max_position_embeddings":32768}' \
68
+ --no-enable-prefix-caching