passagereptile455 commited on
Commit
1ded106
·
verified ·
1 Parent(s): 5ac90e6

Update README for cleaned repo

Browse files
Files changed (1) hide show
  1. README.md +13 -11
README.md CHANGED
@@ -1,28 +1,30 @@
1
  ---
2
  tags:
3
- - training-scripts
4
- - fine-tuning
5
- - lora
 
6
  license: apache-2.0
7
  ---
8
 
9
  # Training Scripts
10
 
11
- Reusable training scripts for HF Jobs.
12
 
13
- ## Scripts
14
 
15
- | Script | Purpose |
16
- |--------|---------|
17
- | `train_sft_demo.py` | SFT on Capybara dataset |
18
- | `train_concise.py` | SFT on concise-tech-explanations |
19
 
20
  ## Usage
21
 
22
  ```bash
23
- hf jobs uv run --flavor t4-small --timeout 30m --secrets HF_TOKEN "https://huggingface.co/passagereptile455/training-scripts/resolve/main/train_sft_demo.py"
24
  ```
25
 
 
 
 
 
26
  ## Notes
27
 
28
- Scripts use PEP 723 inline dependencies no local setup required.
 
1
  ---
2
  tags:
3
+ - fine-tuning
4
+ - lora
5
+ - humaneval
6
+ - qwen3
7
  license: apache-2.0
8
  ---
9
 
10
  # Training Scripts
11
 
12
+ Fine-tuning script for reproducing the HuggingFace fine-tuning challenge.
13
 
14
+ ## Script
15
 
16
+ **train_humaneval_clean.py** - Fine-tune Qwen3-0.6B to beat base on HumanEval.
 
 
 
17
 
18
  ## Usage
19
 
20
  ```bash
21
+ hf jobs uv run --flavor a10g-small --timeout 14400 --secrets HF_TOKEN "https://huggingface.co/passagereptile455/training-scripts/resolve/main/train_humaneval_clean.py"
22
  ```
23
 
24
+ ## Results
25
+
26
+ See [qwen3-humaneval-sft](https://huggingface.co/passagereptile455/qwen3-humaneval-sft) for the trained model and full results.
27
+
28
  ## Notes
29
 
30
+ Script uses PEP 723 inline dependencies - no local setup required.