picard47at committed (verified)
Commit 3323898 · 1 Parent(s): 8667cec

Training in progress, step 100

README.md CHANGED
@@ -1,7 +1,7 @@
  ---
- base_model: unsloth/qwen3-0.6b-unsloth-bnb-4bit
+ base_model: unsloth/qwen3-1.7b-unsloth-bnb-4bit
  library_name: transformers
- model_name: punctuation_128
+ model_name: punctuation_512
  tags:
  - generated_from_trainer
  - unsloth
@@ -10,9 +10,9 @@ tags:
  licence: license
  ---

- # Model Card for punctuation_128
+ # Model Card for punctuation_512

- This model is a fine-tuned version of [unsloth/qwen3-0.6b-unsloth-bnb-4bit](https://huggingface.co/unsloth/qwen3-0.6b-unsloth-bnb-4bit).
+ This model is a fine-tuned version of [unsloth/qwen3-1.7b-unsloth-bnb-4bit](https://huggingface.co/unsloth/qwen3-1.7b-unsloth-bnb-4bit).
  It has been trained using [TRL](https://github.com/huggingface/trl).

  ## Quick start
@@ -21,14 +21,14 @@ It has been trained using [TRL](https://github.com/huggingface/trl).
  from transformers import pipeline

  question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
- generator = pipeline("text-generation", model="picard47at/punctuation_128", device="cuda")
+ generator = pipeline("text-generation", model="picard47at/punctuation_512", device="cuda")
  output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
  print(output["generated_text"])
  ```

  ## Training procedure

- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/picardtseng-pesi/punctuation/runs/u1163x5c)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/picardtseng-pesi/punctuation/runs/gghra5tk)


  This model was trained with SFT.
@@ -39,7 +39,7 @@ This model was trained with SFT.
  - Transformers: 4.51.3
  - Pytorch: 2.7.0
  - Datasets: 3.6.0
- - Tokenizers: 0.21.0
+ - Tokenizers: 0.21.1

  ## Citations
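The README states the model was trained with SFT using TRL on an Unsloth 4-bit base, and this commit was pushed mid-training ("Training in progress, step 100"). Below is a rough, hypothetical sketch of what such a run might look like; the dataset file, sequence length, LoRA rank, batch size, and save interval are assumptions or inferences, not values recorded in this repo.

```python
# Hypothetical sketch of the SFT run described in the card: Unsloth 4-bit base + TRL SFTTrainer.
# Dataset file, max_seq_length, LoRA rank, and batch size are assumptions, not values from this commit.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/qwen3-1.7b-unsloth-bnb-4bit",  # base_model from the README diff
    max_seq_length=512,  # assumption; the "punctuation_512" name suggests a 512-token window
    load_in_4bit=True,
)
model = FastLanguageModel.get_peft_model(
    model,
    r=16,  # assumption; the actual rank is recorded in adapter_config.json
    target_modules=["q_proj", "v_proj"],  # matches adapter_config.json in this commit
)

dataset = load_dataset("json", data_files="punctuation_sft.jsonl", split="train")  # hypothetical dataset

trainer = SFTTrainer(
    model=model,
    processing_class=tokenizer,
    train_dataset=dataset,
    args=SFTConfig(
        output_dir="punctuation_512",
        per_device_train_batch_size=8,  # assumption
        save_steps=100,                 # the commit message shows a push at step 100
        push_to_hub=True,
        hub_model_id="picard47at/punctuation_512",
        report_to="wandb",              # matches the W&B badge in the README
    ),
)
trainer.train()
```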
adapter_config.json CHANGED
@@ -24,8 +24,8 @@
  "rank_pattern": {},
  "revision": null,
  "target_modules": [
- "q_proj",
- "v_proj"
+ "v_proj",
+ "q_proj"
  ],
  "task_type": "CAUSAL_LM",
  "trainable_token_indices": null,
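The only change here is the ordering of the two entries in target_modules; the adapter still wraps q_proj and v_proj. If you want to attach the adapter to the base model yourself rather than loading the repo through pipeline() as in the Quick start, a minimal sketch with PEFT might look like this (device and dtype choices are assumptions):

```python
# Hypothetical: load the base model and attach the LoRA adapter from this repo with PEFT.
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained(
    "unsloth/qwen3-1.7b-unsloth-bnb-4bit",  # base_model from the README diff
    device_map="auto",
    torch_dtype=torch.bfloat16,  # assumption; pick what your hardware supports
)
tokenizer = AutoTokenizer.from_pretrained("picard47at/punctuation_512")

# PeftModel reads adapter_config.json and adapter_model.safetensors from the repo.
model = PeftModel.from_pretrained(base, "picard47at/punctuation_512")
model.eval()
```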
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1e147ba60049109b0f9031d1807461a6d2060239ff4c192104f14b46a1de70a3
+ oid sha256:5a32446a7acfc33ccd63119008b43f052d18bac0c83f5fe79cc4b60873353f11
  size 9189904
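adapter_model.safetensors is tracked with Git LFS, so the diff only shows the pointer file: the sha256 object id changed while the size stayed at 9,189,904 bytes. A small sketch for checking that a downloaded copy matches a pointer like the one above (repo id and filename are taken from this page; by default this fetches the latest revision, so pin revision= to a specific commit if you need this exact file):

```python
# Verify a downloaded LFS file against the sha256 oid shown in the pointer above.
import hashlib
from huggingface_hub import hf_hub_download

path = hf_hub_download("picard47at/punctuation_512", "adapter_model.safetensors")

sha256 = hashlib.sha256()
with open(path, "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):
        sha256.update(chunk)

# For the file in this commit, this should print the new oid (5a32446a7acf...).
print(sha256.hexdigest())
```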
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d6856b5b1ca5318169bd21f8ff7c0c0c9cf7021de2d4e6fa2e71bf644a16bbb1
+ oid sha256:84aff1c1c0f8723aad093d37ba912fc737efb2d4c96545539999f474102470b6
  size 6033
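training_args.bin is also an LFS pointer; the underlying file is a pickled TrainingArguments/SFTConfig object rather than readable text. A sketch for inspecting it, assuming you trust the repo (unpickling runs arbitrary code, and PyTorch 2.7 only loads it if weights_only=False is passed explicitly):

```python
# Inspect the serialized training arguments stored with this checkpoint.
# Note: this unpickles arbitrary objects; only run it on repos you trust.
import torch
from huggingface_hub import hf_hub_download

path = hf_hub_download("picard47at/punctuation_512", "training_args.bin")
args = torch.load(path, weights_only=False)

print(type(args).__name__)  # e.g. SFTConfig / TrainingArguments
print(args.learning_rate, args.per_device_train_batch_size, args.max_steps)
```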