modrill commited on
Commit
fa42872
·
verified ·
1 Parent(s): 944a1b0

Add model card

Browse files
Files changed (1) hide show
  1. README.md +10 -56
README.md CHANGED
@@ -1,62 +1,16 @@
1
  ---
2
- base_model: Qwen/Qwen3-4B-Base
3
- library_name: peft
4
- model_name: qwen3-4b-base-restart-lingcoder-shortcot-4k-lora
5
  tags:
6
- - base_model:adapter:Qwen/Qwen3-4B-Base
7
- - lora
8
- - sft
9
- - transformers
10
- - trl
11
- licence: license
12
- pipeline_tag: text-generation
13
  ---
14
 
15
- # Model Card for qwen3-4b-base-restart-lingcoder-shortcot-4k-lora
16
 
17
- This model is a fine-tuned version of [Qwen/Qwen3-4B-Base](https://huggingface.co/Qwen/Qwen3-4B-Base).
18
- It has been trained using [TRL](https://github.com/huggingface/trl).
19
 
20
- ## Quick start
21
-
22
- ```python
23
- from transformers import pipeline
24
-
25
- question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
26
- generator = pipeline("text-generation", model="None", device="cuda")
27
- output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
28
- print(output["generated_text"])
29
- ```
30
-
31
- ## Training procedure
32
-
33
-
34
-
35
-
36
-
37
- This model was trained with SFT.
38
-
39
- ### Framework versions
40
-
41
- - PEFT 0.19.1
42
- - TRL: 1.5.0.dev0
43
- - Transformers: 5.8.0
44
- - Pytorch: 2.11.0
45
- - Datasets: 4.8.5
46
- - Tokenizers: 0.22.2
47
-
48
- ## Citations
49
-
50
-
51
-
52
- Cite TRL as:
53
-
54
- ```bibtex
55
- @software{vonwerra2020trl,
56
- title = {{TRL: Transformers Reinforcement Learning}},
57
- author = {von Werra, Leandro and Belkada, Younes and Tunstall, Lewis and Beeching, Edward and Thrush, Tristan and Lambert, Nathan and Huang, Shengyi and Rasul, Kashif and Gallouédec, Quentin},
58
- license = {Apache-2.0},
59
- url = {https://github.com/huggingface/trl},
60
- year = {2020}
61
- }
62
- ```
 
1
  ---
2
+ license: cc-by-nc-4.0
 
 
3
  tags:
4
+ - qwen3
5
+ - mhm
6
+ - text-generation
7
+ library_name: transformers
 
 
 
8
  ---
9
 
10
+ # lingcoder_shortcot_4k_lora_qwen3_4b_base
11
 
12
+ Auto-uploaded from local training output.
 
13
 
14
+ - Source path: `trl/models/qwen3-4b-base-restart-lingcoder-shortcot-4k-lora`
15
+ - Uploaded at: `2026-05-20T05:46:40.204294`
16
+ - Visibility: `public`