sd-inf committed on
Commit 533b04f · verified · 1 Parent(s): 866f2a0

Add model card

Files changed (1)
  1. README.md +5 -34
README.md CHANGED
@@ -1,39 +1,10 @@
  ---
- base_model: Qwen/Qwen3-4B
- datasets:
- - LucidityAI/Astral-Post-Training-Dataset
+ base_model: Qwen/Qwen3-14B
  tags:
- - code
- - chemistry
- - finance
- - biology
+ - merge
+ - lora
  ---
 
- # Astral-4B-Coder
+ # Astral-14B
 
- Astral 4B Coder is the medium sized model in the Astral coder family. It was fine-tuned from Astral 4b.
-
- > Note: Utilize no think for agentic tasks and think for hard non-agentic tasks
-
- As with usual Qwen3 models, reasoning can be toggled through the usage of ```/no_think``` or not.
-
-
- ### Example Prompt (ChatML Format (THINKING)):
-
- ```xml
- <|im_start|>user
- What is the capital of France?
- <|im_end|>
- <|im_start|>assistant
- <think>
- ```
-
- ### Example Prompt (ChatML Format (NON-THINKING)):
-
- ```xml
- <|im_start|>user
- What is the capital of France? /no_think
- <|im_end|>
- <|im_start|>assistant
- <think>
- ```
+ Merged LoRA model based on Qwen3-14B.