prithivMLmods committed on
Commit
eddac19
·
verified ·
1 Parent(s): d8d9da0

Update README.md

  1. README.md +39 -1
README.md CHANGED
@@ -2,4 +2,42 @@
2
  license: apache-2.0
3
  tags:
4
  - 'Thinking: Disabled'
5
- ---
5
+ - text-generation-inference
6
+ language:
7
+ - en
8
+ base_model:
9
+ - prithivMLmods/Omega-Qwen2.5-Coder-3B
10
+ pipeline_tag: text-generation
11
+ library_name: transformers
12
+ ---
13
+
14
+ # **Omega-Qwen2.5-Coder-3B-GGUF**
15
+
16
+ > Omega-Qwen2.5-Coder-3B is a compact, efficient code-focused model fine-tuned from Qwen2.5-Coder-3B-Instruct on the symbol-rich Open-Omega-Forge-1M dataset. Designed for hard-coded tasks and deterministic computation, it runs in a "thinking-disabled" mode that favors precise, structured outputs with minimal hallucination, which makes it well suited to rigorous coding workflows and embedded logic applications.
17
+
18
+ ## Model Files
19
+
20
+ | File Name | Size | Precision |
21
+ |-----------|------|-----------|
22
+ | Omega-Qwen2.5-Coder-3B.BF16.gguf | 6.18 GB | BF16 |
23
+ | Omega-Qwen2.5-Coder-3B.F16.gguf | 6.18 GB | F16 |
24
+ | Omega-Qwen2.5-Coder-3B.F32.gguf | 12.3 GB | F32 |
25
+ | Omega-Qwen2.5-Coder-3B.Q2_K.gguf | 1.27 GB | Q2_K |
26
+ | Omega-Qwen2.5-Coder-3B.Q3_K_L.gguf | 1.71 GB | Q3_K_L |
27
+ | Omega-Qwen2.5-Coder-3B.Q3_K_M.gguf | 1.59 GB | Q3_K_M |
28
+ | Omega-Qwen2.5-Coder-3B.Q3_K_S.gguf | 1.45 GB | Q3_K_S |
29
+ | Omega-Qwen2.5-Coder-3B.Q4_K_M.gguf | 1.93 GB | Q4_K_M |
30
+ | Omega-Qwen2.5-Coder-3B.Q4_K_S.gguf | 1.83 GB | Q4_K_S |
31
+ | Omega-Qwen2.5-Coder-3B.Q5_K_M.gguf | 2.22 GB | Q5_K_M |
32
+ | Omega-Qwen2.5-Coder-3B.Q5_K_S.gguf | 2.17 GB | Q5_K_S |
33
+ | Omega-Qwen2.5-Coder-3B.Q6_K.gguf | 2.54 GB | Q6_K |
34
+ | Omega-Qwen2.5-Coder-3B.Q8_0.gguf | 3.29 GB | Q8_0 |
35
+
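The sizes in the table can be turned into a rough "effective bits per weight" figure by scaling against the F16 file, which stores 16 bits per weight. The sketch below is only an approximation under stated assumptions: it ignores GGUF metadata overhead, uses the rounded sizes from the table, and the helper name `approx_bits_per_weight` is hypothetical. K-quants usually keep some tensors at higher precision, so the numbers come out above the nominal bit width.

```python
# Sizes in GB, copied from the Model Files table above.
SIZES_GB = {
    "BF16": 6.18, "F16": 6.18, "F32": 12.3,
    "Q2_K": 1.27, "Q3_K_S": 1.45, "Q3_K_M": 1.59, "Q3_K_L": 1.71,
    "Q4_K_S": 1.83, "Q4_K_M": 1.93, "Q5_K_S": 2.17, "Q5_K_M": 2.22,
    "Q6_K": 2.54, "Q8_0": 3.29,
}


def approx_bits_per_weight(precision: str,
                           reference: str = "F16",
                           reference_bits: float = 16.0) -> float:
    """Estimate bits per weight from the file-size ratio to a reference file.

    Assumption: file size is proportional to bits per weight, which
    ignores metadata and any size rounding in the table.
    """
    return reference_bits * SIZES_GB[precision] / SIZES_GB[reference]


if __name__ == "__main__":
    # Print the files from smallest to largest with their estimate.
    for name in sorted(SIZES_GB, key=SIZES_GB.get):
        bits = approx_bits_per_weight(name)
        print(f"{name:8s} {SIZES_GB[name]:6.2f} GB  ~{bits:4.1f} bits/weight")
```

Read this way, the table shows how much each step down in precision actually saves on disk, which is often less than the quant name alone suggests.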
36
+ ## Quants Usage
37
+
38
+ (Files are listed alphabetically by name, not by size or quality. IQ-quants are often preferable to similar-sized non-IQ quants, although none are listed in the table above.)
39
+
40
+ Here is a handy graph by ikawrakow comparing some lower-quality quant
41
+ types (lower is better):
42
+
43
+ ![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
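Since size is the main practical constraint when choosing a quant, one simple approach is to take the largest quantized file that fits a memory budget. The following is a minimal sketch, not a recommendation from the model author: the function name `largest_quant_within` and the default 0.5 GB headroom (reserved for context and runtime buffers) are assumptions, and file size is only a proxy for actual memory use.

```python
from typing import Optional

# Quantized files only; sizes in GB, copied from the Model Files table.
QUANT_SIZES_GB = {
    "Q2_K": 1.27, "Q3_K_S": 1.45, "Q3_K_M": 1.59, "Q3_K_L": 1.71,
    "Q4_K_S": 1.83, "Q4_K_M": 1.93, "Q5_K_S": 2.17, "Q5_K_M": 2.22,
    "Q6_K": 2.54, "Q8_0": 3.29,
}


def largest_quant_within(budget_gb: float,
                         headroom_gb: float = 0.5) -> Optional[str]:
    """Return the largest quant whose file fits in budget_gb minus headroom.

    Returns None when no quant fits. The headroom value is an assumed
    allowance, not a measured requirement.
    """
    limit = budget_gb - headroom_gb
    fitting = [(size, name) for name, size in QUANT_SIZES_GB.items()
               if size <= limit]
    if not fitting:
        return None
    # max() compares by size first, so this picks the largest fitting file.
    return max(fitting)[1]


if __name__ == "__main__":
    for budget in (1.0, 3.0, 4.0):
        choice = largest_quant_within(budget)
        if choice is None:
            print(f"{budget} GB: no quant fits")
        else:
            print(f"{budget} GB: Omega-Qwen2.5-Coder-3B.{choice}.gguf")
```

Larger is not always better in quality per byte, as the graph above illustrates, so treat the result as a starting point rather than a final choice.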