Daffaadityp commited on
Commit
803ceac
ยท
0 Parent(s):

Duplicate from Daffaadityp/AxonAI-MX4-2.0-GGUF

Browse files
.gitattributes ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ AxonAI-MX4-2.0-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ AxonAI-MX4-2.0-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
38
+ AxonAI-MX4-2.0-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
AxonAI-MX4-2.0-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:31ec86e26703ce9b1d591198d17ae88d0028662e239511b26bbb0f3379d89b2f
3
+ size 1669500512
AxonAI-MX4-2.0-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dd5013a31ac38dc96e1d7f77c97f69ae774336bc36c89629414a0ee0d1bcc29f
3
+ size 2497281632
AxonAI-MX4-2.0-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e4415ba571b7e0407913d93dbb0b6159fab082986790dc7783ad53f267f3bc09
3
+ size 4280406112
README.md ADDED
@@ -0,0 +1,410 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: Daffaadityp/AxonAI-MX4-2.0
3
+ language:
4
+ - en
5
+ - id
6
+ license: apache-2.0
7
+ tags:
8
+ - gguf
9
+ - quantized
10
+ - qwen3
11
+ - dora
12
+ - axonlabs
13
+ - reasoning
14
+ - local-llm
15
+ - chain-of-thought
16
+ - edge-ai
17
+ - ollama
18
+ - llama-cpp
19
+ - indonesian-ai
20
+ - text-generation
21
+ - 4b
22
+ - instruct
23
+ pipeline_tag: text-generation
24
+ library_name: gguf
25
+ ---
26
+
27
+ <div align="center">
28
+
29
+ # ๐Ÿง  AxonAI MX4 2.0 โ€” GGUF Quantized Edition
30
+
31
+ ### *Reasoning-First Language Model ยท 4B Parameters ยท Chain-of-Thought Native*
32
+ ### *Optimized for Local Inference ยท Edge Devices ยท Laptops ยท Offline AI*
33
+
34
+ <br>
35
+
36
+ [![Model](https://img.shields.io/badge/Base%20Model-AxonAI%20MX4%202.0-blueviolet?style=for-the-badge&logo=huggingface)](https://huggingface.co/Daffaadityp/AxonAI-MX4-2.0)
37
+ [![Format](https://img.shields.io/badge/Format-GGUF-orange?style=for-the-badge&logo=llvm)](https://github.com/ggerganov/llama.cpp)
38
+ [![Quantization](https://img.shields.io/badge/Quants-Q2__K%20%7C%20Q4__K__M%20%7C%20Q8__0-brightgreen?style=for-the-badge)](https://github.com/ggerganov/llama.cpp#quantization)
39
+ [![Ollama](https://img.shields.io/badge/Ollama-Compatible-informational?style=for-the-badge&logo=ollama)](https://ollama.com)
40
+ [![llama.cpp](https://img.shields.io/badge/llama.cpp-Compatible-success?style=for-the-badge)](https://github.com/ggerganov/llama.cpp)
41
+ [![LM Studio](https://img.shields.io/badge/LM%20Studio-Compatible-9cf?style=for-the-badge)](https://lmstudio.ai)
42
+ [![Parameters](https://img.shields.io/badge/Parameters-4B-blue?style=for-the-badge)](https://huggingface.co/Daffaadityp/AxonAI-MX4-2.0)
43
+ [![License](https://img.shields.io/badge/License-Apache%202.0-red?style=for-the-badge)](https://www.apache.org/licenses/LICENSE-2.0)
44
+ [![Made in Indonesia](https://img.shields.io/badge/Made%20in-Indonesia%20๐Ÿ‡ฎ๐Ÿ‡ฉ-red?style=for-the-badge)](https://github.com/Daffaadityp)
45
+
46
+ <br>
47
+
48
+ > **This repository contains the official GGUF quantized files for AxonAI MX4 2.0.**
49
+ > Run a full Chain-of-Thought reasoning LLM *entirely locally* โ€” no GPU required, no internet connection, no API costs. Just pure, structured intelligence on your own hardware.
50
+
51
+ </div>
52
+
53
+ ---
54
+
55
+ ## ๐Ÿ“Œ Quick Navigation
56
+
57
+ | Section | Description |
58
+ |---|---|
59
+ | [๐Ÿ—‚๏ธ Available Files](#๏ธ-available-gguf-files--quantization-guide) | Q2_K, Q4_K_M, Q8_0 โ€” which one is right for you? |
60
+ | [๐Ÿš€ Ollama Quickstart](#-ollama-quickstart-recommended) | Easiest way to run locally โ€” one command |
61
+ | [โš™๏ธ llama.cpp CLI](#๏ธ-llamacpp-cli) | For advanced users and scripting |
62
+ | [๐Ÿ–ฅ๏ธ LM Studio / GPT4All](#๏ธ-lm-studio--gpt4all) | GUI-based local inference |
63
+ | [๐Ÿงฌ Why Quantized Reasoning?](#-why-a-quantized-reasoning-model-is-so-powerful) | The secret sauce โ€” explained for GGUF |
64
+ | [๐Ÿ› ๏ธ Prompt Format](#๏ธ-prompt--system-format) | How to structure your prompts |
65
+ | [๐Ÿ‡ฎ๐Ÿ‡ฉ Komunitas Indonesia](#-untuk-developer-indonesia) | Untuk para developer Tanah Air |
66
+
67
+ ---
68
+
69
+ ## ๐ŸŒ What Is This Repository?
70
+
71
+ This is the **official GGUF release** of [AxonAI MX4 2.0](https://huggingface.co/Daffaadityp/AxonAI-MX4-2.0), a 4-billion-parameter reasoning-first language model built by **AxonLabs** (SMKN 26 Jakarta). The original model was trained using **DoRA (Weight-Decomposed Low-Rank Adaptation)** on top of the Qwen3 architecture, fine-tuned to produce structured, transparent Chain-of-Thought (`<think>`) reasoning before every final response.
72
+
73
+ These GGUF files were produced using `llama.cpp`'s official quantization pipeline, preserving the model's reasoning depth while dramatically reducing memory footprint โ€” making **local LLM inference** accessible on consumer hardware.
74
+
75
+ **If you want the full-precision FP16/BF16 weights**, visit the original repository:
76
+ ๐Ÿ‘‰ [`Daffaadityp/AxonAI-MX4-2.0`](https://huggingface.co/Daffaadityp/AxonAI-MX4-2.0)
77
+
78
+ ---
79
+
80
+ ## ๐Ÿ—‚๏ธ Available GGUF Files & Quantization Guide
81
+
82
+ Choose the right quantization level for your hardware. As a general rule: **higher Q = better quality, higher RAM requirement**.
83
+
84
+ | File | Quant Type | Size (Est.) | Min RAM | Quality | Use Case |
85
+ |---|---|---|---|---|---|
86
+ | `AxonAI-MX4-2.0-Q2_K.gguf` | Q2_K | ~1.7 GB | 4 GB | โšก Fast / Compressed | Raspberry Pi, very old laptops, extreme RAM constraints |
87
+ | `AxonAI-MX4-2.0-Q4_K_M.gguf` | Q4_K_M | ~2.7 GB | 6 GB | โญ **Recommended** | Mac M1/M2, standard laptops, WSL2, most modern CPUs |
88
+ | `AxonAI-MX4-2.0-Q8_0.gguf` | Q8_0 | ~4.5 GB | 8 GB | ๐Ÿ”ฌ Near-FP16 | Workstations, gaming PCs with ample RAM, power users |
89
+
90
+ ### โญ Recommendation: Start with `Q4_K_M`
91
+
92
+ `Q4_K_M` is the universally recommended sweet spot for local LLM inference. It delivers:
93
+ - **~95% of the full-precision model quality** at less than 35% of the memory cost
94
+ - Excellent performance on **Apple Silicon (M1/M2/M3)**, standard x86 laptops, and cloud VMs
95
+ - The best balance of **inference speed**, **reasoning coherence**, and **RAM efficiency**
96
+
97
+ > ๐Ÿ’ก For most users: **Q4_K_M is the right choice. Start here.**
98
+
99
+ ---
100
+
101
+ ## ๐Ÿš€ Ollama Quickstart (Recommended)
102
+
103
+ [Ollama](https://ollama.com) is the fastest way to run AxonAI MX4 2.0 locally. No Python setup required.
104
+
105
+ ### Step 1 โ€” Install Ollama
106
+
107
+ ```bash
108
+ # macOS / Linux
109
+ curl -fsSL https://ollama.com/install.sh | sh
110
+
111
+ # Windows: Download installer from https://ollama.com/download
112
+ ```
113
+
114
+ ### Step 2 โ€” Create a Modelfile
115
+
116
+ Create a file named `Modelfile` (no extension) in your working directory:
117
+
118
+ ```dockerfile
119
+ # Modelfile for AxonAI MX4 2.0 (Q4_K_M - Recommended)
120
+ FROM ./AxonAI-MX4-2.0-Q4_K_M.gguf
121
+
122
+ # --- Core Identity & Reasoning System Prompt ---
123
+ SYSTEM """
124
+ You are AxonAI, an advanced reasoning assistant developed by AxonLabs.
125
+ Before answering any question, you MUST use your internal scratchpad enclosed in <think>...</think> tags to reason step-by-step.
126
+ Only after completing your reasoning should you provide a clear, structured, and helpful final answer.
127
+ Be precise, thorough, and transparent in your logic.
128
+ """
129
+
130
+ # --- Generation Parameters (Optimized for Reasoning) ---
131
+ PARAMETER temperature 0.6
132
+ PARAMETER top_p 0.95
133
+ PARAMETER top_k 20
134
+ PARAMETER repeat_penalty 1.1
135
+ PARAMETER num_ctx 8192
136
+ ```
137
+
138
+ > ๐Ÿ’ก **Why the `<think>` system prompt?** AxonAI MX4 2.0 was fine-tuned with Chain-of-Thought supervision. Including this system prompt *unlocks* the model's full reasoning capability. Without it, you may get direct answers without the structured deliberation the model was trained to produce.
139
+
140
+ ### Step 3 โ€” Build and Run
141
+
142
+ ```bash
143
+ # Build the local Ollama model from your Modelfile
144
+ ollama create axonai-mx4 -f ./Modelfile
145
+
146
+ # Run it interactively
147
+ ollama run axonai-mx4
148
+
149
+ # Or run with a direct prompt
150
+ ollama run axonai-mx4 "Explain the P vs NP problem and whether you think it will ever be solved."
151
+ ```
152
+
153
+ ### Using the Ollama REST API
154
+
155
+ Once running, Ollama exposes a local REST API โ€” perfect for integrations:
156
+
157
+ ```bash
158
+ curl http://localhost:11434/api/generate \
159
+ -H "Content-Type: application/json" \
160
+ -d '{
161
+ "model": "axonai-mx4",
162
+ "prompt": "What are the ethical implications of deploying AI in judicial systems?",
163
+ "stream": false
164
+ }'
165
+ ```
166
+
167
+ ---
168
+
169
+ ## โš™๏ธ llama.cpp CLI
170
+
171
+ For advanced users, scripting pipelines, or maximum performance control.
172
+
173
+ ### Install llama.cpp
174
+
175
+ ```bash
176
+ git clone https://github.com/ggerganov/llama.cpp
177
+ cd llama.cpp
178
+ cmake -B build
179
+ cmake --build build --config Release -j$(nproc)
180
+ ```
181
+
182
+ ### Run Inference
183
+
184
+ ```bash
185
+ # Basic interactive mode (Q4_K_M recommended)
186
+ ./build/bin/llama-cli \
187
+ -m ./AxonAI-MX4-2.0-Q4_K_M.gguf \
188
+ -n 2048 \
189
+ --temp 0.6 \
190
+ --top-p 0.95 \
191
+ --top-k 20 \
192
+ --repeat-penalty 1.1 \
193
+ --ctx-size 8192 \
194
+ -i \
195
+ -r "User:" \
196
+ --in-prefix " " \
197
+ -p "You are AxonAI, a reasoning assistant. Think step by step inside <think> tags before answering.\n\nUser:"
198
+ ```
199
+
200
+ ```bash
201
+ # Single-shot inference (batch/scripting)
202
+ ./build/bin/llama-cli \
203
+ -m ./AxonAI-MX4-2.0-Q8_0.gguf \
204
+ -n 1024 \
205
+ --temp 0.6 \
206
+ --ctx-size 8192 \
207
+ -p "<|im_start|>system\nYou are AxonAI. Reason carefully using <think> tags.<|im_end|>\n<|im_start|>user\nSolve: If a train travels 120km at 60km/h, then 80km at 40km/h, what is the average speed for the whole journey?<|im_end|>\n<|im_start|>assistant\n"
208
+ ```
209
+
210
+ > ๐Ÿ”ง **Performance tip:** Add `-ngl 99` flag if you have a GPU (NVIDIA/AMD/Metal) to offload layers โ€” this can yield **3โ€“10x speedup** even with quantized GGUF files.
211
+
212
+ ---
213
+
214
+ ## ๐Ÿ–ฅ๏ธ LM Studio / GPT4All
215
+
216
+ Both LM Studio and GPT4All support direct GGUF loading with a graphical interface โ€” ideal for non-technical users or demos.
217
+
218
+ **LM Studio:**
219
+ 1. Download from [lmstudio.ai](https://lmstudio.ai)
220
+ 2. Go to **Search** โ†’ search `AxonAI` or import GGUF manually via **My Models**
221
+ 3. Load `AxonAI-MX4-2.0-Q4_K_M.gguf`
222
+ 4. In the **System Prompt** field, paste the reasoning system prompt from the Modelfile above
223
+ 5. Start chatting โ€” LM Studio also exposes a local OpenAI-compatible API on port `1234`
224
+
225
+ **GPT4All:**
226
+ 1. Download from [gpt4all.io](https://www.nomic.ai/gpt4all)
227
+ 2. Under **Add Model** โ†’ choose **Import from file** and select your `.gguf` file
228
+ 3. GPT4All works entirely offline after the initial load โ€” perfect for privacy-sensitive use cases
229
+
230
+ ---
231
+
232
+ ## ๐Ÿงฌ Why a Quantized Reasoning Model Is So Powerful
233
+
234
+ Most local LLMs are **answer-first** โ€” they pattern-match to the most statistically likely response. AxonAI MX4 2.0 is fundamentally different.
235
+
236
+ It was trained to **reason before it answers** โ€” meaning every response is preceded by an internal deliberation process encoded inside `<think>...</think>` tags. This is the Chain-of-Thought (CoT) paradigm, and when applied to a quantized local model, several powerful properties emerge:
237
+
238
+ ### ๐Ÿ”’ Complete Privacy, Full Intelligence
239
+ Your prompts **never leave your machine**. Unlike cloud LLM APIs, there is no data sent to any server. You get structured reasoning capability that rivals much larger models โ€” entirely offline. This is essential for:
240
+ - Legal document analysis
241
+ - Medical note summarization
242
+ - Private financial reasoning
243
+ - Proprietary code review
244
+
245
+ ### ๐Ÿ“‰ Quantization โ‰  Reasoning Degradation
246
+ Unlike factual recall (where quantization can cause more hallucination), **structured reasoning is surprisingly robust** to quantization. The logical flow encoded during DoRA fine-tuning is preserved at 4-bit precision. The model still deliberates. It still checks its own steps. It still produces structured conclusions.
247
+
248
+ ### ๐Ÿงฉ The DoRA Advantage
249
+ AxonAI MX4 2.0 was adapted using **DoRA (Weight-Decomposed Low-Rank Adaptation)**, which separates weight updates into magnitude and direction components. This produces **more stable, nuanced fine-tuning** than standard LoRA โ€” and that stability carries through quantization. You get a model that reasons with fidelity even at Q4 compression.
250
+
251
+ ### โšก The Efficiency Equation
252
+ A 4B parameter model at Q4_K_M runs at **~20โ€“60 tokens/second** on Apple M-series chips and modern CPUs. That's fast enough for real-time, interactive reasoning โ€” think of it as having a thoughtful senior analyst available offline, on any machine, forever.
253
+
254
+ ---
255
+
256
+ ## ๐Ÿ› ๏ธ Prompt & System Format
257
+
258
+ AxonAI MX4 2.0 uses the **ChatML** prompt template (inherited from Qwen3):
259
+
260
+ ```
261
+ <|im_start|>system
262
+ {system_prompt}<|im_end|>
263
+ <|im_start|>user
264
+ {user_message}<|im_end|>
265
+ <|im_start|>assistant
266
+ <think>
267
+ {internal reasoning โ€” model generates this}
268
+ </think>
269
+ {final answer โ€” model generates this}
270
+ <|im_end|>
271
+ ```
272
+
273
+ ### Recommended System Prompt (Full Version)
274
+
275
+ ```
276
+ You are AxonAI, an advanced reasoning language model developed by AxonLabs.
277
+ Your core capability is structured deliberation: before answering any question,
278
+ you MUST think step-by-step inside <think>...</think> tags.
279
+
280
+ Guidelines:
281
+ - Use <think> to break down the problem, consider edge cases, and verify your logic.
282
+ - After </think>, give a clear, well-structured, and helpful final answer.
283
+ - Be honest about uncertainty. Never fabricate facts.
284
+ - For math and logic, show your work explicitly inside <think>.
285
+ - For creative or open-ended tasks, use <think> to plan your response structure.
286
+ ```
287
+
288
+ ### Minimal System Prompt (Fast / Lightweight)
289
+
290
+ ```
291
+ You are AxonAI. Always reason inside <think>...</think> before your final answer.
292
+ ```
293
+
294
+ ---
295
+
296
+ ## ๐Ÿ“Š Model Architecture & Training Summary
297
+
298
+ | Property | Value |
299
+ |---|---|
300
+ | **Base Architecture** | Qwen3 (4B) |
301
+ | **Fine-Tuning Method** | DoRA (Weight-Decomposed Low-Rank Adaptation) |
302
+ | **Training Paradigm** | Chain-of-Thought Supervised Fine-Tuning |
303
+ | **Context Window** | 8,192 tokens |
304
+ | **Vocab Size** | 151,936 |
305
+ | **Attention Heads** | 32 |
306
+ | **Key-Value Heads** | 8 (Grouped Query Attention) |
307
+ | **Hidden Dimensions** | 2,048 |
308
+ | **GGUF Quantizer** | llama.cpp (official) |
309
+ | **Available Quants** | Q2_K, Q4_K_M, Q8_0 |
310
+ | **Language Support** | English (primary), Indonesian (strong) |
311
+ | **License** | Apache 2.0 |
312
+
313
+ ---
314
+
315
+ ## ๐Ÿ”ฌ Benchmark Context
316
+
317
+ > AxonAI MX4 2.0 is a research and educational model from AxonLabs. Formal benchmark results are forthcoming. The following reflects qualitative design targets based on the training methodology.
318
+
319
+ | Capability | Assessment |
320
+ |---|---|
321
+ | Structured Reasoning (CoT) | โœ… Strong โ€” core training objective |
322
+ | Mathematical Problem Solving | โœ… Good โ€” benefiting from step-by-step CoT |
323
+ | Code Generation (Python/JS) | โœ… Good |
324
+ | Factual Q&A (English) | โœ… Good |
325
+ | Indonesian Language (id) | โœ… Good |
326
+ | Long-Context Coherence (8K) | โš ๏ธ Moderate โ€” improves with Q8_0 |
327
+ | Complex Multi-Step Agentic Tasks | โš ๏ธ Moderate โ€” use longer system prompts |
328
+
329
+ *Community evaluations and PR-based benchmark additions are welcome.*
330
+
331
+ ---
332
+
333
+ ## ๐Ÿ‡ฎ๐Ÿ‡ฉ Untuk Developer Indonesia
334
+
335
+ **Halo, Developer Indonesia! ๐Ÿ™Œ**
336
+
337
+ Ini adalah model AI lokal pertama dari AxonLabs yang bisa kamu jalankan **100% offline di laptop atau PC sendiri** โ€” tanpa perlu GPU mahal, tanpa biaya API, dan tanpa koneksi internet.
338
+
339
+ Bayangkan: punya asisten AI yang bisa berpikir langkah demi langkah, memahami konteks, dan menjawab pertanyaan kompleks โ€” semuanya berjalan di dalam mesin kamu sendiri. Itulah tujuan AxonAI MX4 2.0 GGUF.
340
+
341
+ **Kenapa ini penting buat kamu?**
342
+ - ๐Ÿ”’ **Privasi total** โ€” data kamu tidak pernah keluar dari devicemu
343
+ - ๐Ÿ’ธ **Gratis selamanya** โ€” tidak ada biaya langganan atau token
344
+ - ๐ŸŒ **Bisa dipakai offline** โ€” di daerah dengan koneksi terbatas sekalipun
345
+ - ๐Ÿง  **Reasoning-first** โ€” model ini *mikir dulu* sebelum menjawab, bukan asal tebak
346
+
347
+ Dibangun oleh pelajar SMK, untuk semua orang Indonesia yang ingin mengeksplorasi AI secara langsung.
348
+
349
+ > *"AI terbaik adalah AI yang bisa kamu kontrol sendiri."*
350
+ > โ€” AxonLabs, SMKN 26 Jakarta
351
+
352
+ **Cara paling cepat untuk mulai (5 menit):**
353
+ ```bash
354
+ # 1. Install Ollama
355
+ curl -fsSL https://ollama.com/install.sh | sh
356
+
357
+ # 2. Buat Modelfile (lihat panduan di atas), lalu:
358
+ ollama create axonai-mx4 -f ./Modelfile
359
+
360
+ # 3. Jalankan!
361
+ ollama run axonai-mx4 "Jelaskan cara kerja transformer architecture dalam bahasa yang mudah dipahami."
362
+ ```
363
+
364
+ ---
365
+
366
+ ## โš–๏ธ License & Usage
367
+
368
+ This model is released under the **Apache 2.0 License**.
369
+
370
+ - โœ… Free for personal, academic, and commercial use
371
+ - โœ… Modification and redistribution permitted with attribution
372
+ - โœ… Derivative models and fine-tunes welcome
373
+ - โŒ Must not be used to generate illegal, harmful, or deceptive content
374
+ - โŒ Attribution to AxonLabs / `Daffaadityp/AxonAI-MX4-2.0` required for derivative releases
375
+
376
+ ---
377
+
378
+ ## ๐Ÿ”— Related Resources
379
+
380
+ | Resource | Link |
381
+ |---|---|
382
+ | ๐Ÿง  Original FP16 Model | [Daffaadityp/AxonAI-MX4-2.0](https://huggingface.co/Daffaadityp/AxonAI-MX4-2.0) |
383
+ | ๐Ÿ“ฆ llama.cpp Repository | [github.com/ggerganov/llama.cpp](https://github.com/ggerganov/llama.cpp) |
384
+ | ๐Ÿฆ™ Ollama Documentation | [ollama.com/docs](https://ollama.com) |
385
+ | ๐Ÿ–ฅ๏ธ LM Studio | [lmstudio.ai](https://lmstudio.ai) |
386
+ | ๐Ÿซ AxonLabs / SMKN 26 Jakarta | [Daffaadityp on HuggingFace](https://huggingface.co/Daffaadityp) |
387
+
388
+ ---
389
+
390
+ ## ๐Ÿ’ฌ Community & Feedback
391
+
392
+ Found a bug? Have a benchmark result to share? Want to contribute evaluation data?
393
+
394
+ - **Open a Discussion** on this HuggingFace repository
395
+ - **Open an Issue** on the [AxonAI GitHub](https://github.com/Daffaadityp) (if available)
396
+ - **Community evaluations are actively welcomed** โ€” especially Indonesian-language benchmarks
397
+
398
+ ---
399
+
400
+ <div align="center">
401
+
402
+ *Built with ๐Ÿง  by AxonLabs ยท SMKN 26 Jakarta ยท Indonesia ๐Ÿ‡ฎ๐Ÿ‡ฉ*
403
+
404
+ *"Intelligence is not about speed. It's about depth of thought."*
405
+
406
+ *"Michie Edition"*
407
+
408
+ [![HuggingFace](https://img.shields.io/badge/๐Ÿค—%20HuggingFace-Daffaadityp-yellow?style=for-the-badge)](https://huggingface.co/Daffaadityp)
409
+
410
+ </div>