AdvRahul committed
Commit c5c1ce7 · verified · Parent: bad2a38

Update README.md

Files changed (1): README.md (+5 -6)
README.md CHANGED
@@ -6,10 +6,9 @@ pipeline_tag: text-generation
 base_model: Qwen/Qwen3-4B-Instruct-2507
 tags:
 - llama-cpp
-- gguf-my-repo
 ---
 
-# AdvRahul/Qwen3-4B-Instruct-2507-Q4_K_M-GGUF
+# AdvRahul/Axion-4B
 This model was converted to GGUF format from [`Qwen/Qwen3-4B-Instruct-2507`](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) for more details on the model.
 
@@ -24,12 +23,12 @@ Invoke the llama.cpp server or the CLI.
 
 ### CLI:
 ```bash
-llama-cli --hf-repo AdvRahul/Qwen3-4B-Instruct-2507-Q4_K_M-GGUF --hf-file qwen3-4b-instruct-2507-q4_k_m.gguf -p "The meaning to life and the universe is"
+llama-cli --hf-repo AdvRahul/Axion-4B-Q4_K_M-GGUF --hf-file Axion-4B-Q4_K_M.gguf -p "The meaning to life and the universe is"
 ```
 
 ### Server:
 ```bash
-llama-server --hf-repo AdvRahul/Qwen3-4B-Instruct-2507-Q4_K_M-GGUF --hf-file qwen3-4b-instruct-2507-q4_k_m.gguf -c 2048
+llama-server --hf-repo AdvRahul/Axion-4B-Q4_K_M-GGUF --hf-file Axion-4B-Q4_K_M.gguf -c 2048
 ```
 
 Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
@@ -46,9 +45,9 @@ cd llama.cpp && LLAMA_CURL=1 make
 
 Step 3: Run inference through the main binary.
 ```
-./llama-cli --hf-repo AdvRahul/Qwen3-4B-Instruct-2507-Q4_K_M-GGUF --hf-file qwen3-4b-instruct-2507-q4_k_m.gguf -p "The meaning to life and the universe is"
+./llama-cli --hf-repo AdvRahul/Axion-4B-Q4_K_M-GGUF --hf-file Axion-4B-Q4_K_M.gguf -p "The meaning to life and the universe is"
 ```
 or
 ```
-./llama-server --hf-repo AdvRahul/Qwen3-4B-Instruct-2507-Q4_K_M-GGUF --hf-file qwen3-4b-instruct-2507-q4_k_m.gguf -c 2048
+./llama-server --hf-repo AdvRahul/Axion-4B-Q4_K_M-GGUF --hf-file Axion-4B-Q4_K_M.gguf -c 2048
 ```
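
The `--hf-repo`/`--hf-file` invocations above re-resolve the model from the Hub each time. A minimal sketch of the equivalent one-time-download workflow, assuming the renamed repo `AdvRahul/Axion-4B-Q4_K_M-GGUF` and file `Axion-4B-Q4_K_M.gguf` from the diff above, and that `huggingface-cli` is installed (`pip install huggingface_hub`):

```bash
# Fetch the quantized weights once into the current directory
# (repo and file names are taken from the renamed paths in this commit).
huggingface-cli download AdvRahul/Axion-4B-Q4_K_M-GGUF Axion-4B-Q4_K_M.gguf --local-dir .

# Run the CLI against the local file instead of --hf-repo/--hf-file;
# -n caps the number of tokens generated.
llama-cli -m ./Axion-4B-Q4_K_M.gguf -p "The meaning to life and the universe is" -n 128
```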
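Once `llama-server` is running with `-c 2048` as in the diff, recent llama.cpp builds expose an OpenAI-compatible HTTP API, by default on port 8080. Exact routes and defaults vary by version, so treat this as a sketch rather than a guaranteed interface:

```bash
# Query the server's OpenAI-compatible chat endpoint
# (default host/port; adjust if llama-server was started with --host/--port).
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Explain GGUF quantization in one sentence."}
    ],
    "max_tokens": 128
  }'
```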