GTKING committed on
Commit
2d850e9
·
1 Parent(s): dd9a47f

Training in progress, step 48

Browse files
README.md CHANGED
@@ -1,3 +1,58 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: meta-llama/Llama-3.2-1B-Instruct
3
+ library_name: transformers
4
+ model_name: ZFusionAI_Hacker
5
+ tags:
6
+ - generated_from_trainer
7
+ - trl
8
+ - sft
9
+ licence: license
10
+ ---
11
+
12
+ # Model Card for ZFusionAI_Hacker
13
+
14
+ This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct).
15
+ It has been trained using [TRL](https://github.com/huggingface/trl).
16
+
17
+ ## Quick start
18
+
19
+ ```python
20
+ from transformers import pipeline
21
+
22
+ question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
23
+ generator = pipeline("text-generation", model="GTKING/ZFusionAI_Hacker", device="cuda")
24
+ output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
25
+ print(output["generated_text"])
26
+ ```
27
+
28
+ ## Training procedure
29
+
30
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/gamingking9025-muthayammal-college-of-engineering/huggingface/runs/ajpft44z)
31
+
32
+
33
+ This model was trained with SFT.
34
+
35
+ ### Framework versions
36
+
37
+ - TRL: 0.26.2
38
+ - Transformers: 4.57.1
39
+ - Pytorch: 2.8.0+cu126
40
+ - Datasets: 4.4.1
41
+ - Tokenizers: 0.22.1
42
+
43
+ ## Citations
44
+
45
+
46
+
47
+ Cite TRL as:
48
+
49
+ ```bibtex
50
+ @misc{vonwerra2022trl,
51
+ title = {{TRL: Transformer Reinforcement Learning}},
52
+ author = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
53
+ year = 2020,
54
+ journal = {GitHub repository},
55
+ publisher = {GitHub},
56
+ howpublished = {\url{https://github.com/huggingface/trl}}
57
+ }
58
+ ```
adapter_config.json ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "alpha_pattern": {},
3
+ "auto_mapping": null,
4
+ "base_model_name_or_path": "meta-llama/Llama-3.2-1B-Instruct",
5
+ "bias": "none",
6
+ "corda_config": null,
7
+ "eva_config": null,
8
+ "exclude_modules": null,
9
+ "fan_in_fan_out": false,
10
+ "inference_mode": true,
11
+ "init_lora_weights": true,
12
+ "layer_replication": null,
13
+ "layers_pattern": null,
14
+ "layers_to_transform": null,
15
+ "loftq_config": {},
16
+ "lora_alpha": 32,
17
+ "lora_bias": false,
18
+ "lora_dropout": 0.0,
19
+ "megatron_config": null,
20
+ "megatron_core": "megatron.core",
21
+ "modules_to_save": null,
22
+ "peft_type": "LORA",
23
+ "qalora_group_size": 16,
24
+ "r": 16,
25
+ "rank_pattern": {},
26
+ "revision": null,
27
+ "target_modules": [
28
+ "q_proj",
29
+ "k_proj",
30
+ "v_proj",
31
+ "o_proj"
32
+ ],
33
+ "target_parameters": null,
34
+ "task_type": "CAUSAL_LM",
35
+ "trainable_token_indices": null,
36
+ "use_dora": false,
37
+ "use_qalora": false,
38
+ "use_rslora": false
39
+ }
chat_template.jinja CHANGED
@@ -1,73 +1,93 @@
1
-
2
  {{- bos_token }}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
 
4
- {# --- Extract system message if present --- #}
5
- {%- if messages and messages[0]['role'] == 'system' %}
6
- {%- set system_message = messages[0]['content'] | trim %}
7
  {%- set messages = messages[1:] %}
8
  {%- else %}
9
  {%- set system_message = "" %}
10
  {%- endif %}
11
 
12
- {# --- System block (minimal) --- #}
13
- {{- "<|start_header_id|>system<|end_header_id|>
14
-
15
- " }}
 
 
 
 
 
 
 
 
 
 
 
 
16
  {{- system_message }}
17
  {{- "<|eot_id|>" }}
18
 
19
- {# --- Optional tool definitions (in user message) --- #}
20
- {%- if tools is defined and tools is not none %}
21
- {%- set first_user = messages[0]['content'] | trim %}
22
- {%- set messages = messages[1:] %}
23
-
24
- {{- "<|start_header_id|>user<|end_header_id|>
25
-
26
- " }}
27
- {{- "You have access to the following functions.
28
- " }}
29
- {{- "Respond ONLY with a JSON function call.
30
-
31
- " }}
32
-
33
  {%- for t in tools %}
34
- {{- t | tojson(indent=2) }}
35
- {{- "
36
-
37
- " }}
38
  {%- endfor %}
39
-
40
- {{- first_user }}
41
- {{- "<|eot_id|>" }}
42
  {%- endif %}
43
 
44
- {# --- Remaining messages --- #}
45
  {%- for message in messages %}
46
- {%- if message.role in ["user", "assistant"] %}
47
- {{- "<|start_header_id|>" + message.role + "<|end_header_id|>
48
-
49
- " }}
50
- {{- message.content | trim }}
51
- {{- "<|eot_id|>" }}
52
- {%- elif "tool_calls" in message %}
53
- {%- set call = message.tool_calls[0].function %}
54
- {{- "<|start_header_id|>assistant<|end_header_id|>
55
-
56
- " }}
57
- {{- '{"name": "' + call.name + '", "parameters": ' + (call.arguments | tojson) + '}' }}
58
  {{- "<|eot_id|>" }}
59
- {%- elif message.role == "tool" %}
60
- {{- "<|start_header_id|>tool<|end_header_id|>
61
-
62
- " }}
63
- {{- message.content }}
 
 
64
  {{- "<|eot_id|>" }}
65
  {%- endif %}
66
  {%- endfor %}
67
-
68
- {# --- Generation prompt --- #}
69
  {%- if add_generation_prompt %}
70
- {{- "<|start_header_id|>assistant<|end_header_id|>
71
-
72
- " }}
73
  {%- endif %}
 
 
1
  {{- bos_token }}
2
+ {%- if custom_tools is defined %}
3
+ {%- set tools = custom_tools %}
4
+ {%- endif %}
5
+ {%- if not tools_in_user_message is defined %}
6
+ {%- set tools_in_user_message = true %}
7
+ {%- endif %}
8
+ {%- if not date_string is defined %}
9
+ {%- if strftime_now is defined %}
10
+ {%- set date_string = strftime_now("%d %b %Y") %}
11
+ {%- else %}
12
+ {%- set date_string = "26 Jul 2024" %}
13
+ {%- endif %}
14
+ {%- endif %}
15
+ {%- if not tools is defined %}
16
+ {%- set tools = none %}
17
+ {%- endif %}
18
 
19
+ {#- This block extracts the system message, so we can slot it into the right place. #}
20
+ {%- if messages[0]['role'] == 'system' %}
21
+ {%- set system_message = messages[0]['content']|trim %}
22
  {%- set messages = messages[1:] %}
23
  {%- else %}
24
  {%- set system_message = "" %}
25
  {%- endif %}
26
 
27
+ {#- System message #}
28
+ {{- "<|start_header_id|>system<|end_header_id|>\n\n" }}
29
+ {%- if tools is not none %}
30
+ {{- "Environment: ipython\n" }}
31
+ {%- endif %}
32
+ {{- "Cutting Knowledge Date: December 2023\n" }}
33
+ {{- "Today Date: " + date_string + "\n\n" }}
34
+ {%- if tools is not none and not tools_in_user_message %}
35
+ {{- "You have access to the following functions. To call a function, please respond with JSON for a function call." }}
36
+ {{- 'Respond in the format {"name": function name, "parameters": dictionary of argument name and its value}.' }}
37
+ {{- "Do not use variables.\n\n" }}
38
+ {%- for t in tools %}
39
+ {{- t | tojson(indent=4) }}
40
+ {{- "\n\n" }}
41
+ {%- endfor %}
42
+ {%- endif %}
43
  {{- system_message }}
44
  {{- "<|eot_id|>" }}
45
 
46
+ {#- Custom tools are passed in a user message with some extra guidance #}
47
+ {%- if tools_in_user_message and not tools is none %}
48
+ {#- Extract the first user message so we can plug it in here #}
49
+ {%- if messages | length != 0 %}
50
+ {%- set first_user_message = messages[0]['content']|trim %}
51
+ {%- set messages = messages[1:] %}
52
+ {%- else %}
53
+ {{- raise_exception("Cannot put tools in the first user message when there's no first user message!") }}
54
+ {%- endif %}
55
+ {{- '<|start_header_id|>user<|end_header_id|>\n\n' -}}
56
+ {{- "Given the following functions, please respond with a JSON for a function call " }}
57
+ {{- "with its proper arguments that best answers the given prompt.\n\n" }}
58
+ {{- 'Respond in the format {"name": function name, "parameters": dictionary of argument name and its value}.' }}
59
+ {{- "Do not use variables.\n\n" }}
60
  {%- for t in tools %}
61
+ {{- t | tojson(indent=4) }}
62
+ {{- "\n\n" }}
 
 
63
  {%- endfor %}
64
+ {{- first_user_message + "<|eot_id|>"}}
 
 
65
  {%- endif %}
66
 
 
67
  {%- for message in messages %}
68
+ {%- if not (message.role == 'ipython' or message.role == 'tool' or 'tool_calls' in message) %}
69
+ {{- '<|start_header_id|>' + message['role'] + '<|end_header_id|>\n\n'+ message['content'] | trim + '<|eot_id|>' }}
70
+ {%- elif 'tool_calls' in message %}
71
+ {%- if not message.tool_calls|length == 1 %}
72
+ {{- raise_exception("This model only supports single tool-calls at once!") }}
73
+ {%- endif %}
74
+ {%- set tool_call = message.tool_calls[0].function %}
75
+ {{- '<|start_header_id|>assistant<|end_header_id|>\n\n' -}}
76
+ {{- '{"name": "' + tool_call.name + '", ' }}
77
+ {{- '"parameters": ' }}
78
+ {{- tool_call.arguments | tojson }}
79
+ {{- "}" }}
80
  {{- "<|eot_id|>" }}
81
+ {%- elif message.role == "tool" or message.role == "ipython" %}
82
+ {{- "<|start_header_id|>ipython<|end_header_id|>\n\n" }}
83
+ {%- if message.content is mapping or message.content is iterable %}
84
+ {{- message.content | tojson }}
85
+ {%- else %}
86
+ {{- message.content }}
87
+ {%- endif %}
88
  {{- "<|eot_id|>" }}
89
  {%- endif %}
90
  {%- endfor %}
 
 
91
  {%- if add_generation_prompt %}
92
+ {{- '<|start_header_id|>assistant<|end_header_id|>\n\n' }}
 
 
93
  {%- endif %}
special_tokens_map.json CHANGED
@@ -12,6 +12,5 @@
12
  "normalized": false,
13
  "rstrip": false,
14
  "single_word": false
15
- },
16
- "pad_token": "<|eot_id|>"
17
  }
 
12
  "normalized": false,
13
  "rstrip": false,
14
  "single_word": false
15
+ }
 
16
  }
tokenizer_config.json CHANGED
@@ -2058,6 +2058,5 @@
2058
  "attention_mask"
2059
  ],
2060
  "model_max_length": 131072,
2061
- "pad_token": "<|eot_id|>",
2062
  "tokenizer_class": "PreTrainedTokenizerFast"
2063
  }
 
2058
  "attention_mask"
2059
  ],
2060
  "model_max_length": 131072,
 
2061
  "tokenizer_class": "PreTrainedTokenizerFast"
2062
  }