Instructions to use madhuHuggingface/functiongemma-ec2-finetuned with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use madhuHuggingface/functiongemma-ec2-finetuned with Transformers:

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("madhuHuggingface/functiongemma-ec2-finetuned", dtype="auto")

llama-cpp-python

How to use madhuHuggingface/functiongemma-ec2-finetuned with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="madhuHuggingface/functiongemma-ec2-finetuned",
	filename="gguf/functiongemma-270m-it.Q8_0.gguf",
)

llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

Notebooks
Google Colab
Kaggle
Local Apps Settings

llama.cpp

How to use madhuHuggingface/functiongemma-ec2-finetuned with llama.cpp:

Install from brew

brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0
# Run inference directly in the terminal:
llama-cli -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0
# Run inference directly in the terminal:
llama-cli -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0
# Run inference directly in the terminal:
./llama-cli -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0
# Run inference directly in the terminal:
./build/bin/llama-cli -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0

Use Docker

docker model run hf.co/madhuHuggingface/functiongemma-ec2-finetuned:Q8_0

LM Studio
Jan
Ollama
How to use madhuHuggingface/functiongemma-ec2-finetuned with Ollama:
```
ollama run hf.co/madhuHuggingface/functiongemma-ec2-finetuned:Q8_0
```

Unsloth Studio

How to use madhuHuggingface/functiongemma-ec2-finetuned with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for madhuHuggingface/functiongemma-ec2-finetuned to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for madhuHuggingface/functiongemma-ec2-finetuned to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for madhuHuggingface/functiongemma-ec2-finetuned to start chatting

How to use madhuHuggingface/functiongemma-ec2-finetuned with Pi:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0

Configure the model in Pi

# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "llama-cpp": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        {
          "id": "madhuHuggingface/functiongemma-ec2-finetuned:Q8_0"
        }
      ]
    }
  }
}

Run Pi

# Start Pi in your project directory:
pi

Hermes Agent new

How to use madhuHuggingface/functiongemma-ec2-finetuned with Hermes Agent:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf madhuHuggingface/functiongemma-ec2-finetuned:Q8_0

Configure Hermes

# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default madhuHuggingface/functiongemma-ec2-finetuned:Q8_0

Run Hermes

hermes

Docker Model Runner
How to use madhuHuggingface/functiongemma-ec2-finetuned with Docker Model Runner:
```
docker model run hf.co/madhuHuggingface/functiongemma-ec2-finetuned:Q8_0
```

Lemonade

How to use madhuHuggingface/functiongemma-ec2-finetuned with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull madhuHuggingface/functiongemma-ec2-finetuned:Q8_0

Run and chat with the model

lemonade run user.functiongemma-ec2-finetuned-Q8_0

List all available models

lemonade list

madhuHuggingface commited on Apr 24

Commit

fbca40b

verified ·

1 Parent(s): c06f408

Training in progress, step 500, checkpoint

Browse files

Files changed (4) hide show

last-checkpoint/adapter_model.safetensors +1 -1
last-checkpoint/optimizer.pt +1 -1
last-checkpoint/scheduler.pt +1 -1
last-checkpoint/trainer_state.json +73 -3

last-checkpoint/adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:342b5b54eeabad3537367e1e8ec3c7d8f5384023782c14c1a4aef737f23bde89
 size 60785144

 version https://git-lfs.github.com/spec/v1
+oid sha256:d27e9476dc918efebb61651f7e9f759934ffdffe549c6fc2165cc9ced32d93a6
 size 60785144

last-checkpoint/optimizer.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7042815b336b5beda88f3e7c0fa8756367e0e644ed3b573dc2a2619ccc065410
 size 31149205

 version https://git-lfs.github.com/spec/v1
+oid sha256:c6ce41753fb994b8f796034d662e460bea6de0413caeaa2d0ec5be27d28aba58
 size 31149205

last-checkpoint/scheduler.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1fd1ec6747143d481855590b0ce95939b6941b3b5048b94e65615074cacfdb81
 size 1465

 version https://git-lfs.github.com/spec/v1
+oid sha256:d35a14be938754e2a3aa3ebe18bdde4e86b890e4e0ff3c1f2a56a75942036606
 size 1465

last-checkpoint/trainer_state.json CHANGED Viewed

@@ -2,9 +2,9 @@
   "best_global_step": null,
   "best_metric": null,
   "best_model_checkpoint": null,
-  "epoch": 1.6,
   "eval_steps": 500,
-  "global_step": 400,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
@@ -288,6 +288,76 @@
       "learning_rate": 9.39786722634207e-05,
       "loss": 0.0157,
       "step": 400
     }
   ],
   "logging_steps": 10,
@@ -307,7 +377,7 @@
       "attributes": {}
     }
   },
-  "total_flos": 1277212245955584.0,
   "train_batch_size": 2,
   "trial_name": null,
   "trial_params": null

   "best_global_step": null,
   "best_metric": null,
   "best_model_checkpoint": null,
+  "epoch": 2.0,
   "eval_steps": 500,
+  "global_step": 500,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
       "learning_rate": 9.39786722634207e-05,
       "loss": 0.0157,
       "step": 400
+    },
+    {
+      "epoch": 1.6400000000000001,
+      "grad_norm": 0.3347119688987732,
+      "learning_rate": 8.968983025525654e-05,
+      "loss": 0.019,
+      "step": 410
+    },
+    {
+      "epoch": 1.6800000000000002,
+      "grad_norm": 0.4677681028842926,
+      "learning_rate": 8.542008030801254e-05,
+      "loss": 0.0119,
+      "step": 420
+    },
+    {
+      "epoch": 1.72,
+      "grad_norm": 0.2853691577911377,
+      "learning_rate": 8.11773290156756e-05,
+      "loss": 0.0327,
+      "step": 430
+    },
+    {
+      "epoch": 1.76,
+      "grad_norm": 0.13519690930843353,
+      "learning_rate": 7.696943297693878e-05,
+      "loss": 0.0133,
+      "step": 440
+    },
+    {
+      "epoch": 1.8,
+      "grad_norm": 0.15065522491931915,
+      "learning_rate": 7.280418424658946e-05,
+      "loss": 0.0193,
+      "step": 450
+    },
+    {
+      "epoch": 1.8399999999999999,
+      "grad_norm": 0.2511173188686371,
+      "learning_rate": 6.868929590641735e-05,
+      "loss": 0.013,
+      "step": 460
+    },
+    {
+      "epoch": 1.88,
+      "grad_norm": 0.2314622700214386,
+      "learning_rate": 6.463238778236288e-05,
+      "loss": 0.0137,
+      "step": 470
+    },
+    {
+      "epoch": 1.92,
+      "grad_norm": 0.002136136870831251,
+      "learning_rate": 6.064097233435333e-05,
+      "loss": 0.0203,
+      "step": 480
+    },
+    {
+      "epoch": 1.96,
+      "grad_norm": 0.5252191424369812,
+      "learning_rate": 5.672244074495689e-05,
+      "loss": 0.0097,
+      "step": 490
+    },
+    {
+      "epoch": 2.0,
+      "grad_norm": 0.46642419695854187,
+      "learning_rate": 5.288404923261361e-05,
+      "loss": 0.0199,
+      "step": 500
     }
   ],
   "logging_steps": 10,
       "attributes": {}
     }
   },
+  "total_flos": 1597906326839808.0,
   "train_batch_size": 2,
   "trial_name": null,
   "trial_params": null