ariel-pillar/phi-4_function_calling

ariel-pillar committed on May 31, 2025

Commit e29b963 (verified) · 1 Parent(s): 8ae50d9

make clear that readme is about tool use

Files changed (1)

README.md +39 -9

@@ -1,10 +1,18 @@
----
-base_model:
-- microsoft/Phi-4-mini-instruct
----
-# Phi-4-mini-instruct with llama-server
-This repository contains instructions for running the Phi-4-mini-instruct model using llama-server, which provides a ChatGPT-compatible API interface.
 ## Prerequisites
@@ -31,7 +39,6 @@ Start the llama-server with the following command:
 llama-server \
     --model models/Phi-4-mini-instruct-Q4_K_M-modified.gguf \
     --port 8082 \
-    --verbose \
     --jinja
 ```
@@ -71,6 +78,27 @@ curl http://localhost:8082/v1/chat/completions \
   }'
 ```
 ## API Endpoints
 The server provides a ChatGPT-compatible API with the following main endpoints:
@@ -82,8 +110,10 @@ The server provides a ChatGPT-compatible API with the following main endpoints:
 ## Notes
 - The server uses the same API format as OpenAI's ChatGPT API, making it compatible with many existing tools and libraries
-- The `--jinja` flag enables proper chat template formatting for the model
 - The model name in the requests can be set to "any-model" as shown in the examples
 ## Troubleshooting
@@ -96,4 +126,4 @@ If you encounter issues:
 ## License
-Please ensure you comply with the model's license terms when using it.

+---
+base_model:
+- microsoft/Phi-4-mini-instruct
+---
+# Phi-4-mini-instruct with llama-server (Tool-Enhanced Version)
+This repository contains instructions for running a modified version of the Phi-4-mini-instruct model using llama-server. This version has been enhanced to support tool usage, allowing the model to interact with external tools and APIs through a ChatGPT-compatible interface.
+## Model Capabilities
+This modified version of Phi-4-mini-instruct includes:
+- Full support for tool usage and function calling
+- Custom chat template optimized for tool interactions
+- Ability to process and respond to tool outputs
+- ChatGPT-compatible API interface
 ## Prerequisites
 llama-server \
     --model models/Phi-4-mini-instruct-Q4_K_M-modified.gguf \
     --port 8082 \
     --jinja
 ```
   }'
 ```
+### Example 3: Using Tools
+```bash
+curl http://localhost:8082/v1/chat/completions \
+  -H "Content-Type: application/json" \
+  -d '{
+    "model": "any-model",
+    "messages": [
+      {
+        "role": "system",
+        "content": "You are a helpful AI assistant that can use tools.",
+        "tools": "[{\"name\": \"calculator\", \"description\": \"Useful for performing mathematical calculations\", \"parameters\": {\"type\": \"object\", \"properties\": {\"expression\": {\"type\": \"string\", \"description\": \"The mathematical expression to evaluate\"}}}}]"
+      },
+      {
+        "role": "user",
+        "content": "What is 235 * 89?"
+      }
+    ]
+  }'
+```
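Reviewer sketch (not part of the commit): the request in Example 3 can also be assembled from a script, which avoids hand-escaping the tool list. The block below is a minimal sketch, assuming the server from the README is listening on `localhost:8082` and that the custom chat template expects `tools` as a JSON-encoded string on the system message, exactly as the curl example sends it. The names `build_tool_request`, `send`, and `CALCULATOR_TOOL` are hypothetical helpers introduced here for illustration.

```python
import json
import urllib.request

# Assumption: llama-server runs locally on port 8082, as in the README.
URL = "http://localhost:8082/v1/chat/completions"

# Same tool definition as in Example 3, written as a Python dict.
CALCULATOR_TOOL = {
    "name": "calculator",
    "description": "Useful for performing mathematical calculations",
    "parameters": {
        "type": "object",
        "properties": {
            "expression": {
                "type": "string",
                "description": "The mathematical expression to evaluate",
            }
        },
    },
}


def build_tool_request(question, tools):
    """Build the chat payload, serializing `tools` to a JSON string
    on the system message as Example 3 does."""
    return {
        "model": "any-model",
        "messages": [
            {
                "role": "system",
                "content": "You are a helpful AI assistant that can use tools.",
                "tools": json.dumps(tools),
            },
            {"role": "user", "content": question},
        ],
    }


def send(payload, url=URL):
    """POST the payload as JSON and return the decoded JSON response."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

With a server running, `send(build_tool_request("What is 235 * 89?", [CALCULATOR_TOOL]))` issues the same request as the curl command. Using `json.dumps` for the tool list keeps the nested quoting correct when more tools are added.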
 ## API Endpoints
 The server provides a ChatGPT-compatible API with the following main endpoints:
 ## Notes
 - The server uses the same API format as OpenAI's ChatGPT API, making it compatible with many existing tools and libraries
+- The `--jinja` flag enables proper chat template formatting for the model, which is essential for tool usage
 - The model name in the requests can be set to "any-model" as shown in the examples
+- This version supports system messages with tool definitions
+- Tool responses are properly handled through the chat template
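Reviewer sketch (not part of the commit): the notes above cover how tool definitions and tool responses pass through the chat template, but the server does not run tools itself, so the client has to execute the call and send the result back. The block below is one possible local handler for the `calculator` tool from Example 3, assuming the model supplies a plain arithmetic string in `expression`. The function name `evaluate` and the set of allowed operators are choices made here for illustration, not something the README specifies.

```python
import ast
import operator

# Operators this hypothetical calculator handler accepts.
_BINARY = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}
_UNARY = {ast.UAdd: operator.pos, ast.USub: operator.neg}


def evaluate(expression):
    """Evaluate a plain arithmetic expression without calling eval()."""

    def walk(node):
        # Numeric literals only; booleans and strings are rejected.
        if isinstance(node, ast.Constant) and type(node.value) in (int, float):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _BINARY:
            return _BINARY[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY:
            return _UNARY[type(node.op)](walk(node.operand))
        raise ValueError(f"unsupported expression: {expression!r}")

    return walk(ast.parse(expression, mode="eval").body)


print(evaluate("235 * 89"))  # prints 20915
```

Walking the parsed syntax tree instead of calling `eval()` means model-generated input such as function calls or attribute access raises `ValueError` rather than executing.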
 ## Troubleshooting
 ## License
+Please ensure you comply with the model's license terms when using it.