# Tool Calling Guide
Your ExCom AI deployment supports tool calling! However, there's a quirk with vLLM that requires a workaround.
## The Issue
vLLM only accepts the `tool_choice: "auto"` parameter when it is launched with the `--enable-auto-tool-choice` and `--tool-call-parser` flags. Since Qwen 2.5 has native tool calling built into the model, this deployment doesn't use those flags.
**Result**: LangChain's default agent framework sends `tool_choice: "auto"`, which vLLM rejects with a 400 error.
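If you want to see the failure for yourself, here is a minimal sketch using the same endpoint and model name as the examples below (the `noop` tool is just a throwaway definition): the same request is sent twice, once with `tool_choice="auto"` and once without.

```python
from openai import OpenAI, BadRequestError

client = OpenAI(
    base_url="https://plarnholt-excom-ai-demo.hf.space/v1",
    api_key="not-needed",
)

# Throwaway tool definition, only used to trigger the behavior
tools = [{
    "type": "function",
    "function": {
        "name": "noop",
        "description": "Does nothing",
        "parameters": {"type": "object", "properties": {}},
    },
}]

# With tool_choice="auto", vLLM (started without the flags above) returns a 400
try:
    client.chat.completions.create(
        model="excom-ai",
        messages=[{"role": "user", "content": "Hello"}],
        tools=tools,
        tool_choice="auto",
    )
except BadRequestError as exc:
    print("Rejected:", exc)

# The same request without tool_choice goes through
response = client.chat.completions.create(
    model="excom-ai",
    messages=[{"role": "user", "content": "Hello"}],
    tools=tools,
)
print("Accepted:", response.choices[0].message.content)
```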
## Solution: Use OpenAI SDK Directly
I've created `simple_tool_chat.py`, which uses the OpenAI SDK directly and never sends `tool_choice`.
### Installation
```bash
pip install openai
```
### Usage
```bash
python simple_tool_chat.py
```
### Example Session
```
You: What is 15 * 23 + 100?
πŸ”§ Calling tool: calculator({'expression': '15 * 23 + 100'})
Assistant: The result is 445.
You: What's the weather in Paris and what time is it?
πŸ”§ Calling tool: get_weather({'city': 'Paris'})
πŸ”§ Calling tool: get_current_time({})
Assistant: The weather in Paris is 18Β°C and sunny. The current time is 2025-10-09 18:30:45.
```
## How It Works
1. **No tool_choice parameter** - We don't send `tool_choice` at all
2. **Qwen decides naturally** - The model's training handles when to use tools
3. **OpenAI SDK** - Direct HTTP calls to your vLLM endpoint
4. **Multi-turn** - Maintains conversation history for context
## Using with Your Own Code
```python
from openai import OpenAI
client = OpenAI(
    base_url="https://plarnholt-excom-ai-demo.hf.space/v1",
    api_key="not-needed"
)

# Define your tools
tools = [{
    "type": "function",
    "function": {
        "name": "my_tool",
        "description": "What it does",
        "parameters": {
            "type": "object",
            "properties": {
                "param": {"type": "string"}
            }
        }
    }
}]

# Call without tool_choice parameter
response = client.chat.completions.create(
    model="excom-ai",
    messages=[{"role": "user", "content": "Use my tool"}],
    tools=tools,
    temperature=0.4
    # NOTE: No tool_choice parameter!
)

# Check for tool calls
if response.choices[0].message.tool_calls:
    for tool_call in response.choices[0].message.tool_calls:
        print(f"Tool: {tool_call.function.name}")
        print(f"Args: {tool_call.function.arguments}")
```
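
To get a final natural-language answer, execute the requested tools yourself and send the results back in a second request (again without `tool_choice`). Here is a minimal sketch continuing the example above; `run_my_tool` is a placeholder for your own implementation:

```python
import json

messages = [{"role": "user", "content": "Use my tool"}]
assistant_msg = response.choices[0].message

if assistant_msg.tool_calls:
    # Record the assistant's tool-call turn, then append one "tool" message per call
    messages.append({
        "role": "assistant",
        "tool_calls": [tc.model_dump() for tc in assistant_msg.tool_calls],
    })
    for tool_call in assistant_msg.tool_calls:
        args = json.loads(tool_call.function.arguments)  # arguments arrive as a JSON string
        result = run_my_tool(**args)                     # placeholder: your own implementation
        messages.append({
            "role": "tool",
            "tool_call_id": tool_call.id,
            "content": str(result),
        })

    # Second request turns the tool results into a final answer
    final = client.chat.completions.create(
        model="excom-ai",
        messages=messages,
        tools=tools,
        temperature=0.4
    )
    print(final.choices[0].message.content)
```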
## Adding Custom Tools
Edit `simple_tool_chat.py`:
```python
# 1. Add tool definition to 'tools' list
{
    "type": "function",
    "function": {
        "name": "my_custom_tool",
        "description": "What it does",
        "parameters": {
            "type": "object",
            "properties": {
                "param": {"type": "string", "description": "Param description"}
            },
            "required": ["param"]
        }
    }
}

# 2. Add implementation
def my_custom_tool(param: str) -> str:
    # Your logic here
    return "result"

# 3. Add to dispatcher
def execute_tool(tool_name: str, arguments: dict) -> str:
    # ... existing tools ...
    elif tool_name == "my_custom_tool":
        return my_custom_tool(arguments["param"])
```
## Troubleshooting
**Error: "auto" tool choice requires --enable-auto-tool-choice**
- Cause: you're using LangChain's agent framework, which sends `tool_choice: "auto"`
- Solution: Use `simple_tool_chat.py` instead
**Tool calls not working**
- Make sure your Space is running: https://huggingface.co/spaces/plarnholt/excom-ai-demo
- Check that you're not sending the `tool_choice` parameter
- Verify tools are properly formatted (see OpenAI docs)
**500 Internal Server Error**
- Space might be sleeping - make a request to wake it up
- Check Space logs for errors
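
If you're unsure whether the Space is awake, a quick check is to hit the standard OpenAI-compatible model listing endpoint that vLLM serves; this also works as a wake-up request. A minimal sketch:

```python
from openai import OpenAI

client = OpenAI(
    base_url="https://plarnholt-excom-ai-demo.hf.space/v1",
    api_key="not-needed",
)

# GET /v1/models: a successful response means the Space is up and serving
for model in client.models.list().data:
    print(model.id)
```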