VibecoderMcSwaggins committed on
Commit 8c0ec2b · 1 Parent(s): 534dece

fix(P0): Implement proper AIFunction serialization for HuggingFace


Root cause: AIFunction objects from Microsoft agent-framework were
passed directly to HuggingFace InferenceClient, causing JSON
serialization errors.

Fix:
- Add _convert_tools() to convert AIFunction → OpenAI-compatible JSON
- Add _parse_tool_calls() to convert HF response → FunctionCallContent
- Update both sync and streaming response methods

Verified:
- 307 tests pass (make check)
- Tool serialization: 3 AIFunction → 3 JSON dicts (2128 bytes)
- End-to-end research completes successfully

Closes P0 AIFunction serialization bug for Free Tier.
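The root cause can be reproduced in isolation with plain `json.dumps`. In this sketch, `FakeAIFunction` is a hypothetical stand-in for the framework's `AIFunction` (same `to_dict()` shape as documented in the bug report below), not the real class:

```python
import json

class FakeAIFunction:
    """Stand-in for agent_framework's AIFunction: not JSON-serializable itself."""

    def __init__(self, name: str, description: str, input_model: dict):
        self.name = name
        self.description = description
        self.input_model = input_model

    def to_dict(self) -> dict:
        return {
            "type": "ai_function",
            "name": self.name,
            "description": self.description,
            "input_model": self.input_model,
        }

def convert_tool(tool) -> dict:
    """Map the to_dict() payload onto the OpenAI-style tool schema."""
    t = tool.to_dict()
    return {
        "type": "function",
        "function": {
            "name": t["name"],
            "description": t.get("description", ""),
            "parameters": t["input_model"],
        },
    }

tool = FakeAIFunction(
    "search_pubmed",
    "Search PubMed",
    {"type": "object", "properties": {"query": {"type": "string"}}, "required": ["query"]},
)

try:
    json.dumps({"tools": [tool]})  # what the HF client effectively attempted
except TypeError as e:
    print(f"before fix: {e}")

payload = json.dumps({"tools": [convert_tool(tool)]})  # serializes cleanly
print("after fix:", len(payload), "bytes")
```

The object fails `json.dumps` until it is flattened into plain dicts, which is exactly what the `_convert_tools()` added in this commit does.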

docs/bugs/P0_AIFUNCTION_NOT_JSON_SERIALIZABLE.md CHANGED
@@ -1,7 +1,7 @@
 # P0 Bug: AIFunction Not JSON Serializable (Free Tier Broken)
 
 **Severity**: P0 (Critical) - Free Tier cannot perform research
-**Status**: Open
+**Status**: In Progress
 **Discovered**: 2025-12-01
 **Reporter**: Production user via HuggingFace Spaces
 
@@ -47,14 +47,6 @@ TypeError: Object of type AIFunction is not JSON serializable
 4. `requests.post()` internally calls `json.dumps()` on the request body
 5. `AIFunction` has no `__json__()` method or isn't a dict → TypeError
 
-### The Warning We Ignored
-
-The agent framework already warned us:
-```
-[WARNING] The provided chat client does not support function invoking,
-this might limit agent capabilities.
-```
-
 ## Impact
 
 | Component | Impact |
@@ -63,104 +55,107 @@ this might limit agent capabilities.
 | Advanced Mode without API key | **Cannot do research** |
 | Paid Tier (OpenAI) | Unaffected (OpenAI handles AIFunction) |
 
-## Proposed Solutions
-
-### Option 1: Disable Tools for HuggingFace (QUICK FIX)
-
-Pass `tools=None` to disable function calling entirely:
-
-```python
-# src/clients/huggingface.py
-
-async def _inner_get_response(self, ...):
-    hf_messages = self._convert_messages(messages)
+## Professional Fix (Full Implementation)
 
-    # QUICK FIX: Disable tools - HuggingFace free tier doesn't reliably support them
-    # The agents will use natural language instructions instead
-    tools = None  # Was: chat_options.tools if chat_options.tools else None
-    hf_tool_choice = None
-    ...
-```
+Qwen2.5-72B-Instruct **SUPPORTS** function calling via HuggingFace. The fix requires:
 
-**Pros**:
-- 5-minute fix
-- No serialization errors
-- Agents still work via natural language instructions
+1. **Request Serialization**: Convert `AIFunction` → OpenAI-compatible JSON
+2. **Response Parsing**: Convert HuggingFace `tool_calls` → Framework `FunctionCallContent`
 
-**Cons**:
-- Agents can't use structured tool calls
-- Less precise than function calling
-- Qwen2.5-72B DOES support function calling (we're not using it)
-
-### Option 2: Convert AIFunction to JSON Schema (PROPER FIX)
-
-Serialize `AIFunction` objects to OpenAI-compatible tool format:
+### Part 1: Tool Serialization (`_convert_tools`)
 
 ```python
 def _convert_tools(self, tools: list[Any] | None) -> list[dict[str, Any]] | None:
-    """Convert AIFunction objects to JSON-serializable tool definitions."""
+    """Convert AIFunction objects to OpenAI-compatible tool definitions.
+
+    AIFunction.to_dict() returns:
+        {'type': 'ai_function', 'name': '...', 'description': '...', 'input_model': {...}}
+
+    OpenAI/HuggingFace expects:
+        {'type': 'function', 'function': {'name': '...', 'description': '...', 'parameters': {...}}}
+    """
     if not tools:
         return None
 
     json_tools = []
     for tool in tools:
         if hasattr(tool, 'to_dict'):
-            # AIFunction.to_dict() returns JSON-serializable dict
-            json_tools.append(tool.to_dict())
-        elif hasattr(tool, 'schema'):
-            # Alternative: use schema property
+            t_dict = tool.to_dict()
             json_tools.append({
                 "type": "function",
                 "function": {
-                    "name": tool.name,
-                    "description": tool.description,
-                    "parameters": tool.schema,
+                    "name": t_dict["name"],
+                    "description": t_dict.get("description", ""),
+                    "parameters": t_dict["input_model"]
                 }
            })
+        elif isinstance(tool, dict):
+            json_tools.append(tool)
         else:
-            # Fallback: skip unknown tool types
            logger.warning(f"Skipping non-serializable tool: {type(tool)}")
 
    return json_tools if json_tools else None
 ```
 
-**Pros**:
-- Proper function calling with Qwen2.5
-- Structured tool invocation
-- Better agent capabilities
-
-**Cons**:
-- More complex
-- Need to handle tool call responses
-- May require testing with different HF models
+### Part 2: Response Parsing (Tool Calls → FunctionCallContent)
 
-### Option 3: Hybrid Approach (RECOMMENDED)
-
-Try to convert tools, fall back to None if it fails:
+When HuggingFace returns tool calls, we must convert them to the framework's format:
 
 ```python
-def _convert_tools(self, tools: list[Any] | None) -> list[dict[str, Any]] | None:
-    """Attempt to convert tools to JSON, disable if conversion fails."""
-    if not tools:
-        return None
-
-    try:
-        json_tools = []
-        for tool in tools:
-            if hasattr(tool, 'to_dict'):
-                json_tools.append(tool.to_dict())
-            elif isinstance(tool, dict):
-                json_tools.append(tool)
-        return json_tools if json_tools else None
-    except Exception as e:
-        logger.warning(f"Tool conversion failed, disabling function calling: {e}")
-        return None
+from agent_framework._types import FunctionCallContent
+
+# In _inner_get_response, after getting the response:
+choice = choices[0]
+message = choice.message
+message_content = message.content or ""
+
+# Parse tool calls if present
+contents: list[Any] = []
+if hasattr(message, 'tool_calls') and message.tool_calls:
+    for tc in message.tool_calls:
+        # HF returns: tc.id, tc.function.name, tc.function.arguments
+        contents.append(FunctionCallContent(
+            call_id=tc.id,
+            name=tc.function.name,
+            arguments=tc.function.arguments  # JSON string or dict
+        ))
+
+response_msg = ChatMessage(
+    role=cast(Any, message.role),
+    text=message_content,
+    contents=contents if contents else None
+)
 ```
 
-## Recommended Fix
+### Verified Schema Mapping
 
-**Immediate (P0)**: Option 1 - Disable tools with `tools=None`
-**Follow-up**: Option 3 - Implement proper conversion with fallback
+```python
+# AIFunction.to_dict() output (verified 2025-12-01):
+{
+    "type": "ai_function",
+    "name": "search_pubmed",
+    "description": "Search PubMed for biomedical research papers...",
+    "input_model": {
+        "properties": {"query": {"title": "Query", "type": "string"}, ...},
+        "required": ["query"],
+        "type": "object"
+    }
+}
+
+# Mapped to OpenAI format:
+{
+    "type": "function",
+    "function": {
+        "name": "search_pubmed",
+        "description": "Search PubMed for biomedical research papers...",
+        "parameters": {
+            "properties": {"query": {"title": "Query", "type": "string"}, ...},
+            "required": ["query"],
+            "type": "object"
+        }
+    }
+}
+```
 
 ## Call Stack Trace
 
@@ -197,18 +192,17 @@ from src.orchestrators.advanced import AdvancedOrchestrator
 async def test():
     orch = AdvancedOrchestrator(max_rounds=2)
     async for event in orch.run('testosterone benefits'):
-        print(f'[{event.type}] {event.message[:50]}...')
+        print(f'[{event.type}] {str(event.message)[:50]}...')
 
 asyncio.run(test())
 "
 
-# Expected: TypeError: Object of type AIFunction is not JSON serializable
-# After fix: Should complete without serialization errors
+# Expected BEFORE fix: TypeError: Object of type AIFunction is not JSON serializable
+# Expected AFTER fix: Research completes with tool calls working
 ```
 
 ## References
 
-- [Microsoft Agent Framework - AIFunction](https://learn.microsoft.com/en-us/python/api/agent-framework-core/agent_framework.aifunction)
-- [HuggingFace Chat Completion API](https://huggingface.co/docs/api-inference/en/tasks/chat-completion)
+- [HuggingFace Chat Completion - Function Calling](https://huggingface.co/docs/inference-providers/tasks/chat-completion)
 - [Qwen Function Calling](https://qwen.readthedocs.io/en/latest/framework/function_call.html)
-- [huggingface_hub chat_completion](https://github.com/huggingface/huggingface_hub/releases/tag/v0.22.0)
+- [Microsoft Agent Framework - AIFunction](https://learn.microsoft.com/en-us/python/api/agent-framework-core/agent_framework.aifunction)
src/clients/huggingface.py CHANGED
@@ -18,6 +18,7 @@ from agent_framework import (
     ChatResponse,
     ChatResponseUpdate,
 )
+from agent_framework._types import FunctionCallContent
 from huggingface_hub import InferenceClient
 
 from src.utils.config import settings
@@ -26,7 +27,7 @@ logger = structlog.get_logger()
 
 
 class HuggingFaceChatClient(BaseChatClient):  # type: ignore[misc]
-    """Adapter for HuggingFace Inference API."""
+    """Adapter for HuggingFace Inference API with full function calling support."""
 
     def __init__(
         self,
@@ -69,6 +70,69 @@ class HuggingFaceChatClient(BaseChatClient):  # type: ignore[misc]
         hf_messages.append({"role": role_str, "content": content})
         return hf_messages
 
+    def _convert_tools(self, tools: list[Any] | None) -> list[dict[str, Any]] | None:
+        """Convert AIFunction objects to OpenAI-compatible tool definitions.
+
+        AIFunction.to_dict() returns:
+            {'type': 'ai_function', 'name': '...', 'input_model': {...}}
+
+        OpenAI/HuggingFace expects:
+            {'type': 'function', 'function': {'name': '...', 'parameters': {...}}}
+        """
+        if not tools:
+            return None
+
+        json_tools = []
+        for tool in tools:
+            if hasattr(tool, "to_dict"):
+                try:
+                    t_dict = tool.to_dict()
+                    json_tools.append(
+                        {
+                            "type": "function",
+                            "function": {
+                                "name": t_dict["name"],
+                                "description": t_dict.get("description", ""),
+                                "parameters": t_dict["input_model"],
+                            },
+                        }
+                    )
+                except (KeyError, TypeError) as e:
+                    logger.warning("Failed to convert tool", tool=str(tool), error=str(e))
+            elif isinstance(tool, dict):
+                # Already a dict - assume correct format
+                json_tools.append(tool)
+            else:
+                logger.warning("Skipping non-serializable tool", tool_type=str(type(tool)))
+
+        return json_tools if json_tools else None
+
+    def _parse_tool_calls(self, message: Any) -> list[FunctionCallContent]:
+        """Parse HuggingFace tool_calls into framework FunctionCallContent.
+
+        HF returns tool_calls as:
+            [ChatCompletionOutputToolCall(id='...', function=ChatCompletionOutputFunctionDefinition(
+                name='...', arguments='{"key": "value"}'), type='function')]
+        """
+        contents: list[FunctionCallContent] = []
+
+        if not hasattr(message, "tool_calls") or not message.tool_calls:
+            return contents
+
+        for tc in message.tool_calls:
+            try:
+                contents.append(
+                    FunctionCallContent(
+                        call_id=tc.id,
+                        name=tc.function.name,
+                        arguments=tc.function.arguments,  # JSON string or dict
+                    )
+                )
+            except (AttributeError, TypeError) as e:
+                logger.warning("Failed to parse tool call", error=str(e))
+
+        return contents
+
     async def _inner_get_response(
         self,
         *,
@@ -79,12 +143,13 @@ class HuggingFaceChatClient(BaseChatClient):  # type: ignore[misc]
         """Synchronous response generation using chat_completion."""
         hf_messages = self._convert_messages(messages)
 
-        # Extract tool configuration
-        tools = chat_options.tools if chat_options.tools else None
+        # Convert AIFunction objects to OpenAI-compatible JSON
+        tools = self._convert_tools(chat_options.tools if chat_options.tools else None)
+
         # HF expects 'tool_choice' to be 'auto', 'none', or specific tool
         # Framework uses ToolMode enum or dict
         hf_tool_choice: str | None = None
-        if chat_options.tool_choice is not None:
+        if tools and chat_options.tool_choice is not None:
             tool_choice_str = str(chat_options.tool_choice)
             if "AUTO" in tool_choice_str:
                 hf_tool_choice = "auto"
@@ -116,12 +181,17 @@ class HuggingFaceChatClient(BaseChatClient):  # type: ignore[misc]
             return ChatResponse(messages=[], response_id="error-no-choices")
 
         choice = choices[0]
-        message_content = choice.message.content or ""
+        message = choice.message
+        message_content = message.content or ""
 
-        # Construct response message with proper kwargs
+        # Parse tool calls if present
+        tool_call_contents = self._parse_tool_calls(message)
+
+        # Construct response message with tool calls in contents
         response_msg = ChatMessage(
-            role=cast(Any, choice.message.role),
+            role=cast(Any, message.role),
             text=message_content,
+            contents=tool_call_contents if tool_call_contents else None,
         )
 
         return ChatResponse(
@@ -143,9 +213,11 @@ class HuggingFaceChatClient(BaseChatClient):  # type: ignore[misc]
         """Streaming response generation."""
         hf_messages = self._convert_messages(messages)
 
-        tools = chat_options.tools if chat_options.tools else None
+        # Convert AIFunction objects to OpenAI-compatible JSON
+        tools = self._convert_tools(chat_options.tools if chat_options.tools else None)
+
         hf_tool_choice: str | None = None
-        if chat_options.tool_choice is not None:
+        if tools and chat_options.tool_choice is not None:
             if "AUTO" in str(chat_options.tool_choice):
                 hf_tool_choice = "auto"
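The response-parsing half of the fix can be exercised without calling HuggingFace at all. Below, `SimpleNamespace` objects mimic the attribute shape the commit's docstring documents for `ChatCompletionOutputToolCall` (`tc.id`, `tc.function.name`, `tc.function.arguments`); they are stand-ins for illustration, and the dict comprehension mirrors the parsing step minus the framework's `FunctionCallContent` wrapper:

```python
import json
from types import SimpleNamespace

# Mock of the tool-call shape described in _parse_tool_calls' docstring;
# a stand-in for huggingface_hub's response objects, not the real types.
tc = SimpleNamespace(
    id="call_0",
    type="function",
    function=SimpleNamespace(name="search_pubmed", arguments='{"query": "testosterone"}'),
)
message = SimpleNamespace(content=None, tool_calls=[tc])

# Same extraction _parse_tool_calls performs, as plain dicts:
parsed = [
    {"call_id": t.id, "name": t.function.name, "arguments": t.function.arguments}
    for t in (message.tool_calls or [])
]
print(parsed[0]["name"])                  # search_pubmed
print(json.loads(parsed[0]["arguments"]))  # {'query': 'testosterone'}
```

Since `arguments` arrives as a JSON string, downstream tool invocation still needs a `json.loads` before the call, which is why the real implementation leaves decoding to the framework.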