VibecoderMcSwaggins committed
Commit ecaa2e8 · 1 Parent(s): 8c0ec2b

fix(P0): Complete bi-directional tool support for HuggingFace (sync+streaming)
docs/bugs/ACTIVE_BUGS.md CHANGED
@@ -7,81 +7,24 @@
 
 ## P0 - Critical
 
-### P0 - AIFunction Not JSON Serializable (Free Tier Broken)
-**File:** `docs/bugs/P0_AIFUNCTION_NOT_JSON_SERIALIZABLE.md`
-**Found:** 2025-12-01 (HuggingFace Spaces)
-
-**Problem:** Every search round fails with "Object of type AIFunction is not JSON serializable".
-
-**Error:**
-```
-📚 SEARCH_COMPLETE: searcher: Agent searcher: Error processing request -
-Object of type AIFunction is not JSON serializable
-```
-
-**Root Cause:** `HuggingFaceChatClient` passes raw `AIFunction` objects to `InferenceClient.chat_completion()`. When `requests` tries to serialize them to JSON, it fails.
-
-**Impact:** Free Tier cannot do any research. 5 rounds of errors, no results.
-
-**Proposed Fix:** Either:
-1. **Quick**: Disable tools with `tools=None` (agents use natural language)
-2. **Proper**: Convert `AIFunction` to JSON schema before passing to HF API
+(No active P0 bugs)
 
 ---
 
 ## P3 - UX Polish
-
-### P3 - Progress Bar Positioning/Overlap in ChatInterface
-**File:** `docs/bugs/P3_PROGRESS_BAR_POSITIONING.md`
-**Found:** 2025-12-01 (HuggingFace Spaces)
-
-**Problem:** `gr.Progress()` bar renders in strange position inside ChatInterface - floats mid-chat, overlaps text.
-
-**Root Cause:** Mixing two progress mechanisms:
-1. `gr.Progress()` - general purpose, not designed for ChatInterface
-2. `ChatInterface.show_progress` - built-in chat progress
-
-**Recommended Fix:** Remove `gr.Progress()`, rely on emoji status text we already emit. Low priority - UX polish only.
-
----
-
-## P2 - UX Friction
-
-### P2 - Advanced Mode Cold Start Has No User Feedback (✅ FIXED)
-**File:** `docs/bugs/P2_ADVANCED_MODE_COLD_START_NO_FEEDBACK.md`
-**Issue:** [#108](https://github.com/The-Obstacle-Is-The-Way/DeepBoner/issues/108)
-**Found:** 2025-12-01 (Gradio Testing)
-
-**Problem:** Three "dead zones" with no visual feedback during Advanced Mode startup:
-1. **Dead Zone #1** (5-15s): Between STARTED → THINKING ✅ FIXED (granular events)
-2. **Dead Zone #2** (10-30s): Between THINKING → PROGRESS (first LLM call) ✅ FIXED (Progress Bar)
-3. **Dead Zone #3** (30-90s): After PROGRESS (SearchAgent executing) ✅ FIXED (Pre-warming + Progress Bar)
-
-**Phase 1 Fix (commit dbf888c):**
-- Added granular progress events during initialization
-- Users now see "Loading embedding service...", "Initializing research memory...", "Building agent team..."
-- Significantly improves perceived responsiveness
-
-**Phase 2/3 Fix (Latest):**
-- Implemented service pre-warming (`service_loader.warmup_services`)
-- Added native Gradio progress bar (`gr.Progress`) to `research_agent`
-- Visual feedback is now continuous throughout the entire lifecycle
-
----
-
-## P1 - Important
-
-### P1 - Memory Layer Not Integrated (Post-Hackathon)
-**Issue:** [#73](https://github.com/The-Obstacle-Is-The-Way/DeepBoner/issues/73)
-**Spec:** [SPEC_08_INTEGRATE_MEMORY_LAYER.md](../specs/SPEC_08_INTEGRATE_MEMORY_LAYER.md)
-
-**Problem:** Structured memory (hypotheses, conflicts) is isolated in "God Mode" only.
-**Solution:** Extract memory into shared service, integrate into Simple and Advanced modes.
-**Status:** Spec written. Blocked until post-hackathon.
-
----
-
-## Resolved Bugs
+...
+## Resolved Bugs
 
+### ~~P0 - AIFunction Not JSON Serializable~~ FIXED
+**File:** `docs/bugs/P0_AIFUNCTION_NOT_JSON_SERIALIZABLE.md`
+**Found:** 2025-12-01
+**Resolved:** 2025-12-01
+
+- Problem: `HuggingFaceChatClient` crashed with "Object of type AIFunction is not JSON serializable".
+- Fix: Implemented full bi-directional tool support:
+  1. **Serialization**: Added `_convert_tools` (AIFunction → OpenAI JSON)
+  2. **Parsing (Sync/Async)**: Added `_parse_tool_calls` and streaming accumulator
+- Result: Free Tier now supports full function calling capabilities with Qwen2.5-72B.
 
 ### ~~P1 - HuggingFace Router 401 Unauthorized~~ FIXED
 **File:** `docs/bugs/P1_HUGGINGFACE_ROUTER_401_HYPERBOLIC.md`
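The `_convert_tools` serialization step named in the resolved P0 entry above can be sketched as follows. This is a minimal, hypothetical reconstruction: the `AIFunction` dataclass here is a stand-in for the framework's real type, and `convert_tools` is illustrative, not the project's actual method.

```python
import json
from dataclasses import dataclass, field
from typing import Any


# Hypothetical stand-in: the real AIFunction comes from the agent framework
# and carries a name, description, and JSON-schema parameters.
@dataclass
class AIFunction:
    name: str
    description: str
    parameters: dict[str, Any] = field(default_factory=dict)


def convert_tools(tools: list[AIFunction]) -> list[dict[str, Any]]:
    """Map AIFunction objects to plain dicts in the OpenAI `tools` shape.

    Illustrative reconstruction of what `_convert_tools` does; the real
    method's signature and field mapping may differ.
    """
    return [
        {
            "type": "function",
            "function": {
                "name": t.name,
                "description": t.description,
                "parameters": t.parameters,
            },
        }
        for t in tools
    ]


search_pubmed = AIFunction(
    name="search_pubmed",
    description="Search PubMed for articles matching a query.",
    parameters={
        "type": "object",
        "properties": {"query": {"type": "string"}},
        "required": ["query"],
    },
)

payload = convert_tools([search_pubmed])
# Plain dicts serialize cleanly, so the original TypeError no longer occurs.
serialized = json.dumps(payload)
```

Passing pre-serialized dicts instead of live objects is why the "not JSON serializable" crash disappears: `requests` only ever sees primitives.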
docs/bugs/P0_AIFUNCTION_NOT_JSON_SERIALIZABLE.md CHANGED
@@ -1,8 +1,9 @@
 # P0 Bug: AIFunction Not JSON Serializable (Free Tier Broken)
 
 **Severity**: P0 (Critical) - Free Tier cannot perform research
-**Status**: In Progress
+**Status**: RESOLVED
 **Discovered**: 2025-12-01
+**Resolved**: 2025-12-01
 **Reporter**: Production user via HuggingFace Spaces
 
 ## Symptom
@@ -201,6 +202,22 @@ asyncio.run(test())
 # Expected AFTER fix: Research completes with tool calls working
 ```
 
+## Resolution
+
+Implemented full function calling support for the HuggingFace client:
+
+1. **Request Serialization**: Added `_convert_tools` to map `AIFunction` schemas to OpenAI-compatible JSON.
+2. **Response Parsing (Sync)**: Added `_parse_tool_calls` to convert HF `tool_calls` to `FunctionCallContent`.
+3. **Response Parsing (Async)**: Implemented a tool call accumulator in `_inner_get_streaming_response` to handle partial tool call deltas and yield valid `FunctionCallContent` objects.
+
+## Verification
+
+Verified with unit tests and manual simulation:
+
+1. **Serialization**: Confirmed `AIFunction` -> JSON conversion works for `search_pubmed`.
+2. **Streaming**: Verified that fragmented tool call deltas (e.g., `{"query":` then `"testosterone"}`) are correctly reassembled into a single `FunctionCallContent`.
+3. **Integration**: Passed project-level `make check`.
+
 ## References
 
 - [HuggingFace Chat Completion - Function Calling](https://huggingface.co/docs/inference-providers/tasks/chat-completion)
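The delta reassembly described in the Verification section can be exercised in isolation. In this self-contained sketch, plain dicts stand in for the HF stream delta objects, and `accumulate_tool_call_deltas` mirrors the accumulator logic rather than calling the project's code.

```python
from typing import Any


def accumulate_tool_call_deltas(
    deltas: list[dict[str, Any]],
) -> dict[int, dict[str, str]]:
    """Reassemble partial streaming tool-call deltas, keyed by index.

    Mirrors the accumulator described in the Resolution; plain dicts
    stand in for the HF stream delta objects.
    """
    acc: dict[int, dict[str, str]] = {}
    for d in deltas:
        # Each delta targets one tool call slot, identified by its index.
        slot = acc.setdefault(d["index"], {"id": "", "name": "", "arguments": ""})
        if d.get("id"):
            slot["id"] += d["id"]
        fn = d.get("function") or {}
        if fn.get("name"):
            slot["name"] += fn["name"]
        if fn.get("arguments"):
            # Argument JSON arrives fragmented across chunks; concatenate.
            slot["arguments"] += fn["arguments"]
    return acc


# Fragmented arguments, as in the Verification example above.
deltas = [
    {"index": 0, "id": "call_1", "function": {"name": "search_pubmed"}},
    {"index": 0, "function": {"arguments": '{"query":'}},
    {"index": 0, "function": {"arguments": ' "testosterone"}'}},
]
calls = accumulate_tool_call_deltas(deltas)
# calls[0]["arguments"] is now the complete JSON string
```

The key invariant is that fragments for one call always share an `index`, so string concatenation in arrival order reproduces the full argument JSON.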
src/clients/huggingface.py CHANGED
@@ -240,6 +240,10 @@ class HuggingFaceChatClient(BaseChatClient):  # type: ignore[misc]
 
         stream = await asyncio.to_thread(call_fn)
 
+        # Accumulator for tool calls (index -> dict)
+        # We need to accumulate because deltas are partial
+        tool_call_accumulator: dict[int, dict[str, Any]] = {}
+
         for chunk in stream:
             # Chunk is ChatCompletionStreamOutput
             if not chunk.choices:
@@ -247,15 +251,56 @@ class HuggingFaceChatClient(BaseChatClient):  # type: ignore[misc]
             choice = chunk.choices[0]
             delta = choice.delta
 
-            # Convert to ChatResponseUpdate
-            yield ChatResponseUpdate(
-                role=cast(Any, delta.role) if delta.role else None,
-                content=delta.content,
-            )
+            # 1. Handle Text Content
+            if delta.content:
+                yield ChatResponseUpdate(
+                    role=cast(Any, delta.role) if delta.role else None,
+                    text=delta.content,
+                )
+
+            # 2. Handle Tool Calls (Accumulate)
+            if hasattr(delta, "tool_calls") and delta.tool_calls:
+                for tc in delta.tool_calls:
+                    idx = tc.index
+                    if idx not in tool_call_accumulator:
+                        tool_call_accumulator[idx] = {
+                            "id": "",
+                            "name": "",
+                            "arguments": "",
+                        }
+
+                    # Merge fields
+                    if tc.id:
+                        tool_call_accumulator[idx]["id"] += tc.id
+                    if tc.function:
+                        if tc.function.name:
+                            tool_call_accumulator[idx]["name"] += tc.function.name
+                        if tc.function.arguments:
+                            tool_call_accumulator[idx]["arguments"] += tc.function.arguments
 
             # Yield control to event loop
             await asyncio.sleep(0)
 
+        # 3. Yield Accumulated Tool Calls
+        if tool_call_accumulator:
+            contents: list[FunctionCallContent] = []
+            for idx in sorted(tool_call_accumulator.keys()):
+                tc_data = tool_call_accumulator[idx]
+                # Only yield if ID and Name are present (required by FunctionCallContent)
+                if tc_data["id"] and tc_data["name"]:
+                    contents.append(
+                        FunctionCallContent(
+                            call_id=tc_data["id"],
+                            name=tc_data["name"],
+                            arguments=tc_data["arguments"],
+                        )
+                    )
+
+            if contents:
+                yield ChatResponseUpdate(
+                    contents=contents,
+                )
+
     except Exception as e:
         logger.error("HuggingFace Streaming error", error=str(e))
         raise
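The sync-path `_parse_tool_calls` is referenced by this commit but its body is not shown in the diff. A plausible sketch, under the assumption that it mirrors the OpenAI response shape: the `FunctionCallContent` dataclass below is a stand-in for the framework's type, and plain dicts replace the `InferenceClient` response objects.

```python
from dataclasses import dataclass
from typing import Any


# Minimal stand-in for the framework's FunctionCallContent type.
@dataclass
class FunctionCallContent:
    call_id: str
    name: str
    arguments: str


def parse_tool_calls(message: dict[str, Any]) -> list[FunctionCallContent]:
    """Sketch of the sync-path `_parse_tool_calls`.

    Maps the `tool_calls` on a completed chat message to
    FunctionCallContent objects; the real method operates on
    InferenceClient response types, not dicts.
    """
    return [
        FunctionCallContent(
            call_id=tc["id"],
            name=tc["function"]["name"],
            arguments=tc["function"].get("arguments", ""),
        )
        for tc in message.get("tool_calls") or []
    ]


message = {
    "role": "assistant",
    "tool_calls": [
        {
            "id": "call_1",
            "function": {
                "name": "search_pubmed",
                "arguments": '{"query": "testosterone"}',
            },
        }
    ],
}
parsed = parse_tool_calls(message)
```

Unlike the streaming path, no accumulation is needed here: a completed (non-streamed) response carries each tool call's full argument string in one place.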