DeepBoner / docs /bugs /archive /P1_HUGGINGFACE_ROUTER_401_HYPERBOLIC.md

VibecoderMcSwaggins

docs: Organize and archive resolved bug documentation

f9f62d4 about 1 month ago

P1 Bug: HuggingFace Router 401 Unauthorized

Severity: P1 (High)
Status: RESOLVED
Discovered: 2025-12-01
Resolved: 2025-12-01
Reporter: Production user via HuggingFace Spaces

Symptom

401 Client Error: Unauthorized for url:
https://router.huggingface.co/hyperbolic/v1/chat/completions
Invalid username or password.
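
When triaging an error like this, it can help to see which partner provider the router forwarded the request to. The sketch below is an illustration only: it assumes the first path segment after router.huggingface.co names the provider, as it appears to in the URL above. The helper name router_provider is hypothetical and not part of this codebase.

```python
from typing import Optional
from urllib.parse import urlparse


def router_provider(url: str) -> Optional[str]:
    """Return the provider segment of an HF router URL, or None.

    Assumption: the first path segment identifies the partner provider,
    as in https://router.huggingface.co/hyperbolic/v1/chat/completions.
    """
    parsed = urlparse(url)
    if parsed.hostname != "router.huggingface.co":
        return None  # not a router URL, so nothing to extract
    segments = [s for s in parsed.path.split("/") if s]
    return segments[0] if segments else None


if __name__ == "__main__":
    failing_url = "https://router.huggingface.co/hyperbolic/v1/chat/completions"
    print(router_provider(failing_url))
```

A 401 from any provider segment points back at the same HF_TOKEN, so the provider name is useful for log context rather than for choosing a different fix.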

Root Cause

The HF_TOKEN in .env and HuggingFace Spaces secrets was invalid/expired.

Token hf_ssayg... failed HfApi().whoami() verification.

Resolution

Generated new HF_TOKEN at https://huggingface.co/settings/tokens
Updated .env with new token: hf_gZVBI...
Updated HuggingFace Spaces secret with same token
Switched default model from meta-llama/Llama-3.1-70B-Instruct to Qwen/Qwen2.5-72B-Instruct (better reliability via HF router)

Verification

uv run python -c "
import os
from huggingface_hub import InferenceClient, HfApi

token = os.environ['HF_TOKEN']  # Must be set in the environment (same value as in .env)
api = HfApi(token=token)
print(f'Token valid: {api.whoami()[\"name\"]}')

client = InferenceClient(model='Qwen/Qwen2.5-72B-Instruct', token=token)
response = client.chat_completion(messages=[{'role': 'user', 'content': '2+2=?'}], max_tokens=10)
print(f'Inference works: {response.choices[0].message.content}')
"
# Output:
# Token valid: VibecoderMcSwaggins
# Inference works: 4

Lessons Learned

First-principles debugging: Before adding complex "fixes", verify basic assumptions (is the token actually valid?)
Token expiration: HuggingFace tokens can expire or become invalid. Confirm the token with HfApi().whoami() before debugging anything else.
Model routing: HuggingFace routes large models to partner providers (Hyperbolic, Novita). All require valid auth.
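
One way to apply the first two lessons is a fail-fast check that runs before any inference call. The following is a minimal sketch, not the project's actual code: verify_hf_token and TokenCheckError are hypothetical names, and the whoami parameter exists only so the check can be exercised without network access. By default it falls back to HfApi(token=...).whoami(), the same call used in the Verification section.

```python
import os
from typing import Callable, Mapping, Optional


class TokenCheckError(RuntimeError):
    """Raised when HF_TOKEN is missing or rejected (hypothetical helper type)."""


def verify_hf_token(
    env: Mapping[str, str] = os.environ,
    whoami: Optional[Callable[[str], dict]] = None,
) -> str:
    """Return the account name for HF_TOKEN, or raise TokenCheckError."""
    token = env.get("HF_TOKEN", "").strip()
    if not token:
        raise TokenCheckError("HF_TOKEN is not set")

    if whoami is None:
        # Imported lazily so the function can be tested without the library.
        from huggingface_hub import HfApi

        def whoami(t: str) -> dict:
            return HfApi(token=t).whoami()

    try:
        info = whoami(token)
    except Exception as exc:
        # Log only a short prefix of the token, never the full secret.
        raise TokenCheckError(f"HF_TOKEN {token[:8]}... was rejected: {exc}") from exc
    return info["name"]


if __name__ == "__main__":
    try:
        print(f"Token valid: {verify_hf_token()}")
    except TokenCheckError as err:
        print(f"Token check failed: {err}")
```

Calling a check like this at application startup would surface an expired token as a clear configuration error instead of a 401 from a downstream provider.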

Files Changed

src/utils/config.py: Changed default model to Qwen/Qwen2.5-72B-Instruct
src/clients/huggingface.py: Updated fallback model reference
src/agent_factory/judges.py: Updated fallback model reference
src/orchestrators/langgraph_orchestrator.py: Updated hardcoded model
CLAUDE.md, AGENTS.md, GEMINI.md: Updated documentation