Spaces:
Paused
docs: π update documentation to reflect gemini 2.5 removal and claude sonnet dual-mode support
Browse filesThis commit updates both README.md and DOCUMENTATION.md to accurately reflect recent changes to the Antigravity provider:
- Remove all references to Gemini 2.5 models (Pro/Flash) as they are no longer supported
- Document Claude Sonnet 4.5's dual-mode capability (thinking and non-thinking variants)
- Add provider support section explaining credential prioritization implementation for both Gemini CLI and Antigravity providers
- Clarify that Claude Opus 4.5 only supports thinking mode
- Update model-specific logic documentation to reflect current architecture (Gemini 3, Claude Sonnet, Claude Opus)
- Add credential tier reset timing details (paid tier: 5 hours, free tier: weekly)
- Remove outdated "NEW" badges and function call response pairing references
- DOCUMENTATION.md +15 -10
- README.md +5 -4
|
@@ -361,6 +361,13 @@ def get_model_tier_requirement(self, model: str) -> Optional[int]:
|
|
| 361 |
return None # All other models have no restrictions
|
| 362 |
```
|
| 363 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 364 |
**Usage Manager Integration:**
|
| 365 |
|
| 366 |
The `acquire_key()` method has been enhanced to:
|
|
@@ -391,22 +398,18 @@ A modular, shared caching system for providers to persist conversation state acr
|
|
| 391 |
|
| 392 |
### 3.5. Antigravity (`antigravity_provider.py`)
|
| 393 |
|
| 394 |
-
The most sophisticated provider implementation, supporting Google's internal Antigravity API for Gemini and Claude models (including **Claude Opus 4.5**, Anthropic's most powerful model).
|
| 395 |
|
| 396 |
#### Architecture
|
| 397 |
|
| 398 |
- **Unified Streaming/Non-Streaming**: Single code path handles both response types with optimal transformations
|
| 399 |
- **Thought Signature Caching**: Server-side caching of encrypted signatures for multi-turn Gemini 3 conversations
|
| 400 |
-
- **Model-Specific Logic**: Automatic configuration based on model type (Gemini
|
|
|
|
| 401 |
|
| 402 |
#### Model Support
|
| 403 |
|
| 404 |
-
**Gemini
|
| 405 |
-
- Uses `thinkingBudget` parameter (integer tokens: -1 for auto, 0 to disable, or specific value)
|
| 406 |
-
- Standard safety settings and toolConfig
|
| 407 |
-
- Stream processing with thinking content separation
|
| 408 |
-
|
| 409 |
-
**Gemini 3 (Pro/Image):**
|
| 410 |
- Uses `thinkingLevel` parameter (string: "low" or "high")
|
| 411 |
- **Tool Hallucination Prevention**:
|
| 412 |
- Automatic system instruction injection explaining custom tool schema rules
|
|
@@ -427,8 +430,10 @@ The most sophisticated provider implementation, supporting Google's internal Ant
|
|
| 427 |
- Increased default max output tokens to 64000 to accommodate thinking output
|
| 428 |
|
| 429 |
**Claude Sonnet 4.5:**
|
| 430 |
-
- Proxied through Antigravity API
|
| 431 |
-
-
|
|
|
|
|
|
|
| 432 |
- **Thinking Preservation**: Caches thinking content using composite keys (tool_call_id + text_hash)
|
| 433 |
- **Schema Cleaning**: Removes unsupported properties (`$schema`, `additionalProperties`, `const` β `enum`)
|
| 434 |
|
|
|
|
| 361 |
return None # All other models have no restrictions
|
| 362 |
```
|
| 363 |
|
| 364 |
+
**Provider Support:**
|
| 365 |
+
|
| 366 |
+
The following providers implement credential prioritization:
|
| 367 |
+
|
| 368 |
+
- **Gemini CLI**: Paid tier (priority 1), Free tier (priority 2), Legacy/Unknown (priority 10). Gemini 3 models require paid tier.
|
| 369 |
+
- **Antigravity**: Same priority system as Gemini CLI. No model-tier restrictions (all models work on all tiers). Paid tier resets every 5 hours, free tier resets weekly.
|
| 370 |
+
|
| 371 |
**Usage Manager Integration:**
|
| 372 |
|
| 373 |
The `acquire_key()` method has been enhanced to:
|
|
|
|
| 398 |
|
| 399 |
### 3.5. Antigravity (`antigravity_provider.py`)
|
| 400 |
|
| 401 |
+
The most sophisticated provider implementation, supporting Google's internal Antigravity API for Gemini 3 and Claude models (including **Claude Opus 4.5**, Anthropic's most powerful model).
|
| 402 |
|
| 403 |
#### Architecture
|
| 404 |
|
| 405 |
- **Unified Streaming/Non-Streaming**: Single code path handles both response types with optimal transformations
|
| 406 |
- **Thought Signature Caching**: Server-side caching of encrypted signatures for multi-turn Gemini 3 conversations
|
| 407 |
+
- **Model-Specific Logic**: Automatic configuration based on model type (Gemini 3, Claude Sonnet, Claude Opus)
|
| 408 |
+
- **Credential Prioritization**: Automatic tier detection with paid credentials prioritized over free (paid tier resets every 5 hours, free tier resets weekly)
|
| 409 |
|
| 410 |
#### Model Support
|
| 411 |
|
| 412 |
+
**Gemini 3 Pro:**
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 413 |
- Uses `thinkingLevel` parameter (string: "low" or "high")
|
| 414 |
- **Tool Hallucination Prevention**:
|
| 415 |
- Automatic system instruction injection explaining custom tool schema rules
|
|
|
|
| 430 |
- Increased default max output tokens to 64000 to accommodate thinking output
|
| 431 |
|
| 432 |
**Claude Sonnet 4.5:**
|
| 433 |
+
- Proxied through Antigravity API
|
| 434 |
+
- **Supports both thinking and non-thinking modes**:
|
| 435 |
+
- With `reasoning_effort`: Uses `claude-sonnet-4-5-thinking` variant with `thinkingBudget`
|
| 436 |
+
- Without `reasoning_effort`: Uses standard `claude-sonnet-4-5` variant
|
| 437 |
- **Thinking Preservation**: Caches thinking content using composite keys (tool_call_id + text_hash)
|
| 438 |
- **Schema Cleaning**: Removes unsupported properties (`$schema`, `additionalProperties`, `const` β `enum`)
|
| 439 |
|
|
@@ -28,13 +28,14 @@ This project provides a powerful solution for developers building complex applic
|
|
| 28 |
- **OpenAI-Compatible Proxy**: Offers a familiar API interface with additional endpoints for model and provider discovery.
|
| 29 |
- **Advanced Model Filtering**: Supports both blacklists and whitelists to give you fine-grained control over which models are available through the proxy.
|
| 30 |
|
| 31 |
-
- **π Antigravity Provider**: Full support for Google's internal Antigravity API, providing access to Gemini
|
| 32 |
-
- **π
|
| 33 |
-
- Claude Sonnet 4.5
|
|
|
|
|
|
|
| 34 |
- Thought signature caching for multi-turn conversations
|
| 35 |
- Tool hallucination prevention via parameter signature injection
|
| 36 |
- Automatic thinking block sanitization for Claude models (with recovery strategies)
|
| 37 |
-
- Improved function call response pairing with three-tier matching strategy
|
| 38 |
- Note: Claude thinking mode requires careful conversation state management (see [Antigravity documentation](DOCUMENTATION.md#antigravity-claude-extended-thinking-sanitization) for details)
|
| 39 |
- **π Credential Prioritization**: Automatic tier detection and priority-based credential selection ensures paid-tier credentials are used for premium models that require them.
|
| 40 |
- **π Weighted Random Rotation**: Configurable credential rotation strategy - choose between deterministic (perfect balance) or weighted random (unpredictable, harder to fingerprint) selection.
|
|
|
|
| 28 |
- **OpenAI-Compatible Proxy**: Offers a familiar API interface with additional endpoints for model and provider discovery.
|
| 29 |
- **Advanced Model Filtering**: Supports both blacklists and whitelists to give you fine-grained control over which models are available through the proxy.
|
| 30 |
|
| 31 |
+
- **π Antigravity Provider**: Full support for Google's internal Antigravity API, providing access to Gemini 3 and Claude models with advanced features:
|
| 32 |
+
- **π Claude Opus 4.5** - Anthropic's most powerful model (thinking mode only)
|
| 33 |
+
- **Claude Sonnet 4.5** - Supports both thinking and non-thinking modes
|
| 34 |
+
- **Gemini 3 Pro** - With thinkingLevel support (low/high)
|
| 35 |
+
- Credential prioritization with automatic paid/free tier detection
|
| 36 |
- Thought signature caching for multi-turn conversations
|
| 37 |
- Tool hallucination prevention via parameter signature injection
|
| 38 |
- Automatic thinking block sanitization for Claude models (with recovery strategies)
|
|
|
|
| 39 |
- Note: Claude thinking mode requires careful conversation state management (see [Antigravity documentation](DOCUMENTATION.md#antigravity-claude-extended-thinking-sanitization) for details)
|
| 40 |
- **π Credential Prioritization**: Automatic tier detection and priority-based credential selection ensures paid-tier credentials are used for premium models that require them.
|
| 41 |
- **π Weighted Random Rotation**: Configurable credential rotation strategy - choose between deterministic (perfect balance) or weighted random (unpredictable, harder to fingerprint) selection.
|