Mirrowel commited on
Commit
1d1a62b
Β·
1 Parent(s): df7a756

docs: πŸ“š update documentation to reflect gemini 2.5 removal and claude sonnet dual-mode support

Browse files

This commit updates both README.md and DOCUMENTATION.md to accurately reflect recent changes to the Antigravity provider:

- Remove all references to Gemini 2.5 models (Pro/Flash) as they are no longer supported
- Document Claude Sonnet 4.5's dual-mode capability (thinking and non-thinking variants)
- Add provider support section explaining credential prioritization implementation for both Gemini CLI and Antigravity providers
- Clarify that Claude Opus 4.5 only supports thinking mode
- Update model-specific logic documentation to reflect current architecture (Gemini 3, Claude Sonnet, Claude Opus)
- Add credential tier reset timing details (paid tier: 5 hours, free tier: weekly)
- Remove outdated "NEW" badges and function call response pairing references

Files changed (2) hide show
  1. DOCUMENTATION.md +15 -10
  2. README.md +5 -4
DOCUMENTATION.md CHANGED
@@ -361,6 +361,13 @@ def get_model_tier_requirement(self, model: str) -> Optional[int]:
361
  return None # All other models have no restrictions
362
  ```
363
 
 
 
 
 
 
 
 
364
  **Usage Manager Integration:**
365
 
366
  The `acquire_key()` method has been enhanced to:
@@ -391,22 +398,18 @@ A modular, shared caching system for providers to persist conversation state acr
391
 
392
  ### 3.5. Antigravity (`antigravity_provider.py`)
393
 
394
- The most sophisticated provider implementation, supporting Google's internal Antigravity API for Gemini and Claude models (including **Claude Opus 4.5**, Anthropic's most powerful model).
395
 
396
  #### Architecture
397
 
398
  - **Unified Streaming/Non-Streaming**: Single code path handles both response types with optimal transformations
399
  - **Thought Signature Caching**: Server-side caching of encrypted signatures for multi-turn Gemini 3 conversations
400
- - **Model-Specific Logic**: Automatic configuration based on model type (Gemini 2.5, Gemini 3, Claude)
 
401
 
402
  #### Model Support
403
 
404
- **Gemini 2.5 (Pro/Flash):**
405
- - Uses `thinkingBudget` parameter (integer tokens: -1 for auto, 0 to disable, or specific value)
406
- - Standard safety settings and toolConfig
407
- - Stream processing with thinking content separation
408
-
409
- **Gemini 3 (Pro/Image):**
410
  - Uses `thinkingLevel` parameter (string: "low" or "high")
411
  - **Tool Hallucination Prevention**:
412
  - Automatic system instruction injection explaining custom tool schema rules
@@ -427,8 +430,10 @@ The most sophisticated provider implementation, supporting Google's internal Ant
427
  - Increased default max output tokens to 64000 to accommodate thinking output
428
 
429
  **Claude Sonnet 4.5:**
430
- - Proxied through Antigravity API (uses internal model name `claude-sonnet-4-5-thinking`)
431
- - Uses `thinkingBudget` parameter like Gemini 2.5
 
 
432
  - **Thinking Preservation**: Caches thinking content using composite keys (tool_call_id + text_hash)
433
  - **Schema Cleaning**: Removes unsupported properties (`$schema`, `additionalProperties`, `const` β†’ `enum`)
434
 
 
361
  return None # All other models have no restrictions
362
  ```
363
 
364
+ **Provider Support:**
365
+
366
+ The following providers implement credential prioritization:
367
+
368
+ - **Gemini CLI**: Paid tier (priority 1), Free tier (priority 2), Legacy/Unknown (priority 10). Gemini 3 models require paid tier.
369
+ - **Antigravity**: Same priority system as Gemini CLI. No model-tier restrictions (all models work on all tiers). Paid tier resets every 5 hours, free tier resets weekly.
370
+
371
  **Usage Manager Integration:**
372
 
373
  The `acquire_key()` method has been enhanced to:
 
398
 
399
  ### 3.5. Antigravity (`antigravity_provider.py`)
400
 
401
+ The most sophisticated provider implementation, supporting Google's internal Antigravity API for Gemini 3 and Claude models (including **Claude Opus 4.5**, Anthropic's most powerful model).
402
 
403
  #### Architecture
404
 
405
  - **Unified Streaming/Non-Streaming**: Single code path handles both response types with optimal transformations
406
  - **Thought Signature Caching**: Server-side caching of encrypted signatures for multi-turn Gemini 3 conversations
407
+ - **Model-Specific Logic**: Automatic configuration based on model type (Gemini 3, Claude Sonnet, Claude Opus)
408
+ - **Credential Prioritization**: Automatic tier detection with paid credentials prioritized over free (paid tier resets every 5 hours, free tier resets weekly)
409
 
410
  #### Model Support
411
 
412
+ **Gemini 3 Pro:**
 
 
 
 
 
413
  - Uses `thinkingLevel` parameter (string: "low" or "high")
414
  - **Tool Hallucination Prevention**:
415
  - Automatic system instruction injection explaining custom tool schema rules
 
430
  - Increased default max output tokens to 64000 to accommodate thinking output
431
 
432
  **Claude Sonnet 4.5:**
433
+ - Proxied through Antigravity API
434
+ - **Supports both thinking and non-thinking modes**:
435
+ - With `reasoning_effort`: Uses `claude-sonnet-4-5-thinking` variant with `thinkingBudget`
436
+ - Without `reasoning_effort`: Uses standard `claude-sonnet-4-5` variant
437
  - **Thinking Preservation**: Caches thinking content using composite keys (tool_call_id + text_hash)
438
  - **Schema Cleaning**: Removes unsupported properties (`$schema`, `additionalProperties`, `const` β†’ `enum`)
439
 
README.md CHANGED
@@ -28,13 +28,14 @@ This project provides a powerful solution for developers building complex applic
28
  - **OpenAI-Compatible Proxy**: Offers a familiar API interface with additional endpoints for model and provider discovery.
29
  - **Advanced Model Filtering**: Supports both blacklists and whitelists to give you fine-grained control over which models are available through the proxy.
30
 
31
- - **πŸ†• Antigravity Provider**: Full support for Google's internal Antigravity API, providing access to Gemini 2.5, Gemini 3, and Claude models with advanced features:
32
- - **πŸš€ NEW: Claude Opus 4.5** - Anthropic's most powerful model, now available via Antigravity!
33
- - Claude Sonnet 4.5 with extended thinking support
 
 
34
  - Thought signature caching for multi-turn conversations
35
  - Tool hallucination prevention via parameter signature injection
36
  - Automatic thinking block sanitization for Claude models (with recovery strategies)
37
- - Improved function call response pairing with three-tier matching strategy
38
  - Note: Claude thinking mode requires careful conversation state management (see [Antigravity documentation](DOCUMENTATION.md#antigravity-claude-extended-thinking-sanitization) for details)
39
  - **πŸ†• Credential Prioritization**: Automatic tier detection and priority-based credential selection ensures paid-tier credentials are used for premium models that require them.
40
  - **πŸ†• Weighted Random Rotation**: Configurable credential rotation strategy - choose between deterministic (perfect balance) or weighted random (unpredictable, harder to fingerprint) selection.
 
28
  - **OpenAI-Compatible Proxy**: Offers a familiar API interface with additional endpoints for model and provider discovery.
29
  - **Advanced Model Filtering**: Supports both blacklists and whitelists to give you fine-grained control over which models are available through the proxy.
30
 
31
+ - **πŸ†• Antigravity Provider**: Full support for Google's internal Antigravity API, providing access to Gemini 3 and Claude models with advanced features:
32
+ - **πŸš€ Claude Opus 4.5** - Anthropic's most powerful model (thinking mode only)
33
+ - **Claude Sonnet 4.5** - Supports both thinking and non-thinking modes
34
+ - **Gemini 3 Pro** - With thinkingLevel support (low/high)
35
+ - Credential prioritization with automatic paid/free tier detection
36
  - Thought signature caching for multi-turn conversations
37
  - Tool hallucination prevention via parameter signature injection
38
  - Automatic thinking block sanitization for Claude models (with recovery strategies)
 
39
  - Note: Claude thinking mode requires careful conversation state management (see [Antigravity documentation](DOCUMENTATION.md#antigravity-claude-extended-thinking-sanitization) for details)
40
  - **πŸ†• Credential Prioritization**: Automatic tier detection and priority-based credential selection ensures paid-tier credentials are used for premium models that require them.
41
  - **πŸ†• Weighted Random Rotation**: Configurable credential rotation strategy - choose between deterministic (perfect balance) or weighted random (unpredictable, harder to fingerprint) selection.