| # Models Guide |
|
|
| Reference for all Rox AI models. |
|
|
| ## Model Overview |
|
|
| Rox AI offers eight specialized models, each optimized for different use cases: |
|
|
| | Model | Endpoint | Best For | Max Tokens | Default Temp | |
| |-------|----------|----------|------------|--------------| |
| | **Rox Core** | `/chat` | General conversation | 4,096 | 1.0 | |
| | **Rox 2.1 Turbo** | `/turbo` | Fast responses | 4,096 | 0.6 | |
| | **Rox 3.5 Coder** | `/coder` | Code generation | 16,384 | 0.6 | |
| | **Rox 4.5 Turbo** | `/turbo45` | Advanced reasoning | 8,192 | 0.2 | |
| | **Rox 5 Ultra** | `/ultra` | Superior reasoning | 8,192 | 1.0 | |
| | **Rox 6 Dyno** | `/dyno` | Extended context | 16,384 | 1.0 | |
| | **Rox 7 Coder** | `/coder7` | Advanced coding | 16,384 | 1.0 | |
| | **Rox Vision Max** | `/vision` | Visual understanding | 512 | 0.2 | |
|
|
| --- |
|
|
| ## Rox Core |
|
|
| **Endpoint**: `POST /chat` |
|
|
| ### Description |
| General-purpose conversational model for everyday tasks. |
|
|
| ### Best Use Cases |
| - General conversation and Q&A |
| - Content writing and generation |
| - Creative tasks (stories, poems, ideas) |
| - Summarization and analysis |
| - Educational tutoring |
| - Customer support |
|
|
| ### Parameters |
| - **Temperature**: 1.0 (balanced creativity) |
| - **Top P**: 1.0 (full diversity) |
| - **Max Tokens**: 4,096 |
|
|
| ### Example Request |
| ```bash |
| curl -X POST https://Rox-Turbo-API.hf.space/chat \ |
| -H "Content-Type: application/json" \ |
| -d '{ |
| "messages": [ |
| {"role": "user", "content": "Explain quantum computing in simple terms"} |
| ], |
| "temperature": 1.0, |
| "max_tokens": 512 |
| }' |
| ``` |
|
|
| ### When to Choose Rox Core |
| - You need creative, varied responses |
| - Task requires nuanced understanding |
| - Building a general-purpose chatbot |
| - Content needs to be engaging and natural |
|
|
| --- |
|
|
| ## Rox 2.1 Turbo |
|
|
| **Endpoint**: `POST /turbo` |
|
|
| ### Description |
| Fast model for quick responses and real-time applications. |
|
|
| ### Best Use Cases |
| - Real-time chat applications |
| - Customer support bots |
| - Quick Q&A systems |
| - High-throughput applications |
| - Simple queries and commands |
| - Factual information retrieval |
|
|
| ### Parameters |
| - **Temperature**: 0.6 (more focused) |
| - **Top P**: 0.7 (more consistent) |
| - **Max Tokens**: 4,096 |
|
|
| ### Example Request |
| ```bash |
| curl -X POST https://Rox-Turbo-API.hf.space/turbo \ |
| -H "Content-Type: application/json" \ |
| -d '{ |
| "messages": [ |
| {"role": "user", "content": "What are the business hours?"} |
| ] |
| }' |
| ``` |
|
|
| ### When to Choose Rox 2.1 Turbo |
| - Speed is critical |
| - Need consistent, reliable answers |
| - Building customer support systems |
| - High volume of requests |
| - Simple, straightforward queries |
|
|
| --- |
|
|
| ## Rox 3.5 Coder |
|
|
| **Endpoint**: `POST /coder` |
|
|
| ### Description |
| Code-focused model for programming tasks and technical work. |
|
|
| ### Best Use Cases |
| - Code generation and completion |
| - Debugging and error fixing |
| - Algorithm design and optimization |
| - Technical documentation |
| - Code review and suggestions |
| - Software architecture discussions |
| - API integration help |
|
|
| ### Parameters |
| - **Temperature**: 0.6 (precise and focused) |
| - **Top P**: 0.95 (balanced diversity) |
| - **Max Tokens**: 16,384 (extended context) |
| - **Special Features**: Enhanced thinking mode |
|
|
| ### Example Request |
| ```bash |
| curl -X POST https://Rox-Turbo-API.hf.space/coder \ |
| -H "Content-Type: application/json" \ |
| -d '{ |
| "messages": [ |
| {"role": "user", "content": "Write a Python function to implement binary search"} |
| ], |
| "max_tokens": 2048 |
| }' |
| ``` |
|
|
| ### When to Choose Rox 3.5 Coder |
| - Working with code in any language |
| - Need detailed technical explanations |
| - Debugging complex issues |
| - Designing algorithms or systems |
| - Writing technical documentation |
| - Need extended context (up to 16K tokens) |
|
|
| --- |
|
|
| ## Comparison Matrix |
|
|
| ### Performance Characteristics |
|
|
| | Feature | Rox Core | Rox 2.1 Turbo | Rox 3.5 Coder | Rox 4.5 Turbo | Rox 5 Ultra | Rox 6 Dyno | Rox 7 Coder | Rox Vision | |
| |---------|----------|---------------|---------------|---------------|-------------|------------|-------------|------------| |
| | **Speed** | Medium | Fast | Medium | Fast | Medium | Medium | Medium | Fast | |
| | **Creativity** | High | Medium | Low | Low | High | High | Medium | Low | |
| | **Consistency** | Medium | High | High | Very High | High | Medium | High | Very High | |
| | **Code Quality** | Good | Good | Excellent | Good | Excellent | Good | Superior | N/A | |
| | **Context Length** | 4K | 4K | 16K | 8K | 8K | 16K | 16K | 512 | |
| | **Thinking Mode** | No | No | Yes | Yes | Yes | Yes | Yes | No | |
| | **Reasoning** | Basic | Basic | Advanced | Very Advanced | Superior | Advanced | Superior | Basic | |
|
|
| ### Use Case Recommendations |
|
|
| | Task | Recommended Model | Why | |
| |------|------------------|-----| |
| | Write a blog post | Rox Core | Creative, engaging content | |
| | Answer "What is X?" | Rox 2.1 Turbo | Fast, factual response | |
| | Debug Python code | Rox 3.5 Coder | Code specialist | |
| | Customer support | Rox 2.1 Turbo | Quick, consistent answers | |
| | Write a story | Rox Core | Creative and varied | |
| | Explain algorithm | Rox 3.5 Coder | Technical depth | |
| | Translate text | Rox 2.1 Turbo | Fast and accurate | |
| | Design API | Rox 3.5 Coder | Technical expertise | |
| | Brainstorm ideas | Rox Core | Creative thinking | |
| | Code review | Rox 3.5 Coder | Code understanding | |
| | Complex reasoning | Rox 4.5 Turbo | Advanced thinking | |
| | Research analysis | Rox 5 Ultra | Superior reasoning | |
| | System architecture | Rox 5 Ultra | Complex design | |
| | Long documents | Rox 6 Dyno | Extended context | |
| | Large codebase | Rox 7 Coder | Advanced coding | |
| | Image analysis | Rox Vision Max | Visual understanding | |
|
|
| --- |
|
|
| ## Model Selection Guide |
|
|
| ### Decision Tree |
|
|
| ``` |
| Need to work with code? |
| ├─ Yes |
| │ ├─ Simple/medium tasks? → Rox 3.5 Coder |
| │ └─ Complex/large-scale? → Rox 7 Coder |
| └─ No |
| ├─ Need advanced reasoning? |
| │ ├─ Yes |
| │ │ ├─ Need highest quality? → Rox 5 Ultra |
| │ │ └─ Need speed? → Rox 4.5 Turbo |
| │ └─ No |
| │ ├─ Long documents? → Rox 6 Dyno |
| │ ├─ Visual tasks? → Rox Vision Max |
| │ ├─ Need fast responses? → Rox 2.1 Turbo |
| │ └─ Need creative output? → Rox Core |
| ``` |
|
|
| ### Quick Selection Tips |
|
|
| **Choose Rox Core when:** |
| - Default choice for most tasks |
| - Need creative, engaging responses |
| - Building general chatbots |
| - Content generation projects |
|
|
| **Choose Rox 2.1 Turbo when:** |
| - Speed matters most |
| - Need consistent answers |
| - High request volume |
| - Simple Q&A systems |
|
|
| **Choose Rox 3.5 Coder when:** |
| - Any coding task |
| - Technical documentation |
| - Algorithm design |
| - Need extended context |
|
|
| **Choose Rox 6 Dyno when:** |
| - Processing long documents |
| - Extended context needed |
| - Multi-document analysis |
| - Long conversations |
|
|
| **Choose Rox 7 Coder when:** |
| - Most complex coding tasks |
| - Large-scale projects |
| - System architecture |
| - Advanced algorithms |
|
|
| **Choose Rox Vision Max when:** |
| - Visual understanding |
| - Image analysis |
| - Multimodal tasks |
|
|
| --- |
|
|
| ## Advanced Usage |
|
|
| ### Switching Models Dynamically |
|
|
| ```javascript |
| class RoxAI { |
| constructor(baseUrl = 'https://Rox-Turbo-API.hf.space') { |
| this.baseUrl = baseUrl; |
| } |
| |
| async chat(message, model = 'chat') { |
| const endpoints = { |
| core: 'chat', |
| turbo: 'turbo', |
| coder: 'coder' |
| }; |
| |
| const endpoint = endpoints[model] || model; |
| |
| const response = await fetch(`${this.baseUrl}/${endpoint}`, { |
| method: 'POST', |
| headers: { 'Content-Type': 'application/json' }, |
| body: JSON.stringify({ |
| messages: [{ role: 'user', content: message }] |
| }) |
| }); |
| |
| return (await response.json()).content; |
| } |
| } |
| |
| // Usage |
| const rox = new RoxAI(); |
| |
| // Use different models for different tasks |
| const story = await rox.chat('Write a short story', 'core'); |
| const answer = await rox.chat('What is 2+2?', 'turbo'); |
| const code = await rox.chat('Write a sorting function', 'coder'); |
| ``` |
|
|
| ### Model-Specific Optimization |
|
|
| ```python |
| import requests |
| |
| class RoxClient: |
| def __init__(self, base_url="https://Rox-Turbo-API.hf.space"): |
| self.base_url = base_url |
| |
| def ask_core(self, message, creative=True): |
| """Use Rox Core with creativity control""" |
| return self._request('chat', message, |
| temperature=1.2 if creative else 0.8) |
| |
| def ask_turbo(self, message): |
| """Use Rox Turbo for fast responses""" |
| return self._request('turbo', message, max_tokens=256) |
| |
| def ask_coder(self, message, extended=False): |
| """Use Rox Coder with optional extended context""" |
| return self._request('coder', message, |
| max_tokens=8192 if extended else 2048) |
| |
| def _request(self, endpoint, message, **kwargs): |
| response = requests.post( |
| f"{self.base_url}/{endpoint}", |
| json={ |
| "messages": [{"role": "user", "content": message}], |
| **kwargs |
| } |
| ) |
| return response.json()["content"] |
| ``` |
|
|
| --- |
|
|
| ## Cost and Performance Optimization |
|
|
| ### Tips for Each Model |
|
|
| **Rox Core:** |
| - Use for tasks requiring creativity |
| - Adjust temperature based on needs |
| - Consider caching common queries |
|
|
| **Rox 2.1 Turbo:** |
| - Best cost-performance ratio |
| - Use for high-volume applications |
| - Lower max_tokens for even faster responses |
| |
| **Rox 3.5 Coder:** |
| - Use only for code-related tasks |
| - Leverage extended context when needed |
| - Cache code snippets and patterns |
| |
| --- |
| |
| ## API Compatibility |
| |
| All three models use the same request/response format: |
| |
| **Request:** |
| ```json |
| { |
| "messages": [ |
| {"role": "user", "content": "Your message"} |
| ], |
| "temperature": 1.0, |
| "top_p": 0.95, |
| "max_tokens": 512 |
| } |
| ``` |
| |
| **Response:** |
| ```json |
| { |
| "content": "Model response" |
| } |
| ``` |
| |
| This makes it easy to switch between models without changing your code! |
| |
| --- |
| |
| |
| |
| --- |
| |
| --- |
| |
| Built by Mohammad Faiz |
| |
| |
| ## Rox 4.5 Turbo |
| |
| **Endpoint**: `POST /turbo45` |
| |
| ### Description |
| Reasoning model for complex problem-solving with fast responses. |
| |
| ### Best Use Cases |
| - Complex problem solving |
| - Advanced reasoning tasks |
| - Scientific explanations |
| - Mathematical problems |
| - Strategic planning |
| - Analysis and insights |
| |
| ### Parameters |
| - **Temperature**: 0.2 (highly focused) |
| - **Top P**: 0.7 (consistent) |
| - **Max Tokens**: 8,192 |
| - **Special Features**: Enhanced reasoning mode |
| |
| ### Example Request |
| ```bash |
| curl -X POST https://Rox-Turbo-API.hf.space/turbo45 \ |
| -H "Content-Type: application/json" \ |
| -d '{ |
| "messages": [ |
| {"role": "user", "content": "Explain the theory of relativity"} |
| ], |
| "max_tokens": 2048 |
| }' |
| ``` |
| |
| ### When to Choose Rox 4.5 Turbo |
| - Need advanced reasoning |
| - Complex problem solving |
| - Scientific or technical explanations |
| - Fast responses with deep thinking |
| |
| --- |
| |
| ## Rox 5 Ultra |
| |
| **Endpoint**: `POST /ultra` |
| |
| ### Description |
| Advanced model for complex reasoning and high-quality output. |
| |
| ### Best Use Cases |
| - Most complex problem solving |
| - Research and analysis |
| - Advanced technical tasks |
| - Strategic decision making |
| - Complex code architecture |
| - Multi-step reasoning |
| |
| ### Parameters |
| - **Temperature**: 1.0 (balanced) |
| - **Top P**: 0.95 (high diversity) |
| - **Max Tokens**: 8,192 |
| - **Special Features**: Superior reasoning mode |
| |
| ### Example Request |
| ```bash |
| curl -X POST https://Rox-Turbo-API.hf.space/ultra \ |
| -H "Content-Type: application/json" \ |
| -d '{ |
| "messages": [ |
| {"role": "user", "content": "Design a scalable microservices architecture"} |
| ], |
| "max_tokens": 4096 |
| }' |
| ``` |
| |
| ### When to Choose Rox 5 Ultra |
| - Most complex tasks |
| - Need highest quality output |
| - Multi-step reasoning required |
| - Research and deep analysis |
|
|
| --- |
|
|
| ## Rox 6 Dyno |
|
|
| **Endpoint**: `POST /dyno` |
|
|
| ### Description |
| Extended context model for long documents and conversations. |
|
|
| ### Best Use Cases |
| - Long document analysis |
| - Extended conversations |
| - Document summarization |
| - Research paper analysis |
| - Multi-document synthesis |
|
|
| ### Parameters |
| - **Temperature**: 1.0 (balanced) |
| - **Top P**: 1.0 (full diversity) |
| - **Max Tokens**: 16,384 (extended context) |
| - **Special Features**: Dynamic thinking mode |
|
|
| ### Example Request |
| ```bash |
| curl -X POST https://Rox-Turbo-API.hf.space/dyno \ |
| -H "Content-Type: application/json" \ |
| -d '{ |
| "messages": [ |
| {"role": "user", "content": "Analyze this 20-page document..."} |
| ], |
| "max_tokens": 8192 |
| }' |
| ``` |
|
|
| ### When to Choose Rox 6 Dyno |
| - Processing long documents |
| - Need extended context window |
| - Multi-document analysis |
| - Long-form content generation |
|
|
| --- |
|
|
| ## Rox 7 Coder |
|
|
| **Endpoint**: `POST /coder7` |
|
|
| ### Description |
| Advanced coding model for complex programming tasks. |
|
|
| ### Best Use Cases |
| - Complex algorithm design |
| - Large-scale code generation |
| - Advanced debugging |
| - System architecture |
| - Code refactoring |
| - Multi-file code analysis |
|
|
| ### Parameters |
| - **Temperature**: 1.0 (balanced) |
| - **Top P**: 1.0 (full diversity) |
| - **Max Tokens**: 16,384 (extended context) |
| - **Special Features**: Advanced thinking mode for code |
|
|
| ### Example Request |
| ```bash |
| curl -X POST https://Rox-Turbo-API.hf.space/coder7 \ |
| -H "Content-Type: application/json" \ |
| -d '{ |
| "messages": [ |
| {"role": "user", "content": "Build a distributed caching system"} |
| ], |
| "max_tokens": 8192 |
| }' |
| ``` |
|
|
| ### When to Choose Rox 7 Coder |
| - Most complex coding tasks |
| - Large-scale projects |
| - System design and architecture |
| - Advanced algorithms |
|
|
| --- |
|
|
| ## Rox Vision Max |
|
|
| **Endpoint**: `POST /vision` |
|
|
| ### Description |
| Visual model for image analysis and multimodal tasks. |
|
|
| ### Best Use Cases |
| - Image analysis |
| - Visual understanding |
| - Multimodal tasks |
| - Image description |
| - Visual Q&A |
|
|
| ### Parameters |
| - **Temperature**: 0.2 (highly focused) |
| - **Top P**: 0.7 (consistent) |
| - **Max Tokens**: 512 |
|
|
| ### Example Request |
| ```bash |
| curl -X POST https://Rox-Turbo-API.hf.space/vision \ |
| -H "Content-Type: application/json" \ |
| -d '{ |
| "messages": [ |
| {"role": "user", "content": "Describe this image"} |
| ], |
| "max_tokens": 256 |
| }' |
| ``` |
|
|
| ### When to Choose Rox Vision Max |
| - Visual understanding tasks |
| - Image analysis |
| - Multimodal applications |
|
|
| --- |
|
|
|
|
|
|