Models Guide

Reference for all Rox AI models.

Model Overview

Rox AI offers eight specialized models, each optimized for different use cases:

| Model | Endpoint | Best For | Max Tokens | Default Temp |
|---|---|---|---|---|
| Rox Core | /chat | General conversation | 4,096 | 1.0 |
| Rox 2.1 Turbo | /turbo | Fast responses | 4,096 | 0.6 |
| Rox 3.5 Coder | /coder | Code generation | 16,384 | 0.6 |
| Rox 4.5 Turbo | /turbo45 | Advanced reasoning | 8,192 | 0.2 |
| Rox 5 Ultra | /ultra | Superior reasoning | 8,192 | 1.0 |
| Rox 6 Dyno | /dyno | Extended context | 16,384 | 1.0 |
| Rox 7 Coder | /coder7 | Advanced coding | 16,384 | 1.0 |
| Rox Vision Max | /vision | Visual understanding | 512 | 0.2 |
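
For client code, the overview table can be captured as a small lookup structure. The following is a minimal sketch: the endpoint paths, token limits, and default temperatures are copied from the table, while the `ModelSpec` class, the short dictionary keys, and the `clamp_max_tokens` helper are hypothetical names introduced here and are not part of the Rox API. How the server treats a `max_tokens` value above the documented limit is not stated in this guide, so clamping on the client side is only a precaution.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    endpoint: str        # path from the "Endpoint" column
    max_tokens: int      # value from the "Max Tokens" column
    default_temp: float  # value from the "Default Temp" column


# Values copied from the overview table. The keys are hypothetical short
# names chosen for this sketch, not identifiers defined by the API.
MODELS = {
    "core": ModelSpec("/chat", 4096, 1.0),
    "turbo": ModelSpec("/turbo", 4096, 0.6),
    "coder": ModelSpec("/coder", 16384, 0.6),
    "turbo45": ModelSpec("/turbo45", 8192, 0.2),
    "ultra": ModelSpec("/ultra", 8192, 1.0),
    "dyno": ModelSpec("/dyno", 16384, 1.0),
    "coder7": ModelSpec("/coder7", 16384, 1.0),
    "vision": ModelSpec("/vision", 512, 0.2),
}


def clamp_max_tokens(model: str, requested: int) -> int:
    """Cap a requested token count at the model's documented maximum."""
    return max(1, min(requested, MODELS[model].max_tokens))
```

For example, `clamp_max_tokens("vision", 2048)` caps the request at the 512 tokens listed for Rox Vision Max.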

Rox Core

Endpoint: POST /chat

Description

General-purpose conversational model for everyday tasks.

Best Use Cases

  • General conversation and Q&A
  • Content writing and generation
  • Creative tasks (stories, poems, ideas)
  • Summarization and analysis
  • Educational tutoring
  • Customer support

Parameters

  • Temperature: 1.0 (balanced creativity)
  • Top P: 1.0 (full diversity)
  • Max Tokens: 4,096
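
The notes in parentheses for Temperature and Top P can be made concrete with a generic illustration of how these two sampling parameters are commonly defined. This is a sketch of the standard technique only; this guide does not describe how Rox models implement sampling, and the token scores below are invented for the example.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities; a lower temperature sharpens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_p_filter(probs, top_p):
    """Keep the smallest set of most-likely tokens whose cumulative probability reaches top_p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    return sorted(kept)


logits = [2.0, 1.0, 0.5, 0.1]  # made-up scores for four candidate tokens
balanced = softmax_with_temperature(logits, 1.0)  # Rox Core's default temperature
focused = softmax_with_temperature(logits, 0.2)   # a low temperature for comparison
```

With these scores, the top token's probability rises from roughly 0.57 at temperature 1.0 to above 0.99 at 0.2, and `top_p_filter(balanced, 0.7)` keeps only the two most likely tokens, while a Top P of 1.0 keeps all four.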

Example Request

curl -X POST https://Rox-Turbo-API.hf.space/chat \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Explain quantum computing in simple terms"}
    ],
    "temperature": 1.0,
    "max_tokens": 512
  }'

When to Choose Rox Core

  • You need creative, varied responses
  • Task requires nuanced understanding
  • Building a general-purpose chatbot
  • Content needs to be engaging and natural

Rox 2.1 Turbo

Endpoint: POST /turbo

Description

Fast model for quick responses and real-time applications.

Best Use Cases

  • Real-time chat applications
  • Customer support bots
  • Quick Q&A systems
  • High-throughput applications
  • Simple queries and commands
  • Factual information retrieval

Parameters

  • Temperature: 0.6 (more focused)
  • Top P: 0.7 (more consistent)
  • Max Tokens: 4,096

Example Request

curl -X POST https://Rox-Turbo-API.hf.space/turbo \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "What are the business hours?"}
    ]
  }'

When to Choose Rox 2.1 Turbo

  • Speed is critical
  • Need consistent, reliable answers
  • Building customer support systems
  • High volume of requests
  • Simple, straightforward queries

Rox 3.5 Coder

Endpoint: POST /coder

Description

Code-focused model for programming tasks and technical work.

Best Use Cases

  • Code generation and completion
  • Debugging and error fixing
  • Algorithm design and optimization
  • Technical documentation
  • Code review and suggestions
  • Software architecture discussions
  • API integration help

Parameters

  • Temperature: 0.6 (precise and focused)
  • Top P: 0.95 (balanced diversity)
  • Max Tokens: 16,384 (extended context)
  • Special Features: Enhanced thinking mode

Example Request

curl -X POST https://Rox-Turbo-API.hf.space/coder \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Write a Python function to implement binary search"}
    ],
    "max_tokens": 2048
  }'

When to Choose Rox 3.5 Coder

  • Working with code in any language
  • Need detailed technical explanations
  • Debugging complex issues
  • Designing algorithms or systems
  • Writing technical documentation
  • Need extended context (up to 16K tokens)

Comparison Matrix

Performance Characteristics

| Feature | Rox Core | Rox 2.1 Turbo | Rox 3.5 Coder | Rox 4.5 Turbo | Rox 5 Ultra | Rox 6 Dyno | Rox 7 Coder | Rox Vision |
|---|---|---|---|---|---|---|---|---|
| Speed | Medium | Fast | Medium | Fast | Medium | Medium | Medium | Fast |
| Creativity | High | Medium | Low | Low | High | High | Medium | Low |
| Consistency | Medium | High | High | Very High | High | Medium | High | Very High |
| Code Quality | Good | Good | Excellent | Good | Excellent | Good | Superior | N/A |
| Context Length | 4K | 4K | 16K | 8K | 8K | 16K | 16K | 512 |
| Thinking Mode | No | No | Yes | Yes | Yes | Yes | Yes | No |
| Reasoning | Basic | Basic | Advanced | Very Advanced | Superior | Advanced | Superior | Basic |
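
The matrix can also be queried in code to shortlist candidates. The sketch below encodes three of its rows (Speed, Context Length, Thinking Mode). The `MATRIX` dictionary and the `shortlist` function are hypothetical helpers written for this guide, and the numeric context values assume that "4K", "8K", and "16K" correspond to the 4,096, 8,192, and 16,384 token limits listed in the Model Overview.

```python
# Three rows of the comparison matrix, keyed by model name.
MATRIX = {
    "Rox Core":       {"speed": "Medium", "context": 4096,  "thinking": False},
    "Rox 2.1 Turbo":  {"speed": "Fast",   "context": 4096,  "thinking": False},
    "Rox 3.5 Coder":  {"speed": "Medium", "context": 16384, "thinking": True},
    "Rox 4.5 Turbo":  {"speed": "Fast",   "context": 8192,  "thinking": True},
    "Rox 5 Ultra":    {"speed": "Medium", "context": 8192,  "thinking": True},
    "Rox 6 Dyno":     {"speed": "Medium", "context": 16384, "thinking": True},
    "Rox 7 Coder":    {"speed": "Medium", "context": 16384, "thinking": True},
    "Rox Vision Max": {"speed": "Fast",   "context": 512,   "thinking": False},
}


def shortlist(min_context=0, thinking=None, speed=None):
    """Return model names that satisfy every filter that was supplied."""
    names = []
    for name, row in MATRIX.items():
        if row["context"] < min_context:
            continue
        if thinking is not None and row["thinking"] != thinking:
            continue
        if speed is not None and row["speed"] != speed:
            continue
        names.append(name)
    return names
```

For instance, `shortlist(thinking=True, speed="Fast")` leaves only Rox 4.5 Turbo, which matches its description as a reasoning model with fast responses.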

Use Case Recommendations

| Task | Recommended Model | Why |
|---|---|---|
| Write a blog post | Rox Core | Creative, engaging content |
| Answer "What is X?" | Rox 2.1 Turbo | Fast, factual response |
| Debug Python code | Rox 3.5 Coder | Code specialist |
| Customer support | Rox 2.1 Turbo | Quick, consistent answers |
| Write a story | Rox Core | Creative and varied |
| Explain algorithm | Rox 3.5 Coder | Technical depth |
| Translate text | Rox 2.1 Turbo | Fast and accurate |
| Design API | Rox 3.5 Coder | Technical expertise |
| Brainstorm ideas | Rox Core | Creative thinking |
| Code review | Rox 3.5 Coder | Code understanding |
| Complex reasoning | Rox 4.5 Turbo | Advanced thinking |
| Research analysis | Rox 5 Ultra | Superior reasoning |
| System architecture | Rox 5 Ultra | Complex design |
| Long documents | Rox 6 Dyno | Extended context |
| Large codebase | Rox 7 Coder | Advanced coding |
| Image analysis | Rox Vision Max | Visual understanding |

Model Selection Guide

Decision Tree

Need to work with code?
├─ Yes
│  ├─ Simple/medium tasks? → Rox 3.5 Coder
│  └─ Complex/large-scale? → Rox 7 Coder
└─ No
   ├─ Need advanced reasoning?
   │  ├─ Yes
   │  │  ├─ Need highest quality? → Rox 5 Ultra
   │  │  └─ Need speed? → Rox 4.5 Turbo
   │  └─ No
   │     ├─ Long documents? → Rox 6 Dyno
   │     ├─ Visual tasks? → Rox Vision Max
   │     ├─ Need fast responses? → Rox 2.1 Turbo
   │     └─ Need creative output? → Rox Core
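
The decision tree translates directly into a small function. This is a sketch under two assumptions that the tree itself does not spell out: within the advanced-reasoning branch, anything that is not "highest quality" is treated as the speed case, and Rox Core is used as the fallback when no other branch matches. The function name `choose_model` and its flag names are hypothetical.

```python
def choose_model(
    code: bool = False,
    complex_code: bool = False,
    advanced_reasoning: bool = False,
    highest_quality: bool = False,
    long_documents: bool = False,
    visual: bool = False,
    fast: bool = False,
) -> str:
    """Walk the decision tree from top to bottom and return a model name."""
    if code:
        return "Rox 7 Coder" if complex_code else "Rox 3.5 Coder"
    if advanced_reasoning:
        # Assumption: "not highest quality" is read as the speed-oriented branch.
        return "Rox 5 Ultra" if highest_quality else "Rox 4.5 Turbo"
    if long_documents:
        return "Rox 6 Dyno"
    if visual:
        return "Rox Vision Max"
    if fast:
        return "Rox 2.1 Turbo"
    # Assumption: Rox Core doubles as the default when nothing else applies.
    return "Rox Core"
```

The checks run in the same order as the branches in the tree, so a request flagged as both `code` and `visual` resolves to a coder model, exactly as the tree would.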

Quick Selection Tips

Choose Rox Core when:

  • No specialized model clearly fits (a good default for most tasks)
  • Need creative, engaging responses
  • Building general chatbots
  • Content generation projects

Choose Rox 2.1 Turbo when:

  • Speed matters most
  • Need consistent answers
  • High request volume
  • Simple Q&A systems

Choose Rox 3.5 Coder when:

  • Simple to medium coding tasks
  • Technical documentation
  • Algorithm design
  • Need extended context

Choose Rox 6 Dyno when:

  • Processing long documents
  • Extended context needed
  • Multi-document analysis
  • Long conversations

Choose Rox 7 Coder when:

  • Most complex coding tasks
  • Large-scale projects
  • System architecture
  • Advanced algorithms

Choose Rox Vision Max when:

  • Visual understanding
  • Image analysis
  • Multimodal tasks

Advanced Usage

Switching Models Dynamically

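class RoxAI {
  constructor(baseUrl = 'https://Rox-Turbo-API.hf.space') {
    this.baseUrl = baseUrl;
  }

  // `model` is either a short alias from the map below or a raw endpoint
  // name such as 'turbo45', 'ultra', 'dyno', 'coder7', or 'vision'.
  async chat(message, model = 'core') {
    const endpoints = {
      core: 'chat',
      turbo: 'turbo',
      coder: 'coder'
    };

    // Unknown aliases fall through and are used as the endpoint name.
    const endpoint = endpoints[model] || model;

    const response = await fetch(`${this.baseUrl}/${endpoint}`, {
      method: 'POST',
      headers: { 'Content-Type': 'application/json' },
      body: JSON.stringify({
        messages: [{ role: 'user', content: message }]
      })
    });

    // Surface HTTP errors instead of failing later on a missing field.
    if (!response.ok) {
      throw new Error(`Rox API request failed: ${response.status}`);
    }

    return (await response.json()).content;
  }
}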

// Usage
const rox = new RoxAI();

// Use different models for different tasks
const story = await rox.chat('Write a short story', 'core');
const answer = await rox.chat('What is 2+2?', 'turbo');
const code = await rox.chat('Write a sorting function', 'coder');

Model-Specific Optimization

import requests

class RoxClient:
    def __init__(self, base_url="https://Rox-Turbo-API.hf.space", timeout=60):
        self.base_url = base_url
        self.timeout = timeout  # seconds to wait before giving up on a request

    def ask_core(self, message, creative=True):
        """Use Rox Core with creativity control"""
        return self._request('chat', message,
                           temperature=1.2 if creative else 0.8)

    def ask_turbo(self, message):
        """Use Rox Turbo for fast responses"""
        return self._request('turbo', message, max_tokens=256)

    def ask_coder(self, message, extended=False):
        """Use Rox Coder with optional extended context"""
        return self._request('coder', message,
                           max_tokens=8192 if extended else 2048)

    def _request(self, endpoint, message, **kwargs):
        # Extra keyword arguments (temperature, top_p, max_tokens) are sent as-is.
        response = requests.post(
            f"{self.base_url}/{endpoint}",
            json={
                "messages": [{"role": "user", "content": message}],
                **kwargs
            },
            timeout=self.timeout
        )
        response.raise_for_status()  # raise on 4xx/5xx instead of parsing an error body
        return response.json()["content"]

Cost and Performance Optimization

Tips for Each Model

Rox Core:

  • Use for tasks requiring creativity
  • Adjust temperature based on needs
  • Consider caching common queries

Rox 2.1 Turbo:

  • Best cost-performance ratio
  • Use for high-volume applications
  • Lower max_tokens for even faster responses

Rox 3.5 Coder:

  • Use only for code-related tasks
  • Leverage extended context when needed
  • Cache code snippets and patterns
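
The caching tips above can be sketched as a thin wrapper around whatever function actually sends the request. Everything here is hypothetical scaffolding: `CachedClient`, its `send` callback, and the `fake_send` stand-in are illustrative names, and no network call is made. An exact-match cache like this returns the first answer it saw, so it fits repeated factual queries better than prompts where varied output is the goal.

```python
from typing import Callable, Dict, Tuple


class CachedClient:
    """Reuse answers for repeated (endpoint, prompt) pairs."""

    def __init__(self, send: Callable[[str, str], str]):
        self._send = send  # function that performs the real request
        self._cache: Dict[Tuple[str, str], str] = {}
        self.hits = 0      # number of requests answered from the cache

    def ask(self, endpoint: str, prompt: str) -> str:
        key = (endpoint, prompt.strip())
        if key in self._cache:
            self.hits += 1
            return self._cache[key]
        answer = self._send(endpoint, prompt)
        self._cache[key] = answer
        return answer


# Stand-in for a real HTTP call, used only to demonstrate the wrapper.
calls = []


def fake_send(endpoint: str, prompt: str) -> str:
    calls.append((endpoint, prompt))
    return f"answer to {prompt!r}"


client = CachedClient(fake_send)
first = client.ask("turbo", "What are the business hours?")
second = client.ask("turbo", "What are the business hours?")
```

After the two identical calls, `fake_send` has been invoked once and `client.hits` is 1. In real use, `send` would wrap a request to the chosen endpoint.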

API Compatibility

All models use the same request/response format:

Request:

{
  "messages": [
    {"role": "user", "content": "Your message"}
  ],
  "temperature": 1.0,
  "top_p": 0.95,
  "max_tokens": 512
}

Response:

{
  "content": "Model response"
}

Because the format is shared, switching between models only requires changing the endpoint path.
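
A minimal sketch of building and reading this format without any network access follows. The helper names `build_request` and `parse_response` are hypothetical. The optional fields are left out when not supplied, mirroring the example requests in this guide that send only `messages`. The guide does not document an error response format, so the parser simply rejects any body without a `content` field.

```python
import json


def build_request(message, temperature=None, top_p=None, max_tokens=None):
    """Serialize a single-turn request body in the shared format."""
    body = {"messages": [{"role": "user", "content": message}]}
    optional = {"temperature": temperature, "top_p": top_p, "max_tokens": max_tokens}
    for key, value in optional.items():
        if value is not None:  # omit unset fields so the model defaults apply
            body[key] = value
    return json.dumps(body)


def parse_response(raw):
    """Extract the model's text from a response body."""
    data = json.loads(raw)
    if "content" not in data:
        raise ValueError("response has no 'content' field")
    return data["content"]


payload = build_request("Your message", temperature=1.0, top_p=0.95, max_tokens=512)
text = parse_response('{"content": "Model response"}')
```

The same `payload` string can be posted to any of the endpoints listed in the Model Overview.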




Built by Mohammad Faiz

Rox 4.5 Turbo

Endpoint: POST /turbo45

Description

Reasoning model for complex problem-solving with fast responses.

Best Use Cases

  • Complex problem solving
  • Advanced reasoning tasks
  • Scientific explanations
  • Mathematical problems
  • Strategic planning
  • Analysis and insights

Parameters

  • Temperature: 0.2 (highly focused)
  • Top P: 0.7 (consistent)
  • Max Tokens: 8,192
  • Special Features: Enhanced reasoning mode

Example Request

curl -X POST https://Rox-Turbo-API.hf.space/turbo45 \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Explain the theory of relativity"}
    ],
    "max_tokens": 2048
  }'

When to Choose Rox 4.5 Turbo

  • Need advanced reasoning
  • Complex problem solving
  • Scientific or technical explanations
  • Fast responses with deep thinking

Rox 5 Ultra

Endpoint: POST /ultra

Description

Advanced model for complex reasoning and high-quality output.

Best Use Cases

  • Most complex problem solving
  • Research and analysis
  • Advanced technical tasks
  • Strategic decision making
  • Complex code architecture
  • Multi-step reasoning

Parameters

  • Temperature: 1.0 (balanced)
  • Top P: 0.95 (high diversity)
  • Max Tokens: 8,192
  • Special Features: Superior reasoning mode

Example Request

curl -X POST https://Rox-Turbo-API.hf.space/ultra \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Design a scalable microservices architecture"}
    ],
    "max_tokens": 4096
  }'

When to Choose Rox 5 Ultra

  • Most complex tasks
  • Need highest quality output
  • Multi-step reasoning required
  • Research and deep analysis

Rox 6 Dyno

Endpoint: POST /dyno

Description

Extended context model for long documents and conversations.

Best Use Cases

  • Long document analysis
  • Extended conversations
  • Document summarization
  • Research paper analysis
  • Multi-document synthesis

Parameters

  • Temperature: 1.0 (balanced)
  • Top P: 1.0 (full diversity)
  • Max Tokens: 16,384 (extended context)
  • Special Features: Dynamic thinking mode

Example Request

curl -X POST https://Rox-Turbo-API.hf.space/dyno \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Analyze this 20-page document..."}
    ],
    "max_tokens": 8192
  }'

When to Choose Rox 6 Dyno

  • Processing long documents
  • Need extended context window
  • Multi-document analysis
  • Long-form content generation

Rox 7 Coder

Endpoint: POST /coder7

Description

Advanced coding model for complex programming tasks.

Best Use Cases

  • Complex algorithm design
  • Large-scale code generation
  • Advanced debugging
  • System architecture
  • Code refactoring
  • Multi-file code analysis

Parameters

  • Temperature: 1.0 (balanced)
  • Top P: 1.0 (full diversity)
  • Max Tokens: 16,384 (extended context)
  • Special Features: Advanced thinking mode for code

Example Request

curl -X POST https://Rox-Turbo-API.hf.space/coder7 \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Build a distributed caching system"}
    ],
    "max_tokens": 8192
  }'

When to Choose Rox 7 Coder

  • Most complex coding tasks
  • Large-scale projects
  • System design and architecture
  • Advanced algorithms

Rox Vision Max

Endpoint: POST /vision

Description

Visual model for image analysis and multimodal tasks.

Best Use Cases

  • Image analysis
  • Visual understanding
  • Multimodal tasks
  • Image description
  • Visual Q&A

Parameters

  • Temperature: 0.2 (highly focused)
  • Top P: 0.7 (consistent)
  • Max Tokens: 512

Example Request

curl -X POST https://Rox-Turbo-API.hf.space/vision \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Describe this image"}
    ],
    "max_tokens": 256
  }'

When to Choose Rox Vision Max

  • Visual understanding tasks
  • Image analysis
  • Multimodal applications