Spaces:

Rox-Turbo
/

API

Running

App Files Files Community

API / docs /MODELS.md

Rox-Turbo

Upload 12 files

58ec31b verified about 1 month ago

preview code

raw

history blame contribute delete

13.9 kB

	# Models Guide

	Reference for all Rox AI models.

	## Model Overview

	Rox AI offers eight specialized models, each optimized for different use cases:

	\| Model \| Endpoint \| Best For \| Max Tokens \| Default Temp \|
	\|-------\|----------\|----------\|------------\|--------------\|
	\| Rox Core \| `/chat` \| General conversation \| 4,096 \| 1.0 \|
	\| Rox 2.1 Turbo \| `/turbo` \| Fast responses \| 4,096 \| 0.6 \|
	\| Rox 3.5 Coder \| `/coder` \| Code generation \| 16,384 \| 0.6 \|
	\| Rox 4.5 Turbo \| `/turbo45` \| Advanced reasoning \| 8,192 \| 0.2 \|
	\| Rox 5 Ultra \| `/ultra` \| Superior reasoning \| 8,192 \| 1.0 \|
	\| Rox 6 Dyno \| `/dyno` \| Extended context \| 16,384 \| 1.0 \|
	\| Rox 7 Coder \| `/coder7` \| Advanced coding \| 16,384 \| 1.0 \|
	\| Rox Vision Max \| `/vision` \| Visual understanding \| 512 \| 0.2 \|

	---

	## Rox Core

	Endpoint: `POST /chat`

	### Description
	General-purpose conversational model for everyday tasks.

	### Best Use Cases
	- General conversation and Q&A
	- Content writing and generation
	- Creative tasks (stories, poems, ideas)
	- Summarization and analysis
	- Educational tutoring
	- Customer support

	### Parameters
	- Temperature: 1.0 (balanced creativity)
	- Top P: 1.0 (full diversity)
	- Max Tokens: 4,096

	### Example Request
	```bash
	curl -X POST https://Rox-Turbo-API.hf.space/chat \
	-H "Content-Type: application/json" \
	-d '{
	"messages": [
	{"role": "user", "content": "Explain quantum computing in simple terms"}
	],
	"temperature": 1.0,
	"max_tokens": 512
	}'
	```

	### When to Choose Rox Core
	- You need creative, varied responses
	- Task requires nuanced understanding
	- Building a general-purpose chatbot
	- Content needs to be engaging and natural

	---

	## Rox 2.1 Turbo

	Endpoint: `POST /turbo`

	### Description
	Fast model for quick responses and real-time applications.

	### Best Use Cases
	- Real-time chat applications
	- Customer support bots
	- Quick Q&A systems
	- High-throughput applications
	- Simple queries and commands
	- Factual information retrieval

	### Parameters
	- Temperature: 0.6 (more focused)
	- Top P: 0.7 (more consistent)
	- Max Tokens: 4,096

	### Example Request
	```bash
	curl -X POST https://Rox-Turbo-API.hf.space/turbo \
	-H "Content-Type: application/json" \
	-d '{
	"messages": [
	{"role": "user", "content": "What are the business hours?"}
	]
	}'
	```

	### When to Choose Rox 2.1 Turbo
	- Speed is critical
	- Need consistent, reliable answers
	- Building customer support systems
	- High volume of requests
	- Simple, straightforward queries

	---

	## Rox 3.5 Coder

	Endpoint: `POST /coder`

	### Description
	Code-focused model for programming tasks and technical work.

	### Best Use Cases
	- Code generation and completion
	- Debugging and error fixing
	- Algorithm design and optimization
	- Technical documentation
	- Code review and suggestions
	- Software architecture discussions
	- API integration help

	### Parameters
	- Temperature: 0.6 (precise and focused)
	- Top P: 0.95 (balanced diversity)
	- Max Tokens: 16,384 (extended context)
	- Special Features: Enhanced thinking mode

	### Example Request
	```bash
	curl -X POST https://Rox-Turbo-API.hf.space/coder \
	-H "Content-Type: application/json" \
	-d '{
	"messages": [
	{"role": "user", "content": "Write a Python function to implement binary search"}
	],
	"max_tokens": 2048
	}'
	```

	### When to Choose Rox 3.5 Coder
	- Working with code in any language
	- Need detailed technical explanations
	- Debugging complex issues
	- Designing algorithms or systems
	- Writing technical documentation
	- Need extended context (up to 16K tokens)

	---

	## Comparison Matrix

	### Performance Characteristics

	\| Feature \| Rox Core \| Rox 2.1 Turbo \| Rox 3.5 Coder \| Rox 4.5 Turbo \| Rox 5 Ultra \| Rox 6 Dyno \| Rox 7 Coder \| Rox Vision \|
	\|---------\|----------\|---------------\|---------------\|---------------\|-------------\|------------\|-------------\|------------\|
	\| Speed \| Medium \| Fast \| Medium \| Fast \| Medium \| Medium \| Medium \| Fast \|
	\| Creativity \| High \| Medium \| Low \| Low \| High \| High \| Medium \| Low \|
	\| Consistency \| Medium \| High \| High \| Very High \| High \| Medium \| High \| Very High \|
	\| Code Quality \| Good \| Good \| Excellent \| Good \| Excellent \| Good \| Superior \| N/A \|
	\| Context Length \| 4K \| 4K \| 16K \| 8K \| 8K \| 16K \| 16K \| 512 \|
	\| Thinking Mode \| No \| No \| Yes \| Yes \| Yes \| Yes \| Yes \| No \|
	\| Reasoning \| Basic \| Basic \| Advanced \| Very Advanced \| Superior \| Advanced \| Superior \| Basic \|

	### Use Case Recommendations

	\| Task \| Recommended Model \| Why \|
	\|------\|------------------\|-----\|
	\| Write a blog post \| Rox Core \| Creative, engaging content \|
	\| Answer "What is X?" \| Rox 2.1 Turbo \| Fast, factual response \|
	\| Debug Python code \| Rox 3.5 Coder \| Code specialist \|
	\| Customer support \| Rox 2.1 Turbo \| Quick, consistent answers \|
	\| Write a story \| Rox Core \| Creative and varied \|
	\| Explain algorithm \| Rox 3.5 Coder \| Technical depth \|
	\| Translate text \| Rox 2.1 Turbo \| Fast and accurate \|
	\| Design API \| Rox 3.5 Coder \| Technical expertise \|
	\| Brainstorm ideas \| Rox Core \| Creative thinking \|
	\| Code review \| Rox 3.5 Coder \| Code understanding \|
	\| Complex reasoning \| Rox 4.5 Turbo \| Advanced thinking \|
	\| Research analysis \| Rox 5 Ultra \| Superior reasoning \|
	\| System architecture \| Rox 5 Ultra \| Complex design \|
	\| Long documents \| Rox 6 Dyno \| Extended context \|
	\| Large codebase \| Rox 7 Coder \| Advanced coding \|
	\| Image analysis \| Rox Vision Max \| Visual understanding \|

	---

	## Model Selection Guide

	### Decision Tree

	```
	Need to work with code?
	├─ Yes
	│ ├─ Simple/medium tasks? → Rox 3.5 Coder
	│ └─ Complex/large-scale? → Rox 7 Coder
	└─ No
	├─ Need advanced reasoning?
	│ ├─ Yes
	│ │ ├─ Need highest quality? → Rox 5 Ultra
	│ │ └─ Need speed? → Rox 4.5 Turbo
	│ └─ No
	│ ├─ Long documents? → Rox 6 Dyno
	│ ├─ Visual tasks? → Rox Vision Max
	│ ├─ Need fast responses? → Rox 2.1 Turbo
	│ └─ Need creative output? → Rox Core
	```

	### Quick Selection Tips

	Choose Rox Core when:
	- Default choice for most tasks
	- Need creative, engaging responses
	- Building general chatbots
	- Content generation projects

	Choose Rox 2.1 Turbo when:
	- Speed matters most
	- Need consistent answers
	- High request volume
	- Simple Q&A systems

	Choose Rox 3.5 Coder when:
	- Any coding task
	- Technical documentation
	- Algorithm design
	- Need extended context

	Choose Rox 6 Dyno when:
	- Processing long documents
	- Extended context needed
	- Multi-document analysis
	- Long conversations

	Choose Rox 7 Coder when:
	- Most complex coding tasks
	- Large-scale projects
	- System architecture
	- Advanced algorithms

	Choose Rox Vision Max when:
	- Visual understanding
	- Image analysis
	- Multimodal tasks

	---

	## Advanced Usage

	### Switching Models Dynamically

	```javascript
	class RoxAI {
	constructor(baseUrl = 'https://Rox-Turbo-API.hf.space') {
	this.baseUrl = baseUrl;
	}

	async chat(message, model = 'chat') {
	const endpoints = {
	core: 'chat',
	turbo: 'turbo',
	coder: 'coder'
	};

	const endpoint = endpoints[model] \|\| model;

	const response = await fetch(`${this.baseUrl}/${endpoint}`, {
	method: 'POST',
	headers: { 'Content-Type': 'application/json' },
	body: JSON.stringify({
	messages: [{ role: 'user', content: message }]
	})
	});

	return (await response.json()).content;
	}
	}

	// Usage
	const rox = new RoxAI();

	// Use different models for different tasks
	const story = await rox.chat('Write a short story', 'core');
	const answer = await rox.chat('What is 2+2?', 'turbo');
	const code = await rox.chat('Write a sorting function', 'coder');
	```

	### Model-Specific Optimization

	```python
	import requests

	class RoxClient:
	def __init__(self, base_url="https://Rox-Turbo-API.hf.space"):
	self.base_url = base_url

	def ask_core(self, message, creative=True):
	"""Use Rox Core with creativity control"""
	return self._request('chat', message,
	temperature=1.2 if creative else 0.8)

	def ask_turbo(self, message):
	"""Use Rox Turbo for fast responses"""
	return self._request('turbo', message, max_tokens=256)

	def ask_coder(self, message, extended=False):
	"""Use Rox Coder with optional extended context"""
	return self._request('coder', message,
	max_tokens=8192 if extended else 2048)

	def _request(self, endpoint, message, **kwargs):
	response = requests.post(
	f"{self.base_url}/{endpoint}",
	json={
	"messages": [{"role": "user", "content": message}],
	**kwargs
	}
	)
	return response.json()["content"]
	```

	---

	## Cost and Performance Optimization

	### Tips for Each Model

	Rox Core:
	- Use for tasks requiring creativity
	- Adjust temperature based on needs
	- Consider caching common queries

	Rox 2.1 Turbo:
	- Best cost-performance ratio
	- Use for high-volume applications
	- Lower max_tokens for even faster responses

	Rox 3.5 Coder:
	- Use only for code-related tasks
	- Leverage extended context when needed
	- Cache code snippets and patterns

	---

	## API Compatibility

	All three models use the same request/response format:

	Request:
	```json
	{
	"messages": [
	{"role": "user", "content": "Your message"}
	],
	"temperature": 1.0,
	"top_p": 0.95,
	"max_tokens": 512
	}
	```

	Response:
	```json
	{
	"content": "Model response"
	}
	```

	This makes it easy to switch between models without changing your code!

	---



	---

	---

	Built by Mohammad Faiz


	## Rox 4.5 Turbo

	Endpoint: `POST /turbo45`

	### Description
	Reasoning model for complex problem-solving with fast responses.

	### Best Use Cases
	- Complex problem solving
	- Advanced reasoning tasks
	- Scientific explanations
	- Mathematical problems
	- Strategic planning
	- Analysis and insights

	### Parameters
	- Temperature: 0.2 (highly focused)
	- Top P: 0.7 (consistent)
	- Max Tokens: 8,192
	- Special Features: Enhanced reasoning mode

	### Example Request
	```bash
	curl -X POST https://Rox-Turbo-API.hf.space/turbo45 \
	-H "Content-Type: application/json" \
	-d '{
	"messages": [
	{"role": "user", "content": "Explain the theory of relativity"}
	],
	"max_tokens": 2048
	}'
	```

	### When to Choose Rox 4.5 Turbo
	- Need advanced reasoning
	- Complex problem solving
	- Scientific or technical explanations
	- Fast responses with deep thinking

	---

	## Rox 5 Ultra

	Endpoint: `POST /ultra`

	### Description
	Advanced model for complex reasoning and high-quality output.

	### Best Use Cases
	- Most complex problem solving
	- Research and analysis
	- Advanced technical tasks
	- Strategic decision making
	- Complex code architecture
	- Multi-step reasoning

	### Parameters
	- Temperature: 1.0 (balanced)
	- Top P: 0.95 (high diversity)
	- Max Tokens: 8,192
	- Special Features: Superior reasoning mode

	### Example Request
	```bash
	curl -X POST https://Rox-Turbo-API.hf.space/ultra \
	-H "Content-Type: application/json" \
	-d '{
	"messages": [
	{"role": "user", "content": "Design a scalable microservices architecture"}
	],
	"max_tokens": 4096
	}'
	```

	### When to Choose Rox 5 Ultra
	- Most complex tasks
	- Need highest quality output
	- Multi-step reasoning required
	- Research and deep analysis

	---

	## Rox 6 Dyno

	Endpoint: `POST /dyno`

	### Description
	Extended context model for long documents and conversations.

	### Best Use Cases
	- Long document analysis
	- Extended conversations
	- Document summarization
	- Research paper analysis
	- Multi-document synthesis

	### Parameters
	- Temperature: 1.0 (balanced)
	- Top P: 1.0 (full diversity)
	- Max Tokens: 16,384 (extended context)
	- Special Features: Dynamic thinking mode

	### Example Request
	```bash
	curl -X POST https://Rox-Turbo-API.hf.space/dyno \
	-H "Content-Type: application/json" \
	-d '{
	"messages": [
	{"role": "user", "content": "Analyze this 20-page document..."}
	],
	"max_tokens": 8192
	}'
	```

	### When to Choose Rox 6 Dyno
	- Processing long documents
	- Need extended context window
	- Multi-document analysis
	- Long-form content generation

	---

	## Rox 7 Coder

	Endpoint: `POST /coder7`

	### Description
	Advanced coding model for complex programming tasks.

	### Best Use Cases
	- Complex algorithm design
	- Large-scale code generation
	- Advanced debugging
	- System architecture
	- Code refactoring
	- Multi-file code analysis

	### Parameters
	- Temperature: 1.0 (balanced)
	- Top P: 1.0 (full diversity)
	- Max Tokens: 16,384 (extended context)
	- Special Features: Advanced thinking mode for code

	### Example Request
	```bash
	curl -X POST https://Rox-Turbo-API.hf.space/coder7 \
	-H "Content-Type: application/json" \
	-d '{
	"messages": [
	{"role": "user", "content": "Build a distributed caching system"}
	],
	"max_tokens": 8192
	}'
	```

	### When to Choose Rox 7 Coder
	- Most complex coding tasks
	- Large-scale projects
	- System design and architecture
	- Advanced algorithms

	---

	## Rox Vision Max

	Endpoint: `POST /vision`

	### Description
	Visual model for image analysis and multimodal tasks.

	### Best Use Cases
	- Image analysis
	- Visual understanding
	- Multimodal tasks
	- Image description
	- Visual Q&A

	### Parameters
	- Temperature: 0.2 (highly focused)
	- Top P: 0.7 (consistent)
	- Max Tokens: 512

	### Example Request
	```bash
	curl -X POST https://Rox-Turbo-API.hf.space/vision \
	-H "Content-Type: application/json" \
	-d '{
	"messages": [
	{"role": "user", "content": "Describe this image"}
	],
	"max_tokens": 256
	}'
	```

	### When to Choose Rox Vision Max
	- Visual understanding tasks
	- Image analysis
	- Multimodal applications

	---