Instructions to use logihertz/nyra-Master with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use logihertz/nyra-Master with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="logihertz/nyra-Master") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("logihertz/nyra-Master") model = AutoModelForCausalLM.from_pretrained("logihertz/nyra-Master") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use logihertz/nyra-Master with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "logihertz/nyra-Master" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "logihertz/nyra-Master", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/logihertz/nyra-Master
- SGLang
How to use logihertz/nyra-Master with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "logihertz/nyra-Master" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "logihertz/nyra-Master", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "logihertz/nyra-Master" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "logihertz/nyra-Master", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use logihertz/nyra-Master with Docker Model Runner:
docker model run hf.co/logihertz/nyra-Master
🌐 Nyra-Master: The Apex Orchestrator
Nyra-Master is the flagship unified model developed by Logihertz Systems OPC Pvt Ltd. As the pinnacle of the independent Nyra Project, this model serves as the "Apex Orchestrator." It seamlessly integrates the specialized capabilities of the entire Nyra suite: the rigid logic of Tier A, the expansive contextual creativity of Tier B, and the precise tool-execution of Tier C.
🛠 Model Specifications
- Developer: Logihertz Systems
- Lead Architect: Sameer Tawade
- Project Status: Independent Research
- Architecture: Optimized Llama-3-8B (Transformer-based Omni-Merge)
- Merge Methodology: Linear Merge (Optimized for multi-domain holistic reasoning)
- Language(s): English (Primary), Multi-language Code (Python, C++, JS, etc.)
🎯 Intended Use Cases
Nyra-Master is engineered for highly complex, multi-step workflows that require dynamic intent switching:
- Universal Orchestration: Acting as the primary router in multi-agent systems, dynamically shifting between creative, logical, and executable states.
- Complex Pipeline Reasoning: Handling prompts that require simultaneous math execution, creative explanation, and strict formatting.
- General Purpose Excellence: Serving as a standalone, highly capable assistant for developers, researchers, and enterprise environments.
📊 Evaluation & Benchmarking Matrix
This flagship model is currently undergoing rigorous evaluation across all major AI domains. Scores are marked as pending while the self-verified evaluation pipeline completes.
| Category | Benchmark | Metric | Score | Status |
|---|---|---|---|---|
| Holistic Reasoning | MMLU-Pro | 5-shot Accuracy | Pending | Eval in Progress |
| Multi-Turn Chat | MT-Bench | Average Score | Pending | Eval in Progress |
| Code Execution | HumanEval | Pass@1 | Pending | Eval in Progress |
| Instruction Strictness | IFEval | Prompt-level Strict | Pending | Eval in Progress |
| Graduate Logic | GPQA | 0-shot Accuracy | Pending | Eval in Progress |
| Advanced Math | MATH | 4-shot Chain-of-Thought | Pending | Eval in Progress |
💻 Implementation
To run Nyra-Master locally, ensure you have the latest transformers library installed. Due to its dense knowledge retention, we recommend running this model in float16 precision.
from transformers import AutoModelForCausalGeneration, AutoTokenizer
import torch
model_id = "logihertz/nyra-Master"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalGeneration.from_pretrained(
model_id,
torch_dtype=torch.float16,
device_map="auto"
)
prompt = "Explain quantum superposition. Then, write a Python script simulating a coin flip to represent the concept, and format the output as a JSON object."
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=1024)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
⚖️ Limitations & Ethical Considerations
Nyra-Master is released under the Llama 3 Community License. While designed to be an omni-capable orchestrator, combining logic, code, and creative capabilities can occasionally lead to complex hallucinations in highly ambiguous edge cases. Users should implement secondary validation systems for critical deployments.
- Downloads last month
- 24
Model tree for logihertz/nyra-Master
Base model
meta-llama/Meta-Llama-3-8B