How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf davideuler/NebulaNet-v2-4x7B-moe:
# Run inference directly in the terminal:
llama-cli -hf davideuler/NebulaNet-v2-4x7B-moe:
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf davideuler/NebulaNet-v2-4x7B-moe:
# Run inference directly in the terminal:
llama-cli -hf davideuler/NebulaNet-v2-4x7B-moe:
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf davideuler/NebulaNet-v2-4x7B-moe:
# Run inference directly in the terminal:
./llama-cli -hf davideuler/NebulaNet-v2-4x7B-moe:
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf davideuler/NebulaNet-v2-4x7B-moe:
# Run inference directly in the terminal:
./build/bin/llama-cli -hf davideuler/NebulaNet-v2-4x7B-moe:
Use Docker
docker model run hf.co/davideuler/NebulaNet-v2-4x7B-moe:
Quick Links

Usage

NebulaNet-v2: An MOE of 4 7b expert models. It is good at coding and multi language translation. It should be fluent at chat and math too.

The 4x7b merged model performs much better than the original Contextual_KTO_Mistral_PairRM on both coding and multilingual text generation in my observation.

mergekit config

base_model: ContextualAI/Contextual_KTO_Mistral_PairRM
experts:
  - source_model: ContextualAI/Contextual_KTO_Mistral_PairRM
    positive_prompts:
    - "chat"
    - "assistant"
    - "tell me"
    - "explain"
    - "I want"
  - source_model: Nexusflow/Starling-LM-7B-beta
    positive_prompts:
    - "code"
    - "python"
    - "javascript"
    - "programming"
    - "algorithm"
  - source_model: snorkelai/Snorkel-Mistral-PairRM-DPO
    positive_prompts:
    - ""
  - source_model: mlabonne/NeuralDaredevil-7B
    positive_prompts:
    - "reason"
    - "math"
    - "mathematics"
    - "solve"
    - "count"
Downloads last month
111
GGUF
Model size
24B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

2-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for davideuler/NebulaNet-v2-4x7B-moe

Quantized
(3)
this model