Mesh LLM

North-Mini-Code-1.0-UD-Q4_K_XL

Distributed GGUF inference package for Mesh LLM

Website GitHub Discord

GGUF layer package for running North-Mini-Code-1.0-UD-Q4_K_XL across a local Mesh LLM cluster.

This package is derived from unsloth/North-Mini-Code-1.0-GGUF and keeps the original GGUF distribution split into per-layer artifacts for distributed inference.

Highlights

Run locally Pool multiple machines OpenAI-compatible Package variant
Private inference on your hardware Split layers across peers Serve /v1/chat/completions locally UD-Q4_K_XL layer package

Model Overview

Property Value
Source model unsloth/North-Mini-Code-1.0-GGUF
Model id unsloth/North-Mini-Code-1.0-GGUF:UD-Q4_K_XL
Family North
Parameter scale not recorded
Quantization UD-Q4_K_XL
Layer count 49
Activation width 2048
Package size 18.4 GB
Source file North-Mini-Code-1.0-UD-Q4_K_XL.gguf
Package repo meshllm/North-Mini-Code-1.0-UD-Q4_K_XL-layers

Recommended Use

  • Local and private inference with Mesh LLM.
  • Multi-machine serving when the full GGUF is too large for one host.
  • OpenAI-compatible chat/completions workflows through Mesh LLM's local API.

For upstream architecture details, chat template guidance, sampling recommendations, license terms, and benchmark notes, see the source model card: unsloth/North-Mini-Code-1.0-GGUF.

Quickstart

# Run this on each machine that should contribute memory/compute.
mesh-llm serve --model "meshllm/North-Mini-Code-1.0-UD-Q4_K_XL-layers" --split
# Check the mesh and discover the OpenAI-compatible model name.
curl -s http://localhost:3131/api/status
curl -s http://localhost:3131/v1/models
# Send an OpenAI-compatible chat request.
curl -s http://localhost:3131/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "unsloth/North-Mini-Code-1.0-GGUF:UD-Q4_K_XL",
    "messages": [{"role": "user", "content": "Write a tiny hello-world function in Rust."}],
    "max_tokens": 128
  }'

Package Variant

Property Value
Format layer-package
Canonical source ref unsloth/North-Mini-Code-1.0-GGUF@main/North-Mini-Code-1.0-UD-Q4_K_XL.gguf
Source revision main
Source SHA-256 86ba9cf324a1cf53687872abdda3861e3304bfd25d1f606f92a6641017ecbc6e
Skippy ABI 0.1.25
Package manifest SHA-256 22a3a684efc1fe2e2560b7d72124b50910869b3591e5a1ce06fc1c79bd58531b

What Is Included

Artifact Path Contents SHA-256
Manifest model-package.json Package schema, source identity, checksums 22a3a684efc1fe2e2560b7d72124b50910869b3591e5a1ce06fc1c79bd58531b
Metadata shared/metadata.gguf 0 tensors, 9.9 MB af1ec415f84bb9662b7423b527776f6cf19c1e9052304581d33af4c77ee217d0
Embeddings shared/embeddings.gguf 1 tensors, 553.9 MB c259a1c768291b05b517aef227aebb09ab26eb92a400da5ec881a219ce09333e
Output head shared/output.gguf 1 tensors, 9.9 MB 734fb32e55296c4cbf9a6b890c0901cb46d6854fd62ca6ded43b3f4e6bf39e73
Transformer layers layers/layer-*.gguf 49 layer artifacts, 440 tensors, 17.9 GB see model-package.json

Validation

Generated by the Mesh LLM HF Jobs splitter from mesh-llm ref main. Each artifact is checksummed as it is written, uploaded to this repository, and removed from the job workspace before the next artifact is produced.

skippy-model-package write-package "/source/North-Mini-Code-1.0-UD-Q4_K_XL.gguf" --out-dir "/tmp/meshllm-layer-job-meshllm_North-Mini-Code-1.0-UD-Q4_K_XL-layers-199/package"

Links

Downloads last month
3,716
GGUF
Model size
37.8M params
Architecture
cohere2moe
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for meshllm/North-Mini-Code-1.0-UD-Q4_K_XL-layers

Quantized
(1)
this model