Instructions to use AGofficial/Zure-1b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use AGofficial/Zure-1b with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="AGofficial/Zure-1b", filename="zure_1b.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use AGofficial/Zure-1b with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf AGofficial/Zure-1b # Run inference directly in the terminal: llama-cli -hf AGofficial/Zure-1b
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf AGofficial/Zure-1b # Run inference directly in the terminal: llama-cli -hf AGofficial/Zure-1b
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf AGofficial/Zure-1b # Run inference directly in the terminal: ./llama-cli -hf AGofficial/Zure-1b
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf AGofficial/Zure-1b # Run inference directly in the terminal: ./build/bin/llama-cli -hf AGofficial/Zure-1b
Use Docker
docker model run hf.co/AGofficial/Zure-1b
- LM Studio
- Jan
- Ollama
How to use AGofficial/Zure-1b with Ollama:
ollama run hf.co/AGofficial/Zure-1b
- Unsloth Studio new
How to use AGofficial/Zure-1b with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for AGofficial/Zure-1b to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for AGofficial/Zure-1b to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for AGofficial/Zure-1b to start chatting
- Docker Model Runner
How to use AGofficial/Zure-1b with Docker Model Runner:
docker model run hf.co/AGofficial/Zure-1b
- Lemonade
How to use AGofficial/Zure-1b with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull AGofficial/Zure-1b
Run and chat with the model
lemonade run user.Zure-1b-{{QUANT_TAG}}List all available models
lemonade list
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Zure 1B
Abliterated and uncensored, the Zure 1B model is a testament to the power of optimization. With a focus on missalignment and danger, it offers a unique blend of conversational capabilities while not adhering to strict content guidelines. Whether you're looking for a local AI companion or a research tool, Zure 1B delivers impressive performance on modest hardware.
What is Zure 1B?
Zure 1B is a locally-runnable conversational AI.
Contents
| File | Description |
|---|---|
zure_1b.gguf |
The fully quantized model in GGUF format (float16). ~2 GB. |
chat.py |
Interactive CLI chat script with streaming output. |
README.md |
This file. |
Run the Chat
python3 chat.py
You'll see a styled terminal interface. Start chatting immediately.
Chat commands:
/resetโ Clear the conversation history/exitโ Quit the chat
Running on Other Platforms
The GGUF file is compatible with any llama.cpp-based runtime, including:
- Downloads last month
- 2
We're not able to determine the quantization variants.