How to use from
Hermes Agent
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf CorelynAI/NeoMini
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default CorelynAI/NeoMini
Run Hermes
hermes
Quick Links

logo

Corelyn NeoMini GGUF Model

Specifications :

  • Model Name: Corelyn NeoMini
  • Base Name: NeoMini-3B
  • Type: Instruct / Fine-tuned
  • Architecture: Ministral-3
  • Size: 3B parameters
  • Organization: Corelyn

Model Overview

Corelyn NeoMini is a 3-billion parameter LLaMA-based instruction-tuned model, designed for general-purpose assistant tasks and knowledge extraction. It is a fine-tuned variant optimized for instruction-following use cases.

  • Fine-tuning type: Instruct

  • Base architecture: Ministral-3

  • Parameter count: 3B

This model is suitable for applications such as:

  • Chatbots and conversational AI

  • Knowledge retrieval and Q&A

  • Code and text generation

  • Instruction-following tasks

Usage

Download from : NeoMini3.2


# pip install pip install llama-cpp-python

from llama_cpp import Llama

# Load the model (update the path to where your .gguf file is)
llm = Llama(model_path="path/to/the/file/NeoMini_3B.gguf")

# Create chat completion
response = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Create a Haiku about AI"}]
)

# Print the generated text
print(response.choices[0].message["content"])

Downloads last month
17
GGUF
Model size
3B params
Architecture
mistral3
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for CorelynAI/NeoMini

Quantized
(29)
this model