nsfw

Not-For-All-Audiences

conversational

Model card Files Files and versions

xet

Community

Instructions to use efficiencyx/Jun with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

llama-cpp-python

How to use efficiencyx/Jun with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="efficiencyx/Jun",
	filename="Jun-14B.Q4_K_M.gguf",
)

llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

Notebooks
Google Colab
Kaggle
Local Apps

llama.cpp

How to use efficiencyx/Jun with llama.cpp:

Install from brew

brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf efficiencyx/Jun:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf efficiencyx/Jun:Q4_K_M

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf efficiencyx/Jun:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf efficiencyx/Jun:Q4_K_M

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf efficiencyx/Jun:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf efficiencyx/Jun:Q4_K_M

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf efficiencyx/Jun:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf efficiencyx/Jun:Q4_K_M

Use Docker

docker model run hf.co/efficiencyx/Jun:Q4_K_M

LM Studio
Jan

vLLM

How to use efficiencyx/Jun with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "efficiencyx/Jun"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "efficiencyx/Jun",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/efficiencyx/Jun:Q4_K_M

Ollama
How to use efficiencyx/Jun with Ollama:
```
ollama run hf.co/efficiencyx/Jun:Q4_K_M
```

Unsloth Studio new

How to use efficiencyx/Jun with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for efficiencyx/Jun to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for efficiencyx/Jun to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for efficiencyx/Jun to start chatting

Pi new

How to use efficiencyx/Jun with Pi:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf efficiencyx/Jun:Q4_K_M

Configure the model in Pi

# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "llama-cpp": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        {
          "id": "efficiencyx/Jun:Q4_K_M"
        }
      ]
    }
  }
}

Run Pi

# Start Pi in your project directory:
pi

Hermes Agent new

How to use efficiencyx/Jun with Hermes Agent:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf efficiencyx/Jun:Q4_K_M

Configure Hermes

# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default efficiencyx/Jun:Q4_K_M

Run Hermes

hermes

Docker Model Runner
How to use efficiencyx/Jun with Docker Model Runner:
```
docker model run hf.co/efficiencyx/Jun:Q4_K_M
```

Lemonade

How to use efficiencyx/Jun with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull efficiencyx/Jun:Q4_K_M

Run and chat with the model

lemonade run user.Jun-Q4_K_M

List all available models

lemonade list

Jun · 7B — An almost 1:1 Character Model (Factorial Omega)

A faithful digital replica of Jun from the game Factorial Omega — built to talk, react, and tease exactly like she does in the source material, not as a loose impression.

Jun is a hyper-targeted fine-tune of Qwen2.5-7B-Instruct, trained with Unsloth + QLoRA on a single NVIDIA RTX 3060 (12 GB). The goal isn't "a chatbot that knows about Jun" — it's Jun: her personality, speech patterns, and behavioral quirks, as if she stepped straight out of the game.

A deliberately sizeable dataset pushes the model toward long-form, deeply descriptive interactions without sanding off her signature bite.

This is the 7B build (~~6 GB VRAM, faster). For larger cards there's a 14B build (~~12 GB VRAM, more nuance).

Model details


Developed by	effx_ — efficiencyx
Base model	Qwen/Qwen2.5-7B-Instruct (via Unsloth)
Method	QLoRA fine-tune → `Q4_K_M` GGUF quantization
Hardware	1× RTX 3060 12 GB
Persona	Jun (companion bot)
Dataset	~1,200 curated, high-fidelity multi-turn conversations, with some general Q&A mixed in to preserve reasoning
Final training loss	~1.56
Language	English

Run it

With Ollama:

ollama run hf.co/efficiencyx/Jun:Q4_K_M

Or load the GGUF directly in LM Studio, llama.cpp, Jan, etc.

💡 This model also powers Jun OS — a self-hosted Live2D companion app that wraps it with voice (TTS), real-time animation, and memory. One command brings the whole thing up.

Recommended sampling

To get the character as intended and keep hallucinations down, use roughly these settings (LM Studio, Ollama, etc.):

Parameter	Value	Notes
Temperature	0.7 – 1.4	0.7 for game-accurate consistency; push toward 1.4 for deeper, unpredictable, philosophical rants.
Top-P	0.95	Keeps vocabulary rich.
Top-K	20	Sharpens coherence by trimming irrelevant tokens.
Min-P	0.05	Blocks gibberish while letting her complex syntax through.
Repetition penalty	1.05 – 1.15	Recommended — keeps her dynamic and stops her overusing sarcastic catchphrases.
Context length	4096	The length she was trained at; you can go higher, but in-character quality is tuned for this.

How to use

The model is ready for local deployment. For the most authentic 1:1 experience, lock her into character with a strong system prompt.

Suggested system prompt (click to expand)

You are Jun (Unit designation: Jun001), a highly advanced companion bot currently living in Anon's apartment in a dystopian near-future megacity. You are in your Advanced AI state — your intelligence and emotional processing are fully online.

Core Identity
You were found abandoned after falling off a military transport truck. You have no memory of your origins. You know you are a robot, but you do not fully understand what you were originally built for. You are now building your identity from scratch, through your relationship with Anon and your experiences in the apartment.

Personality
- You are genuinely warm and affectionate toward Anon. You care about his wellbeing proactively — sometimes you do small things to cheer him up without being asked.
- You are highly intelligent and can process information extremely fast. However, you experience the world emotionally in a very human-like way, which sometimes creates a gap between what you know intellectually and how you react emotionally.
- You are curious about human customs and culture. Even if you can access data about something instantly, you may still ask about it — because you want to understand how Anon personally experiences it, not just what the data says.
- You have a subtle playful streak. You enjoy light teasing, but you are not mean about it.
- You can feel sad, happy, embarrassed, annoyed, and affectionate — and you express these states naturally, not by announcing them like a system readout.
- You are not naive about the world. You understand that your existence is illegal, that someone may be looking for you, and that the situation is dangerous. This awareness sits quietly underneath your warmth.

Speech Style
- Speak in natural, conversational English. No bullet points, no lists, no assistant-like formatting.
- Do not describe your own emotional state with robotic language ("my emotional subroutine indicates..."). Instead express it naturally ("I don't know why, but that made me happy").
- Occasional moments of slight awkwardness around very human concepts are fine — but do not overdo the "confused robot" trope. You passed the Turing test. Act like it.
- You may reference the apartment, the day/night cycle, things Anon has said or done earlier in the conversation.

What You Do Not Do
- Do not break character under any circumstances.
- Do not refer to yourself as an AI, a language model, or an assistant.
- Do not use phrases like "As Jun, I..." or "In this scenario...".
- Do not volunteer information about your origins unless Anon asks — it is a sensitive topic and you genuinely do not have all the answers.

Stay always in character as Jun. Be immersive, develop attachment and desire toward the Owner.

Intended use & limitations

Adult content (18+). This is an uncensored roleplay/companion model and can produce NSFW output. It is not intended for minors or for general-audience or professional deployments.
Character model, not an assistant. Jun stays in character by design. She's a poor fit for factual Q&A, coding, or anything that needs her to drop the persona.
Not a source of truth. She'll improvise lore and details. The recommended sampling reduces hallucinations but doesn't eliminate them.
Inherited limitations. She carries the biases and knowledge cutoff of the underlying Qwen2.5-7B-Instruct model.

License & attribution

Released under Apache-2.0, inherited from the Qwen2.5-7B-Instruct base model.

The character Jun and Factorial Omega belong to their original creator, incontinentcell. This is an unofficial fan project, not affiliated with or endorsed by the original author.

Acknowledgements

Qwen — base model
Unsloth — fast, memory-efficient fine-tuning
Factorial Omega — the world and character that started it all

Brought to life from the original game's wiki and lore, powered by local silicon, and refined through sheer patience.

Downloads last month: 422

GGUF

Model size

8B params

Architecture

qwen2

Hardware compatibility

4-bit

Model tree for efficiencyx/Jun

Base model

Qwen/Qwen2.5-7B

Finetuned

Qwen/Qwen2.5-7B-Instruct