# Mysterious Coding Model

This repository contains a specialised AI model for agentic code generation and text-generation tasks. It is inspired by the GPT-OSS series (gpt-oss-20b and gpt-oss-120b) described in the corresponding paper, is built on the open-source Llama architecture, and has been fine-tuned for programming assistance, conversation, and multi-language support.

## Key Features

- **Open source**: released under the Apache-2.0 license.
- **Text and code generation**: supports code completion, bug fixing, refactoring, and documentation generation.
- **Efficient storage**: weights are stored in the safetensors format, which is safe to load and fast to read.
- **Multiple precisions**: includes base FP16 weights, 8-bit quantised variants, and MXFP4 (mixed-precision) variants.
- **vLLM friendly**: compatible with the vLLM high-throughput inference engine.
## Repository Structure

The repository follows a modular layout that separates base models, quantised variants, adapters, and datasets. See `README.md` or the `docs/` directory for a detailed explanation.

- **models/library=safetensors/base/**: the base CodeAI-7B model, split across three shards, together with its tokenizer.
- **models/library=safetensors/quantized/**: 4-bit and 8-bit quantised models, including AWQ variants.
- **models/library=safetensors/instruct/**: instruction-tuned models.
- **models/library=safetensors/specialized/**: models specialised for Python, web development, systems programming, and data science.
- **models/adapters/**: LoRA and other coding-specific adapters.
- **datasets/**: training, evaluation, and instruction-tuning datasets.
- **scripts/**: scripts for converting, validating, quantising, and merging safetensors models, training adapters, and evaluating code generation.
- **evaluation/**: evaluation tasks and benchmarks such as HumanEval and MBPP.
- **tools/**: utilities for code formatting, syntax validation, and profiling.
- **docs/**: guides and API references.
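The quantised variants above trade a little accuracy for much smaller files. As a conceptual illustration only (the repository's own variants use their own schemes, such as AWQ), symmetric 8-bit quantisation works roughly like this:

```python
import numpy as np

def quantize_int8(w: np.ndarray):
    """Map float weights onto int8 with a single per-tensor scale."""
    scale = float(np.abs(w).max()) / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float weights from the int8 codes."""
    return q.astype(np.float32) * scale

# Toy weight vector; real models quantise whole tensors this way.
w = np.array([0.5, -1.0, 0.25, 0.75], dtype=np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

print(np.max(np.abs(w - w_hat)))  # small reconstruction error, below the scale
```

Storing int8 codes plus one scale uses roughly a quarter of the memory of FP32 (half of FP16), at the cost of the small rounding error shown.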
## Intended Uses & Limitations

This model is intended for research and development of coding assistants. It can generate or complete code snippets, explain code, fix bugs, and assist with documentation. Users should review and test all generated code before use: the model may produce incorrect or insecure code, especially for complex tasks, and should not be relied on for safety-critical systems.
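As a minimal sketch of the kind of review step recommended above, the snippet below syntax-checks a candidate completion with the standard library before it is ever executed (the `generated` string stands in for model output; parsing is a syntax check only, not a safety or correctness check):

```python
import ast

# Stand-in for text produced by the model.
generated = (
    "def fibonacci(n):\n"
    "    return n if n < 2 else fibonacci(n - 1) + fibonacci(n - 2)\n"
)

def is_valid_python(source: str) -> bool:
    """Return True if the source parses as Python."""
    try:
        ast.parse(source)
        return True
    except SyntaxError:
        return False

print(is_valid_python(generated))       # True
print(is_valid_python("def broken(:"))  # False
```

A parse check like this catches truncated or malformed completions cheaply; behavioural testing and security review are still needed before running generated code.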
## How to Use

You can load the model with `transformers` or serve it with `vllm`. Below is an example using `transformers` (make sure `safetensors`, `transformers`, and `torch` are installed):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model and tokenizer from this repository
model = AutoModelForCausalLM.from_pretrained(
    "likhonhfai/mysterious-coding-model", torch_dtype="auto"
)
tokenizer = AutoTokenizer.from_pretrained("likhonhfai/mysterious-coding-model")

# Generate a completion for a short code prompt
prompt = "def fibonacci(n):"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

For faster inference on large models, you can use the [vLLM](https://github.com/vllm-project/vllm) engine.
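For serving over an OpenAI-compatible API, a typical vLLM invocation looks like the sketch below (it assumes `vllm` is installed and a suitable GPU is available; the flags are illustrative, not prescribed by this repository):

```shell
# Install vLLM, then launch an OpenAI-compatible server for this model.
pip install vllm
python -m vllm.entrypoints.openai.api_server \
    --model likhonhfai/mysterious-coding-model
```

Once running, the server accepts standard chat/completions requests on its configured port.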
## Citing

If you use this model in your research, please cite the arXiv preprint [2508.10925] and this repository.